-
Donér din stemme taledatasæt
I Doner din stemme indsamles danske taledata. Indsamlingen foregår på donerdinstemme.dk og sker igennem frivillige donationer fra borgere. Formålet er at skabe et åbent... -
Danish Semantic Reasoning Benchmark
A Danish semantic reasoning benchmark compiled from lexical semantic resources This benchmark is the first version of a semantic reasoning benchmark for Danish compiled semi-... -
Framenet DK
Ordbog (et Frame-leksikon) med verbers og verbalsubstantivers semantiske rammer (Frames) ud fra standarden Berkeley FrameNet https://framenet.icsi.berkeley.edu/fndrupal/ (der... -
Medical word embedding eval
In natural language processing, benchmarks are used to track progress and identify useful models. Currently, no benchmark for Danish clinical word embeddings exists. This paper... -
The Norwegian Colossal Corpus
"The Norwegian Colossal Corpus (NCC) is a collection of multiple smaller Norwegian corpuses suitable for training large language models. We have done extensive cleaning on the... -
Bilingual English-Danish parallel corpus from VisitDenmark - The official...
Contents of https://www.visitdenmark.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the... -
Bilingual English-Danish parallel corpus from Visit Vejle website
Contents of https://www.visitvejle.com were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the... -
Bilingual English-Danish parallel corpus from The Viking Ship Museum website
Contents of https://www.vikingeskibsmuseet.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. Contains 12403 translation units (EN-... -
Bilingual English-Danish parallel corpus from The Geological Survey of...
Contents of http://www.geus.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of... -
Bilingual English-Danish parallel corpus from The Danish Nature Agency website
Contents of https://naturstyrelsen.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the... -
Bilingual English-Danish parallel corpus from The Danish Medicines Agency
Contents of https://laegemiddelstyrelsen.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. Contains 22699 translation units between... -
Bilingual English-Danish parallel corpus from The Danish Gambling Authority website
Contents of https://spillemyndigheden.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the... -
Bilingual English-Danish parallel corpus from The Danish Environmental...
Contents of https://eng.mst.dk/ and https://mst.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created... -
Bilingual English-Danish parallel corpus from The Agency for Culture and...
Contents of https://slks.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the... -
Bilingual English-Danish parallel corpus from Odense Municipality website
Contents of https://www.odense.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework... -
Bilingual English-Danish parallel corpus from National Museum of Denmark website
Contents of https://natmus.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of... -
Bilingual English-Danish parallel corpus from Danmarks Statistik website
Contents of https://www.dst.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of... -
Bilingual English-Danish parallel corpus from Danish Ministry of Foreign...
Contents of http://um.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of the... -
Bilingual English-Danish parallel corpus from Danish Ministry of Finance website
Contents of https://uk.fm.dk/ were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of... -
Bilingual English-Danish parallel corpus from Danish Maritime Authority website
Contents of https://www.dma.dk were crawled, aligned on document and sentence level and converted into a parallel corpus. This dataset has been created within the framework of...
Du kan også tilgå dette register med API (se API-dokumenter).