You are using an out of date browser. It may not display this or other websites correctly. You should upgrade or use an alternative browser.
multilingual datasets
About this tag
The multilingual datasets tag covers discussions about Microsoft's efforts to expand language resources for AI training, particularly in Europe. Recent content highlights Microsoft's Europe AI Language Initiative, which aims to enrich multilingual datasets for large language model (LLM) training and preserve cultural heritage through digital tools. This initiative is part of Microsoft's broader European Digital Commitments, focusing on linguistic diversity and AI development. The tag is relevant for users interested in how Microsoft is advancing multilingual AI, language model training data, and the intersection of technology with cultural preservation.
Microsoft’s latest initiative in Europe marks a watershed moment for the region’s digital and linguistic landscape, as the tech giant broadens its “European Digital Commitments” from mere policy to decisive action. In a move poised to reshape artificial intelligence (AI) language resources...
ai
ai collaboration
ai development
cultural digitization
cultural heritage
data enrichment
digital inclusion
digital sovereignty
european digital policy
european innovation
european languages
language diversity
language preservation
low-resource languages
microsoft
microsoft azure
multilingualdatasets
natural language ai
open data