The Common Language Resources and Technology Infrastructure (CLARIN) is a digital infrastructure which provides easy and sustainable access to a broad range of language data and tools to support research in the humanities and social sciences, and beyond. CLARIN provides access to digital language data in all modalities (text, audio, video) and advanced tools which can be used to analyse or combine these datasets.
Founded in 2012, CLARIN is a European Research Infrastructure Consortium (ERIC), a form of international legal entity established by the European Commission in 2009. In 2016, CLARIN received the status of a Landmark on the ESFRI roadmap. CLARIN is a distributed digital infrastructure, with participating centres all over Europe and further afield, including universities, research centres, libraries and public archives. Tools and data from different centres are interoperable, so that data collections can be combined and tools from different sources can be chained to perform operations at different levels of complexity. Members can access all tools and resources with a single sign-on, and many of the resources are also open access for other interested communities of use, both within and outside academia.
Promoting data registries and data management services that comply with the FAIR principles (Findable, Accessible, Interoperable, Reusable) underpins all aspects of CLARIN’s strategy, and the interoperability paradigm central to what is now known as the Open Science agenda has been one of CLARIN’s distinguishing features from the outset. The interoperability of data and services across the CLARIN community has enabled large-scale data sharing and growing reuse of language resources. Interoperability has also proven crucial for the increased support of multidisciplinary collaboration and comparative research agendas. It is CLARIN’s ambition to consolidate its role in supporting the emerging research agendas for the social sciences and humanities (SSH) domain and to contribute to the innovation potential of advanced models for interaction between people, data, and tools for data processing. The vision of borderless and seamless interoperability between data and services is further realised through CLARIN’s alignment with emerging cloud platforms such as the European Open Science Cloud (EOSC) and the SSH Open Marketplace.
CLARIN’s core community consists of academic researchers, developers and lecturers from a range of disciplines within the social sciences and humanities who work with language data, language resources, technology, and knowledge. CLARIN also cooperates with a variety of stakeholders from outside academia, including industry, governmental organisations, and the GLAM sector (Galleries, Libraries, Archives, and Museums), who act as contributors as well as users of data, tools and know-how. Collaboration with non-academic parties is forged at both the national and the central level.