digitalWorld

ALIA: the “public and open” AI promoted by the State

The first ALIA language models. The aim of the ALIA is to promote the development of artificial intelligence (AI) by making resources available to everyone in Spanish and co-official languages ​​(Catalan and Valencian, Basque and Galician).

The idea is that individual users and companies can use these resources to carry out research or develop their own AI products, although this technology will also land in some public bodies. In fact, the activation of ALIA is accompanied by the launch of two pilot projects: an internal chatbot that promises to streamline the work of the Tax Agency, and a solution aimed at primary care medicine that will allow "an early and more precise diagnosis of heart failure."

ALIA is now available to everyone, verified by the Spanish Agency for the Supervision of Artificial Intelligence (AESIA). In the case of language models, these have been trained using part of the infrastructure of the Barcelona Supercomputing Center, in Specifically, the MareNostrum 5 supercomputer, a key piece for Spain's scientific ambitions, has been in operation since 2023 and has cost more than 200 million euros.

The available models are as follows:

  • ALIA-40B: large language model trained with 40 billion parameters, trained from scratch with 9.2 trillion tokens. It understands 36 languages.
  • Salamandra-7b: large language model trained with 7 billion parameters, trained from scratch with 7.8 trillion tokens. It understands 36 languages.
  • Salamandra-7b-instruct: large language model trained with 276 thousand instructions in English, Spanish and Catalan collected from various open corpora.
  • Salamandra-2b: large language model with 2 billion parameters, trained from scratch with 7.8 trillion tokens. It understands 35 languages.
  • Salamandra-2b-instruct: large language model with 276 thousand instructions in English, Spanish and Catalan collected from several open corpora.

A variety of sources have been used. Data from Common Crawl, GitHub, Wikimedia (Wikimedia, including Wikipedia, Wikilibros, Wikinoticias, Wikiquote, Wikisource and Wikivoyag), EurLex, among others.

Multimedia

NAVARRE GLOBAL SCIENCE

Learn about the science that is done in Navarre, SINAI

Daisy Wang, Representing the World Digital Economy Forum in Europe

Smart & Green Fundazioa Summer Courses from UPV/EHU

Disruption in applied sustainability to change the world through education, technology and the city. FROM EMOTION TO DIGITAL TRANSFORMATION

What are you waiting for? Sign up

https://www.uik.eus/es/curso/smart-green-disrupzion-jasangarria-apena-mundua zehar-hezkuntza-teknologia

Jorge Toledo. EU Ambassador to China

Europe Day Celebration in China

BUSINESS CARD

Blockchain Conference La Rioja

Montse Guardia Güel, Eduardo Aginako, Luis Garvía and Javier Sánchez Marcos

Facebook or the big challenges

"You promised me colonies on Mars; instead, I received Facebook."
More news

We use our own and third party cookies to improve your browsing experience.
By continuing to browse we understand that you have accepted our cookies policy .