Download IATE and the IATE extraction tool IATExtract

IATE is a living database: translators and terminologists continuously update its content. Using the IATE search interface (http://iate.europa.eu/) therefore ensures that you are accessing the most complete and up-to-date data. However, to cater for specific needs such as linguistic research, you can also download a copy of some of the data contained in IATE.

The distribution consists of a zip file named IATE_download.zip.
An extraction tool named IATExtract is made available on this site to help users create subsets of the IATE download file, using a number of possible filtering criteria. Users can extract data for one or more specific languages, or for a given domain or domain cluster. The subsets created by IATExtract are provided in the same TermBase eXchange (TBX) format as the uncompressed IATE download file. For further details see: TBXcoreStructV02.dtd, TBXXCS.xcs, tbxxcsdtd.dtd.
The uncompressed file is about 2.1 gigabytes; the compressed (downloaded) file is around 120 megabytes.
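
Because the uncompressed TBX file is around 2.1 gigabytes, loading it into memory in one go may be impractical. The following Python sketch streams the data straight from the zip archive and counts terms per language. It is an illustration only: it assumes the standard TBX element names termEntry, langSet and term without an XML namespace, a language code in the xml:lang attribute, and a single TBX member inside the archive. The function name count_terms_per_language is hypothetical and not part of the IATE distribution.

```python
import zipfile
import xml.etree.ElementTree as ET
from collections import Counter

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def count_terms_per_language(zip_source):
    """Stream a TBX file stored in a zip archive and count <term> elements per language.

    zip_source may be a path or a binary file-like object.
    """
    counts = Counter()
    with zipfile.ZipFile(zip_source) as archive:
        names = archive.namelist()
        # Assumption: prefer a member ending in .tbx, else fall back to the first member.
        member = next((n for n in names if n.lower().endswith(".tbx")), names[0])
        with archive.open(member) as stream:
            # iterparse yields each element once its closing tag has been read,
            # so the whole document is never held in memory at once.
            for _, elem in ET.iterparse(stream, events=("end",)):
                if elem.tag != "termEntry":
                    continue
                for lang_set in elem.iter("langSet"):
                    lang = lang_set.get(XML_LANG, "unknown")
                    counts[lang] += sum(1 for _ in lang_set.iter("term"))
                # Drop the children of the finished entry to keep memory use modest.
                elem.clear()
    return counts
```

A call such as count_terms_per_language("IATE_download.zip") would return a Counter keyed by language code, provided the file follows the assumptions stated above.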

For information on the data structure and the data categories included in the download file, please see: IATE Data fields explained

You can download the file by clicking on the link below.

IATE_download.zip   (Publication date: 27/07/2016)

There is no need to unzip the IATE download file, as the extraction tool IATExtract accesses the data in the zip file directly.

Users also need to download the extraction tool IATExtract and copy it into a suitable directory on their computer. IATExtract is distributed as a Java jar file and runs with a graphical user interface on any operating system that supports a Java runtime, version 1.7 or newer.
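
To check whether an installed Java runtime meets the 1.7 minimum, the version banner printed by `java -version` can be inspected. The Python sketch below is one possible way to do this. The helper names are hypothetical, and the parsing assumes the usual banner shapes, such as `java version "1.8.0_281"` for older releases and `openjdk version "11.0.2"` for newer ones.

```python
import re
import subprocess


def parse_java_version(version_output):
    """Return (major, minor) parsed from a `java -version` banner, or None if absent."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2) or 0)


def meets_minimum(version_output, minimum=(1, 7)):
    """True if the banner reports a version at or above `minimum`."""
    parsed = parse_java_version(version_output)
    # Tuple comparison handles both "1.x" style and "9+" style version numbers.
    return parsed is not None and parsed >= minimum


def read_java_version_output():
    """Run `java -version`; the JVM prints its version banner to stderr."""
    try:
        result = subprocess.run(["java", "-version"], capture_output=True, text=True)
    except FileNotFoundError:
        return ""  # no `java` executable found on PATH
    return result.stderr
```

Calling meets_minimum(read_java_version_output()) returns False when no Java runtime is found or when the reported version is older than 1.7.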

 

How to produce subsets of the IATE download file

Users can extract subsets of data as follows, using the extraction tool IATExtract.

[Screenshot: IATExtract tool interface]

Statistics

The download file contains 1.3 million entries and about 8 million terms in 24 official EU languages.

Language        Number of terms
Bulgarian                 37598
Czech                     34930
Danish                   560859
German                   957833
Greek                    497978
English                 1266969
Spanish                  555570
Estonian                  43383
Finnish                  308333
French                  1211363
Irish                     64566
Croatian                  15689
Hungarian                 39738
Italian                  632678
Lithuanian                45797
Latvian                   36874
Maltese                   49819
Dutch                    633792
Polish                    65742
Portuguese               473576
Romanian                  45488
Slovak                    43151
Slovenian                 49716
Swedish                  287500
Latin                     61128
Multilingual               4930
All                     8025000
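
As a quick consistency check on the figures listed above, the per-language counts can be summed and compared with the stated total. The Python sketch below copies the numbers from this page; the variable and function names are illustrative only.

```python
# Term counts per language, copied from the statistics on this page.
TERMS_PER_LANGUAGE = {
    "Bulgarian": 37598, "Czech": 34930, "Danish": 560859, "German": 957833,
    "Greek": 497978, "English": 1266969, "Spanish": 555570, "Estonian": 43383,
    "Finnish": 308333, "French": 1211363, "Irish": 64566, "Croatian": 15689,
    "Hungarian": 39738, "Italian": 632678, "Lithuanian": 45797, "Latvian": 36874,
    "Maltese": 49819, "Dutch": 633792, "Polish": 65742, "Portuguese": 473576,
    "Romanian": 45488, "Slovak": 43151, "Slovenian": 49716, "Swedish": 287500,
    "Latin": 61128, "Multilingual": 4930,
}
STATED_TOTAL = 8025000

# Latin and "Multilingual" appear in the list but are not official EU languages.
NON_OFFICIAL = {"Latin", "Multilingual"}


def summarize(counts):
    """Return the total number of terms and the number of official-language rows."""
    total = sum(counts.values())
    official = [name for name in counts if name not in NON_OFFICIAL]
    return total, len(official)


total, official_count = summarize(TERMS_PER_LANGUAGE)
print(total == STATED_TOTAL, official_count)
```

With these figures the sum equals the stated total of 8025000, and 24 rows remain once Latin and Multilingual are set aside, which agrees with the 24 official EU languages mentioned above.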

 

 

Conditions for use

You are allowed to reproduce the data provided on this page for your personal needs, to distribute it for non-commercial and commercial purposes, and to make and distribute derivative works, provided the source is acknowledged as follows:

Download IATE, European Union, [year].

The software necessary for exploitation or extraction (IATExtract) is distributed with the export file. The Translation Centre for the Bodies of the European Union makes this extraction tool available under the EUPL licence.

You are not allowed to reproduce or distribute the Download IATE page or the IATE logo without prior permission.

Contact

For more information on the IATE download, please contact iate@cdt.europa.eu