Download IATE and the IATE extraction tool IATExtract

IATE is a living database, i.e. translators and terminologists are continuously updating its content. Using the IATE search interface (http://iate.europa.eu/) thus ensures that you are accessing the most complete and up-to-date data. However, in order to cater for specific needs, e.g. for linguistic research, you can also download a copy of some of the data contained in IATE. 

The distribution consists of a single zip file named IATE_download.zip.
An extraction tool named IATExtract is available on this site to help users create subsets of the IATE download file using a number of filtering criteria. Users can extract data for one or more specific languages, or for a given domain or domain cluster. The subsets created by IATExtract are provided in the same TermBase eXchange (TBX) format as the uncompressed IATE download file. For further details see: TBXcoreStructV02.dtd, TBXXCS.xcs, tbxxcsdtd.dtd.
The uncompressed file is about 2.1 gigabytes; the compressed (downloaded) file is around 120 megabytes.

For information on the data structure and the data categories included in the download file, please see: IATE Data fields explained

You can download the file by clicking on the link below.

IATE_download.zip (Publication date: 18/08/2017)

There is no need to unzip the IATE download file as the extraction tool IATExtract will access the data in the zip file directly.
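For users who want to inspect the data outside IATExtract, the same idea (reading the TBX data straight from the zip archive without unpacking it) can be sketched in Python. This is an illustrative sketch only, not how IATExtract itself is implemented. It assumes that the archive contains a single TBX member and that the file uses the standard TBX element names termEntry, langSet (with an xml:lang attribute) and term without an XML namespace; the function name count_terms_per_language is hypothetical. The demo at the end runs on a tiny in-memory archive rather than on the real download.

```python
import io
import zipfile
import xml.etree.ElementTree as ET
from collections import Counter

# ElementTree exposes the xml:lang attribute under the XML namespace URI.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def count_terms_per_language(zip_source):
    """Count <term> elements per language, streaming from the zip archive.

    zip_source may be a file path or a binary file-like object.
    Assumption: the first member of the archive is the TBX file.
    """
    counts = Counter()
    with zipfile.ZipFile(zip_source) as archive:
        member = archive.namelist()[0]
        with archive.open(member) as stream:
            # iterparse reads incrementally, so the multi-gigabyte
            # uncompressed file never has to sit in memory at once.
            for _event, elem in ET.iterparse(stream, events=("end",)):
                if elem.tag == "termEntry":
                    for lang_set in elem.iter("langSet"):
                        lang = lang_set.get(XML_LANG, "unknown")
                        counts[lang] += sum(1 for _ in lang_set.iter("term"))
                    elem.clear()  # release the entry once it is counted
    return counts


# Demo on a made-up, in-memory archive (not real IATE data).
sample_tbx = (
    b'<?xml version="1.0" encoding="UTF-8"?>'
    b'<martif type="TBX"><text><body>'
    b'<termEntry id="demo-1">'
    b'<langSet xml:lang="en"><tig><term>alpha</term></tig>'
    b'<tig><term>beta</term></tig></langSet>'
    b'<langSet xml:lang="fr"><tig><term>gamma</term></tig></langSet>'
    b'</termEntry>'
    b'</body></text></martif>'
)
demo_buffer = io.BytesIO()
with zipfile.ZipFile(demo_buffer, "w") as demo_zip:
    demo_zip.writestr("demo.tbx", sample_tbx)
demo_buffer.seek(0)

demo_counts = count_terms_per_language(demo_buffer)
print(dict(demo_counts))
```

To try it on the real file, pass the path of the downloaded archive, for example count_terms_per_language("IATE_download.zip"). The element names should be checked against the DTD files listed above before relying on the counts.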

Users also need to download the extraction tool IATExtract and copy it into a suitable directory on their computer. IATExtract is distributed as a Java jar file. It runs with a graphical user interface on any operating system that supports the Java Runtime Environment, version 1.7 or newer.

 

How to produce subsets of the IATE download file

Users can extract subsets of data as follows, using the extraction tool IATExtract.

[Figure: IATExtract tool interface]

Statistics

The download file contains about 1.3 million entries and about 8 million terms in the 24 official EU languages, plus Latin and multilingual terms.

Language        Number of terms
Bulgarian                 41321
Czech                     38845
Danish                   573267
German                   944297
Greek                    492120
English                 1305010
Spanish                  582572
Estonian                  47545
Finnish                  325975
French                  1231278
Irish                     68990
Croatian                  19063
Hungarian                 45090
Italian                  658604
Lithuanian                48848
Latvian                   42413
Maltese                   55499
Dutch                    657496
Polish                    72413
Portuguese               484414
Romanian                  49942
Slovak                    45593
Slovenian                 53290
Swedish                  304231
Latin                     62602
Multilingual               5540
All                     8256258
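
As a quick consistency check, the per-language figures above can be summed and compared with the reported total. The short Python sketch below does only that; the variable names are illustrative, and the numbers are copied from the statistics on this page.

```python
# Term counts per language, copied from the statistics above.
TERMS_PER_LANGUAGE = {
    "Bulgarian": 41321, "Czech": 38845, "Danish": 573267,
    "German": 944297, "Greek": 492120, "English": 1305010,
    "Spanish": 582572, "Estonian": 47545, "Finnish": 325975,
    "French": 1231278, "Irish": 68990, "Croatian": 19063,
    "Hungarian": 45090, "Italian": 658604, "Lithuanian": 48848,
    "Latvian": 42413, "Maltese": 55499, "Dutch": 657496,
    "Polish": 72413, "Portuguese": 484414, "Romanian": 49942,
    "Slovak": 45593, "Slovenian": 53290, "Swedish": 304231,
    "Latin": 62602, "Multilingual": 5540,
}
REPORTED_TOTAL = 8256258  # the "All" row

# Rows that are not official EU languages.
NON_OFFICIAL = {"Latin", "Multilingual"}

computed_total = sum(TERMS_PER_LANGUAGE.values())
official_total = sum(
    count for language, count in TERMS_PER_LANGUAGE.items()
    if language not in NON_OFFICIAL
)
official_language_count = len(TERMS_PER_LANGUAGE) - len(NON_OFFICIAL)

print("Sum matches reported total:", computed_total == REPORTED_TOTAL)
print("Official EU languages:", official_language_count)
print("Terms in official EU languages:", official_total)
```

The sum of the individual rows equals the "All" figure, and the 24 official EU languages account for all but the Latin and multilingual rows.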

 

 

Conditions for use

You are allowed to reproduce the data provided on this page for your personal needs, to distribute it for non-commercial and commercial purposes, and to make and distribute derivative works, provided the source is acknowledged as follows:

Download IATE, European Union, [year].

The software necessary for exploitation or extraction (IATExtract) is distributed with the export file. The Translation Centre for the Bodies of the European Union makes this extraction tool available under the EUPL licence.

You are not allowed to reproduce or distribute the Download IATE page or the IATE logo without prior permission.

Contact

For more information on the IATE download, please contact iate@cdt.europa.eu