Download IATE and the IATE extraction tool IATExtract

IATE is a living database, i.e. translators and terminologists are continuously updating its content. Using the IATE search interface (http://iate.europa.eu/) thus ensures that you are accessing the most complete and up-to-date data. However, in order to cater for specific needs, e.g. for linguistic research, you can also download a copy of some of the data contained in IATE. 

The distribution consists of a zip file with the following name: IATE_download.zip .
An extraction tool named IATExtract is made available on this site in order to help users create subsets of the IATE download file, using a number of possible filtering criteria. Users can extract data for one or more specific languages, for a given domain or domain cluster. The subsets created by the extraction tool IATExtract are provided in the same
TermBase eXchange (TBX) format as the uncompressed IATE download file. For further details see: TBXcoreStructV02.dtd, TBXXCS.xcs, tbxxcsdtd.dtd.
The size of the uncompressed file is about 2.1 gigabytes and the compressed (downloaded) file around 120 megabytes.

For information on the data structure and the data categories included in the download file, please see: IATE Data fields explained

You can download the file by clicking on the link below.

IATE_download.zip   (Publication date: 11/11/2016)

There is no need to unzip the IATE download file as the extraction tool IATExtract will access the data in the zip file directly.

Users also need to download the extraction tool IATExtract and copy it into a suitable directory on their computer. The tool IATExtract is distributed as a Java jar file. It can run with a graphical user interface on any operating system supporting the Java runtime of version 1.7 or newer.

 

How to produce subsets of the IATE download file

Users can extract subsets of data as follows, using the extraction tool IATExtract.

IATE Extract Tool interface

Statistics

The download file contains 1.3 million entries, 8 million terms in 24 official EU languages.

Language

Number of terms

Bulgarian

           38073  

Czech

           355569  

Danish

           556443  

German

           946909  

Greek

           489284  

English

           1263117  

Spanish

           554738  

Estonian

           44934  

Finnish

           308286  

French

           1209090  

Irish

           65197  

Croatian

           16627  

Hungarian

           41263  

Italian

           632285  

Lithuanian

           46430  

Latvian

           38783  

Maltese

           51038  

Dutch

           627498  

Polish

           67052  

Portuguese

           472341  

Romanian

           47053  

Slovak

           44368  

Slovenian

           51466  

Swedish

           286272  

Latin

           61008  

Multilinugal

           4944  

 All

      8320068  

 

 

Conditions for use

You are allowed to reproduce the data provided on this page for your personal needs, to distribute it for non-commercial and commercial purposes, and to make and distribute derivative works, provided the source is acknowledged as follows:

Download IATE, European Union, [year].

The software necessary for exploitation or extraction (IATExtract) is distributed with the export file. The Translation Centre for the Bodies of the European Union makes this extraction tool available under the EUPL licence.

You are not allowed to reproduce or distribute the Download IATE page or the IATE logo without prior permission.

Contact

For more information on the IATE download, please contact iate@cdt.europa.eu