Download IATE and the IATE extraction tool IATExtract

IATE is a living database, i.e. translators and terminologists are continuously updating its content. Using the IATE search interface (http://iate.europa.eu/) thus ensures that you are accessing the most complete and up-to-date data. However, in order to cater for specific needs, e.g. for linguistic research, you can also download a copy of some of the data contained in IATE. 

The distribution consists of a zip file with the following name: IATE_download_30032016.zip .
An extraction tool named IATExtract is made available on this site in order to help users create subsets of the IATE download file, using a number of possible filtering criteria. Users can extract data for one or more specific languages, for a given domain or domain cluster. The subsets created by the extraction tool IATExtract are provided in the same
TermBase eXchange (TBX) format as the uncompressed IATE download file. For further details see: TBXcoreStructV02.dtd, TBXXCS.xcs, tbxxcsdtd.dtd.
The size of the uncompressed file is about 2.1 gigabytes and the compressed (downloaded) file around 120 megabytes.

For information on the data structure and the data categories included in the download file, please see: IATE Data fields explained

You can download the file by clicking on the link below.

IATE_download_30032016.zip   (Publication date: 30/03/2016)

There is no need to unzip the IATE download file as the extraction tool IATExtract will access the data in the zip file directly.

Users also need to download the extraction tool IATExtract and copy it into a suitable directory on their computer. The tool IATExtract is distributed as a Java jar file. It can run with a graphical user interface on any operating system supporting the Java runtime of version 1.7 or newer.

 

How to produce subsets of the IATE download file

Users can extract subsets of data as follows, using the extraction tool IATExtract.

IATE Extract Tool interface

Statistics

The download file contains 1.3 million entries, 8 million terms in 24 official EU languages.

Language

Number of terms

Bulgarian

           36382  

Czech

           33366  

Danish

           560717  

German

           958064  

Greek

           497675  

English

           1268178  

Spanish

           555597  

Estonian

           42394  

Finnish

           307559  

French

           1214709  

Irish

           63005  

Croatian

           14171  

Hungarian

           37748  

Italian

           632350  

Lithuanian

           44213  

Latvian

           35305  

Maltese

           48205  

Dutch

           634800  

Polish

           63924  

Portuguese

           473401  

Romanian

           44333  

Slovak

           41627  

Slovenian

           48312  

Swedish

           286787  

Latin

           61237  

Multilinugal

           5102  

 All

      8009161  

 

 

Conditions for use

You are allowed to reproduce the data provided on this page for your personal needs, to distribute it for non-commercial and commercial purposes, and to make and distribute derivative works, provided the source is acknowledged as follows:

Download IATE, European Union, [year].

The software necessary for exploitation or extraction (IATExtract) is distributed with the export file. The Translation Centre for the Bodies of the European Union makes this extraction tool available under the EUPL licence.

You are not allowed to reproduce or distribute the Download IATE page or the IATE logo without prior permission.

Contact

For more information on the IATE download, please contact iate@cdt.europa.eu