Download IATE and the IATE extraction tool IATExtract

IATE is a living database, i.e. translators and terminologists are continuously updating its content. Using the IATE search interface (http://iate.europa.eu/) thus ensures that you are accessing the most complete and up-to-date data. However, in order to cater for specific needs, e.g. for linguistic research, you can also download a copy of some of the data contained in IATE. 

The distribution consists of a zip file with the following name: IATE_download.zip .
An extraction tool named IATExtract is made available on this site in order to help users create subsets of the IATE download file, using a number of possible filtering criteria. Users can extract data for one or more specific languages, for a given domain or domain cluster. The subsets created by the extraction tool IATExtract are provided in the same
TermBase eXchange (TBX) format as the uncompressed IATE download file. For further details see: TBXcoreStructV02.dtd, TBXXCS.xcs, tbxxcsdtd.dtd.
The size of the uncompressed file is about 2.1 gigabytes and the compressed (downloaded) file around 120 megabytes.

For information on the data structure and the data categories included in the download file, please see: IATE Data fields explained

You can download the file by clicking on the link below.

IATE_download.zip   (Publication date: 16/03/2017)

There is no need to unzip the IATE download file as the extraction tool IATExtract will access the data in the zip file directly.

Users also need to download the extraction tool IATExtract and copy it into a suitable directory on their computer. The tool IATExtract is distributed as a Java jar file. It can run with a graphical user interface on any operating system supporting the Java runtime of version 1.7 or newer.

 

How to produce subsets of the IATE download file

Users can extract subsets of data as follows, using the extraction tool IATExtract.

IATE Extract Tool interface

Statistics

The download file contains 1.3 million entries, 8 million terms in 24 official EU languages.

Language

Number of terms

Bulgarian

           39617  

Czech

           36740  

Danish

           555878  

German

           918579  

Greek

           487268  

English

           1244144  

Spanish

           553601  

Estonian

           45897  

Finnish

           308416  

French

           1194882  

Irish

           66332  

Croatian

           17745  

Hungarian

           42820  

Italian

           631580  

Lithuanian

           47342  

Latvian

           40395  

Maltese

           52626  

Dutch

           626772  

Polish

           68973  

Portuguese

           472218  

Romanian

           47938  

Slovak

           43487  

Slovenian

           52298  

Swedish

           285891  

Latin

           60792  

Multilinugal

           5295  

 All

      7947526  

 

 

Conditions for use

You are allowed to reproduce the data provided on this page for your personal needs, to distribute it for non-commercial and commercial purposes, and to make and distribute derivative works, provided the source is acknowledged as follows:

Download IATE, European Union, [year].

The software necessary for exploitation or extraction (IATExtract) is distributed with the export file. The Translation Centre for the Bodies of the European Union makes this extraction tool available under the EUPL licence.

You are not allowed to reproduce or distribute the Download IATE page or the IATE logo without prior permission.

Contact

For more information on the IATE download, please contact iate@cdt.europa.eu