GATE in brief
GATE has open-source models and algorithms for information extraction (IE), social media analytics, machine learning for IE, knowledge graphs and semantic annotation, and natural language processing-as-a-service. In total, the infrastructure and all its open source components and models comprise over 350,000 lines of code.
The GATE infrastructure is unique in its offering to both researchers and companies, a comprehensive platform-as-a-service – GATE Cloud. In the past year it attracted over 290 registered users, who used the services over 37,000 times.
Large-scale IE is a particular strength, where GATE is used as the platform to extract information from the web, news wires, scientific papers, and legal and medical documents. Selected example users include:
BBC in 2012, which used the development of robust, scalable IE from patents for automating the coverage of the Olympic games through semantic annotation.
WHO epidemiologists who found the first gene-disease association linking lung cancer and smoking.
Other examples in the UK:
Garlik (from the founders of Egg PLC) to fight identity theft
Innovantage for intelligent recruiting
Fizzback (now NICE) for analysing customer feedback
The UK National Archives
The Press Association
The Stationery Office
Text Mining Solutions
Public Health England
To send us feedback or if you are interested in collaboration, email firstname.lastname@example.org.
Visit the GATE website for more information about text analysis and language processing.
Find out more about the Computer Science Department at the University of Sheffield.