Social Media, Online Disinformation, and Elections

GATE in brief

GATE provides open-source models and algorithms for information extraction (IE), social media analytics, machine learning for IE, knowledge graphs and semantic annotation, and natural language processing as a service. In total, the infrastructure and all its open-source components and models comprise over 350,000 lines of code.

The GATE infrastructure is unique in offering both researchers and companies a comprehensive platform-as-a-service, GATE Cloud. In the past year it attracted over 290 registered users, who used its services over 37,000 times.

Large-scale IE is a particular strength: GATE serves as the platform for extracting information from the web, news wires, scientific papers, and legal and medical documents. Selected example users include:

  • The BBC, which in 2012 built on the development of robust, scalable IE from patents to automate its coverage of the Olympic Games through semantic annotation.

  • WHO epidemiologists, who used GATE to find the first gene-disease association linking lung cancer and smoking.

Other examples in the UK:

  • Garlik (from the founders of Egg PLC) to fight identity theft

  • Innovantage for intelligent recruiting

  • Fizzback (now NICE) for analysing customer feedback

  • The UK National Archives

  • The Press Association

  • Financial Times

  • The Stationery Office

  • Nesta

  • TechCity UK

  • Synaptica

  • Text Mining Solutions

  • Buzzfeed UK

  • Public Health England

Contact us

To send us feedback or if you are interested in collaboration, email

Visit the GATE website for more information about text analysis and language processing.

Find out more about the Computer Science Department at the University of Sheffield.

Follow us @GateAcUK
