Profile icon

Ziqi Zhang

Lecturer/Senior Lecturer

School of Science & Technology


Dr Ziqi Zhang is a Lecturer in Computer Science, and is the module leader of the following modules:

  • ISYS10242: System Analysis & Design

He also assists in the teaching of the following modules:

  • COMP10081: Foundations of Comp & Tech
  • ISYS10241: System Analysis & Design

Career overview

Before joining the School of Science and Technology at NTU, Dr Zhang had been working since 2006 as a researcher at the Organisations, Information and Knowledge (OAK) group of the Department of Computer Science, University of Sheffield. During this time he completed his PhD (supervised by Professor Fabio Ciravegna) in 2013.

Research areas

Research interest

Dr Zhang's research addresses methods that enable machines to extract human knowledge from text, to represent such knowledge in a structured representation that is understandable and usable by machines. This ultimately enhances our capability of processing and sense-making of very large scale data, improving decision making. Specifically, his research interests include but are not limited to:

  • Information Extraction: how to automatically turn unstructured, natural language text into structured representation that could support machine understandability and reasoning. This could include the extraction of terms, concepts, named entities, and relations between them from texts.
  • Disambiguation: how to teach machines to automatically identify which meaning of a word or phrase is used within certain context.
  • Lexical semantics: how to represent the ‘meaning’ of a word, name, phrase, or sentence; how to measure the relatedness and similarity of these meanings (semantic relatedness and similarity).
  • Knowledge base construction: the use of all the above technology in the automatic creation of structured ‘databases’ that support machine understandability and reasoning; and methods of mapping such knowledge bases (ontology alignment, ontology mapping). An example of a knowledge base is the Google Knowledge Graph, or DBpedia.
  • Semantic Web and Linked Data: the use of all the above technology to enable the vision of tomorrow’s Web where machine understandable data are put on the Web, shared and reused across application, enterprise, and community boundaries.
  • Social media analysis: the application and adaptation of Information Extraction methods onto social media text analytics, to enable event discovery and monitoring.

Dr Zhang's PhD focused on exploiting background knowledge from various resources to support supervised Named Entity Recognition - a fundamental task of Information Extraction that extracts named entities from unstructured texts. He worked on the Linked Open Data for Information Extraction (LODIE) project funded by the EPSRC, to research and develop Information Extraction techniques able to (i) scale at web level and (ii) adapt to user information need, by exploiting linked data on the Web. He was also a core member of the research and development team behind the world’s first football transfer news prediction system,

Research software

Dr Zhang is the owner and lead developer of the following research software hosted on GitHub:

  • JATE – Java Automatic Term Extraction: an open source library for automatic domain specific term extraction from text corpora. Previously rated top 100 AI tools published on Google Project.
  • STI – Semantic Table Interpretation: a library implementing several state of the art algorithms and a few baselines for linking content in relational Web tables with a Knowledge base.

External activity

Event organiser

  • Co-organiser for the workshops on Linked Data for Information Extraction (LD4IE), 2016, 2015, 2014, 2013


  • IEEE Transactions on Knowledge and Data Engineering
  • ACM Transactions on Knowledge Discovery from Data
  • Semantic Web Journal
  • The Extended Semantic Web Conferences
  • The International Semantic Web Conferences
  • International Conference on Knowledge Engineering and Knowledge Management
  • International Workshops on Mining Scientific Publications
  • Open Knowledge Extraction challenge at the Extended Semantic Web Conference

Tutorials and Talks

  • Tutorial at the Semantics 2016 conference: (Semi)automatic conversion of tabular data into Linked Data, Leipzig (2016)
  • Invited talk at Schwa lab, the University of Sydney. Aligning relations on Linked Data (2013)
  • Tutorial at ISWC2013 Web Scale Information Extraction: Gentile, A., Zhang, Z.
  • Tutorial at ECML/PKDD2013 Web Scale Information Extraction: Gentile, A., Zhang, Z.
  • Tutorial at ECML/PKDD2011: Ciravegna, F., Varga, A., Zhang, Z. 2011. Mining Complex Entities from Heterogeneous Information Networks, in 22th European Conference on Machine Learning (ECML) and the 15th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD).
  • Tutorial at EKAW2010: Zhang, Z., Cano, E., Elbedweihy, K., Dadzie, A. 2010. Introduction to Knowledge Acquisition from Social Networking Sites, in the conference on Knowledge Engineering and Knowledge Management by the Masses, EKAW2010.


Effective and efficient semantic table interpretation using TableMiner+. Zhang Z, The Semantic Web Journal, 2016 (accepted)

JATE 2.0: Java Automatic Term Extraction with Apache Solr. Zhang Z, Gao J, Ciravegna F, Proceedings of the Tenth International Conference on Language Resources and Evaluation, 2016, 2262-2269

An unsupervised data-driven method to discover equivalent relations in large linked datasets. Zhang Z, Gentile A, Augenstein I, Blomqvist E, Ciravegna F, The Semantic Web Journal (special issue on ontology and linked data matching), 2015, 1-27

Towards efficient and effective semantic table interpretation. Zhang Z. 13th International Semantic Web Conference proceedings, Part I, 2014, 487-502

Mining equivalent relations from linked data. Zhang Z, Gentile A, Augenstein I, Blomqvist E, Ciravegna F, Proceedings of the 51st annual meeting of the Association for Computational Linguistics (ACL 2013): System Demonstrations, 2013, 289-293

Unsupervised wrapper induction using linked data. Gentile A, Zhang Z, Augenstein I, Ciravegna F, Proceedings of the seventh international conference on knowledge capture, 2013, 41-48

Topic-oriented words as features for named entity recognition. Zhang Z, Trevor Cohn, Ciravegna F, Proceedings of 14th International Conference on intelligent text processing and computational linguistics, 2013, 304-316

Recent advances in lexical semantic relatedness - a survey. Zhang Z, Gentile A, Ciravegna F, Journal of Natural Language Engineering, 2013, 19 (4), 411-479

See all of Ziqi Zhang's publications...