Xingyi Song
Room 153, Department of Computer Science
Regent Court (DCS), 211 Portobello
Sheffield, S1 4DP
I’m Dr. Xingyi Song (宋星仪), a Lecturer in the School of Computer Science at the University of Sheffield, UK. I am also a member of the Natural Language Processing (NLP) Group and the GATE Team (gate.ac.uk). My research interests span Natural Language Processing, Computational Social Science, Bio-medical Text Analysis, Speech Analysis, and Financial Technology. In addition to my academic role, I serve as the Chief Scientific Officer at Sentient Machines(https://sentientmachines.tech/), where I focus on applying NLP solutions to financial technology.
Previously, I worked as a Machine Translation Specialist at Iconic Translation Machine (2015–2016) and as a Research Associate on various EU-funded projects—such as Kconnect, Knowmak, and Risis2—at the University of Sheffield (2016–2021). I completed both my MSc and PhD within the NLP Group at the University of Sheffield.
I am open to research and industry collaborations, so feel free to contact me if you have any ideas or potential projects in mind.
selected publications
- Cross-modal augmentation for few-shot multimodal fake news detectionEngineering Applications of Artificial Intelligence, 2025
- Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science ResearchEMNLP, 2024
- Confidence Regulation Neurons in Language ModelsIn NeurIPS , 2024
- Examining the Limitations of Computational Rumor Detection Models Trained on Static DatasetsLREC-COLING, 2024
-
- Examining Temporalities on Stance Detection Towards COVID-19 VaccinationLREC-COLING, 2024
- Large Language Models Offer an Alternative to the Traditional Approach of Topic ModellingLREC-COLING, 2024
- Identifying and Aligning Medical Claims Made on Social Media with Medical EvidenceLREC-COLING, 2024
- Don’t waste a single annotation: improving single-label classifiers through soft labelsIn Findings of the Association for Computational Linguistics: EMNLP 2023 , Dec 2023
- VaxxHesitancy: A Dataset for Studying Hesitancy Towards COVID-19 Vaccination on TwitterIn Proceedings of the International AAAI Conference on Web and Social Media , Dec 2023
- GATE Teamware 2: An open-source tool for collaborative document classification annotationIn Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics , May 2023
- Similarity-Aware Multimodal Prompt Learning for fake news detectionInformation Sciences, May 2023
- Text mining occupations from the mental health electronic health record: a natural language processing approach using records from the Clinical Record Interactive Search (CRIS) platform in south London, UKBMJ open, May 2021
- Classification aware neural topic model for COVID-19 disinformation categorisationPloS one, May 2021
- Using ontologies to map between research data and policymakers’ presumptions: the experience of the KNOWMAK projectScientometrics, May 2020
- Comparing topic-aware neural networks for bias detection of newsIn Proceedings of 24th European Conference on Artificial Intelligence (ECAI 2020) , May 2020
- A Deep Neural Network Sentence Level Classification Method with Context InformationIn Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing , May 2018