profile

Wojciech Kusa

Natural Language Processing ∎ Machine Learning
I am a final year PhD Candidate at TU Wien supervised by Allan Hanbury and Petr Knoth. I am a Marie Skłodowska-Curie Research Fellow under the EU Horizon 2020 project DoSSIER, focused on Domain Specific Systems for Information Extraction and Retrieval.
My primary research revolves around automating the systematic literature review process. Specifically, I am engaged in developing more robust evaluation methods and exploring techniques to enhance the information retrieval capabilities of large language models. Furthermore, I have a keen interest in using information extraction approaches to increase the efficiency of biomedical document retrieval.
Prior to my current research, I held a position at Samsung R&D and had the opportunity to intern at Sony CSL and UNINOVA. I obtained my MSc and BSc in Computer Science from AGH University of Science and Technology in Cracow and a BA in Cognitive Science from Jagiellonian University.
I am an avid sailor and wayfarer. In 2019-2020, I had an incredible experience living on a sailing yacht with three friends. We set sail from England and covered a whopping 5,000 nautical miles, all the way to the stunning shores of Greece. If you're interested, we've documented our adventures on our YouTube channel.

News

Jan 04, 2024
I'm visiting UCL to work on LLM evaluation. I will be joining the Web Intelligence group and collaborating with Aldo Lipani.
Sep 23, 2023
Delighted to announce that our paper, CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews, has been accepted to NeurIPS 2023 Track on Datasets and Benchmarks.
Sep 04, 2023
I'm helping in the organisation of this year's ICASR workshop. I'll also be presenting our work on citation screening metrics and datasets.
Aug 07, 2023
CRUISE-Screening was accepted as a demo at CIKM 2023. See you in Birmingham!
Mar 09, 2023
I gave a talk about effect-based evaluation of systematic review automation at the TIGER (The Information retrieval GEneral Reading) group meeting at RMIT University.
Dec 19, 2022
I've joined the University of Queensland as a Visiting Researcher. Thrilled to be collaborating with Guido Zuccon at the IELab.

Selected publications

NeurIPS
CSMeD: Bridging the Dataset Gap in Automated Citation Screening for Systematic Literature Reviews
Wojciech Kusa, Óscar E. Mendoza, Matthias Samwald, Petr Knoth, Allan Hanbury
37th Conference on Neural Information Processing Systems Track on Datasets and Benchmarks (accepted)
CIKM
CRUISE-Screening: Living Literature Reviews Toolbox
Wojciech Kusa, Petr Knoth, Allan Hanbury
32nd ACM International Conference on Information and Knowledge Management (accepted)
JBI
Effective Matching of Patients to Clinical Trials using Entity Extraction and Neural Re-ranking
Wojciech Kusa, Óscar E. Mendoza, Petr Knoth, Gabriella Pasi, Allan Hanbury
Journal of Biomedical Informatics, 104444
ICTIR
Outcome-based Evaluation of Systematic Review Automation
Wojciech Kusa, Guido Zuccon, Petr Knoth, Allan Hanbury
The 13th International Conference on the Theory of Information Retrieval
SIGIR
VoMBaT: A Tool for Visualising Evaluation Measure Behaviour in High-Recall Search Tasks
Wojciech Kusa, Aldo Lipani, Petr Knoth, Allan Hanbury
The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
ISWA
An Analysis of Work Saved over Sampling in the Evaluation of Automated Citation Screening in Systematic Literature Reviews
Wojciech Kusa, Aldo Lipani, Petr Knoth, Allan Hanbury
Intelligent Systems with Applications, pp. 200193, 2023, ISSN: 2667-3053
NeurIPS
BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Jason Alan Fries, Leon Weber, Natasha Seelam, Gabriel Altay, Debajyoti Datta, Samuele Garda, Myungsun Kang, Ruisi Su, Wojciech Kusa, Samuel Cahyawijaya, Fabio Barth, Simon Ott, Matthias Samwald, Stephen Bach, Stella Biderman, Mario Sänger, Bo Wang, Alison Callahan, Daniel León Periñán, Théo Gigant, Patrick Haller, Jenny Chim, Jose David Posada, John Michael Giorgi, Karthik Rangasai Sivaraman, Marc Pàmies, Marianna Nezhurina, Robert Martin, Michael Cullan, Moritz Freidank, Nathan Dahlberg, Shubhanshu Mishra, Shamik Bose, Nicholas Michio Broad, Yanis Labrak, Shlok S. Deshmukh, Sid Kiblawi, Ayush Singh, Minh Chien Vu, Trishala Neeraj, Jonas Golde, Albert Villanova del Moral, Benjamin Beilharz
Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS) Track on Datasets and Benchmarks
SDP
Benchmark for Research Theme Classification of Scholarly Documents
Óscar E. Mendoza, Wojciech Kusa, Alaa El-Ebshihy, Ronin Wu, David Pride, Petr Knoth, Drahomira Herrmannova, Florina Piroi, Gabriella Pasi, Allan Hanbury
Third Workshop on Scholarly Document Processing at COLING 2022
SIGIR
ORCAS-I: Queries Annotated with Intent using Weak Supervision
Daria Alexander, Wojciech Kusa, Arjen P. de Vries
The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval
BioNLP
DoSSIER at MedVidQA 2022: Text-based Approaches to Medical Video Answer Localization Problem
Wojciech Kusa, Georgios Peikos, Óscar Espitia, Allan Hanbury, Gabriella Pasi
The 21st Biomedical Natural Language Processing (BioNLP) Workshop @ ACL 2022
ECIR
Automation of Citation Screening for Systematic Literature Reviews using Neural Networks: A Replicability Study
Wojciech Kusa, Petr Knoth, Allan Hanbury
44th European Conference on Information Retrieval