Seminar: Why analyze biomedical text if we have data?

Title: Why analyze biomedical text if we have data?
Speaker: Francisco Couto (LASIGE/DI-FCUL)
Date: December 18, 14h30
Where: https://videoconf-colibri.zoom.us/my/tjvguerreiro
Organized by: Tiago Guerreiro

Data Science Seminars is a Ciências ULisboa Master Programme in Data Science’s course that offers an overview of data science, with a focus in its application areas. 

 

Article: COVID-19: A Semantic-Based Pipeline for Recommending Biomedical Entities

The article “COVID-19: A Semantic-Based Pipeline for Recommending Biomedical Entities”, co-authored by Marcia Afonso Barros, Andre Lamurias, Diana Sousa, Pedro Ruas, Francisco M. Couto was accepted and presented at the proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020.

The main contributions of this work are: a dataset of 9k articles automatically annotated with relevant items/concepts for CORD19; a sample dataset curated for CORD-19; a sample dataset with relations between the entities of the four ontologies; an implicit feedback matrix based on the previous datasets.

The work is freely available at: https://github.com/lasigeBioTM/knowledge-extraction-from-CORD-19