HIS 2022
HIS 2022

Keynote 1: Data-Driven Drug Repurposing Hypotheses Using Heterogeneous Types of Data: the DISNET Approach

Alejandro Rodríguez-González, Universidad Politécnica de Madrid, Spain

Abstract

Nowadays there are many sources and types of data available on the Internet. From textual sources to a semantic-enriched dataset, passing through data stored in relational databases or formats such as CSV, JSON, etc. In the biomedical domain, this information can be used in several data-driven approaches, but it is necessary for a complete process that allows the identification, extraction, and enrichment of the data to make it useful. DISNET is a project that was created to process heterogeneous sources of data in order to create a system able to integrate different types of biomedical information on a single platform to increase our current knowledge about diseases and, hence, to have a better understanding of how the diseases share components and behave. DISNET comprises a pipeline that goes from the crawling of non-structured (textual) information from different sources, passing through the application of NLP tools to extract useful disease-phenotype relationships and integrating this knowledge with other well-known sources of biomedical information from a biological and drug perspective, to create an enormous database of disease-related biomedical information and to create one of the largest sources of human disease networks data. As part of the results derived from the extraction and the analysis of the DISNET data, we have been able to have a real better understanding of the diseases, but also make use of the data to create a data-driven methodology for creating drug repurposing hypotheses, as well as, applying those analyses to study potential drugs to be used for COVID-19.

Bio

alejandro
Alejandro Rodríguez-González, PhD, is Full Professor at the Department of Computer Languages and Systems and Software Engineering at Universidad Politécnica de Madrid. He has a degree in Computer Science, a M.Sc in Computer Science and technology in the specialty of Artificial Intelligence, a M.Sc in Engineering Decision systems and a PhD in Computer Science. He is the principal investigator of the Medical Data Analysis laboratory (MEDAL) at Center for Biomedical Technology. His main research interests include Artificial Intelligence and Biomedical informatics field, with interest on the creation of Medical Diagnosis Systems, medical knowledge representation and the extraction of knowledge from different sources (text, social media, etc.) and the understanding of diseases. In 2019 he was also the General Chair of 32nd IEEE CBMS International Symposium on Computer-Based Medical Systems (CBMS 2019). At this moment he is the Chair of the Steering Committee of CBMS conference. Prof. Rodríguez research experience includes the participation in more than 20 research and innovation projects at national (Spain) and international level (H2020/EIT Health) level. This participation includes being the principal investigator of two projects funded by the Spanish ministry of industry and two funded by H2020 programme. He is currently the coordinator of P4-LUCAT (Personalized medicine for lung cancer treatment: using Big Data-driven approaches for decision support) project, funded under ERA NET on Personalized Medicine JCT 2019 call. His main research line is the analysis of the data referred to diseases to try to have a better understanding of those diseases. In this context, his expertise is focused on the concept of the human disease networks, which are characterized by their constituent elements at the biological and phenotypical levels. With this information, and applying various types of analysis, we aim to have a better understanding of how diseases are related among them based on their shared features. One of the use cases of this understanding is drug repositioning, trying to find new uses to existing drugs. In January 2018 he was awarded with the “2017 Best Research Trajectory” from Universidad Politécnica de Madrid.

Keynote 2: Explainable Knowledge Acquisition for Medical AI Models with a Global Generative Approach

Yunpeng Cai, Research Center for Biomedical Information Technology, Shezhen Institutes of Advanced Technology (SIAT), Chinese Academy of Sciences, China

Abstract

Explainability poses a major challenge to artificial intelligence (AI) techniques. Especially, in medical fields, lacking of explainability would lead to high potential risks and thus hinder the application of AI models. However, current studies on explainable AI (XAI) lack the efficiency of extracting global knowledge about the learning task, thus suffer deficiencies such as imprecise saliency, context-aware absence and vague meaning. In this talk we review current XAI researches and summarize their progresses and pitfalls. After that, we propose a new type of XAI technique which features in generative manipulation of samples that change the model outputs freely but keep the background characters. We adopt the framework on medical image classification tasks, showing that our approach reached more precise explanation of black-box models and helped with discovering biomedical rules.

Bio

alejandro
Yunpeng Cai is a Professor and the executive director of the Research Center for Biomedical Information Technology, Shezhen Institutes of Advanced Technology (SIAT), CAS. His research interests include bioinformatics, health informatics and health big data mining, with specialties in EMR big data mining, metagenomics and multi-omics analyses, graph neural networks and explainable machine learning. He published over 100 papers including those on top journals such as Nucleic Acid Research, Briefings in Bioinformatics, PLOS Computational Biology, IEEE Trans. On Cybenetics and IEEE. Trans Neural Networks and Learning Systems, which received over 2100 citations. He is currently the director of the Guangdong Engineer Center & Shenzhen Engineer Lab for Health Big Data Analysis, and editorial borad member of the Health Information Science and Systems journal.