Knowledge Graphs

Leverage Integrated Knowledge Graphs to Accelerate Biomedical Research, Drug Discovery, and Precision Medicine

April 15, 2024 ALL TIMES EST

The biopharma industry is increasingly embracing knowledge graphs, harnessing their power and versatility to accelerate the advancement of data science and AI-driven drug discovery. These innovative graphs facilitate knowledge sharing by creating a seamless integration point across diverse data types and sources. Consequently, shared data is transformed into actionable knowledge, and a systematic mechanism is established to extract this knowledge in a traceable and timely manner. At the Knowledge Graphs Symposium, you will gain firsthand insights into how biopharma organizations are actively engaged in constructing their own internal knowledge graphs and collaborating with external partners to implement them effectively. The presentations will showcase the tangible impact these knowledge graphs have on pharmaceutical research and beyond.

Sunday, April 14

Registration Open5:00 pm

Monday, April 15

Registration and Morning Coffee7:00 am

Organizer's Remarks8:00 am

KNOWLEDGE GRAPHS: INTRODUCTION, USE CASES, TECHNICAL FOUNDATION

8:05 am

Chairperson's Remarks

Michael Liebman, PhD, Managing Director, IPQ Analytics, LLC

8:10 am

An Introduction to Knowledge Graphs: A Technical Foundation

Peter V. Henstock, PhD, Former Machine Learning & AI Lead, Software Engineering & Statistics & Visualization, Pfizer Inc.

8:30 am

From Evidence to Insights: Cascading Insight Generation Using Knowledge Graphs

Sebastian Scharf, PhD, Data Scientist, Roche Pharma

The talk is about automated generation of insights from evidence with a special focus on provenance.

8:50 am

Charting the Patient Path: Harnessing Knowledge Graphs for Insightful Visualization and Management of Care Infrastructure

Ray Lukas, Principal Emerging Technologies Engineer, MITRE National Labs

9:10 am Data-Centric Initiatives in the Age of GenAI

Amar Doshi, President, TopQuadrant

Knowledge graphs are known to be a pivotal technology for improving efficiency in pharma operations. Today, adoption of knowledge graphs has accelerated and grown to be a key component in the larger enterprise data architecture as new AI initiatives are now an imperative. This session will explore three case studies that will demonstrate what data-centric strategies pharmas are now taking, with a playbook for how to get started with semantics.

Networking Coffee Break9:40 am

10:00 am

Knowledge-Driven Mechanistic Enrichment for Improved Phenotyping of Pediatric Rare Disease

Tiffany Callahan, PhD, Postdoctoral Research Scientist, IBM

The nearly universal adoption of electronic health records and rapid advancements in multiomics technologies have made tremendous amounts of data available for research. Unfortunately, these data are often highly distributed and heterogeneous which limits most researchers from utilizing them. Knowledge graphs (KGs) have frequently been used to systematically interrogate the biology underlying complicated systems and diseases. In this talk, I introduce novel methods designed to improve pediatric rare disease phenotyping.

10:20 am

An Ignorance-Base for Prenatal Nutrition: A Knowledge Graph to Explore the Literature's Known Unknowns

Mayla R. Boguslav, PhD, Postdoctoral Research Fellow, Colorado State University

Research progresses through accumulating knowledge such that a previously unexplored subject (an unknown unknown) becomes an active research area exploring the questions (known unknowns), until a body of established facts emerges (known knowns). Many knowledge-bases exists for known knowns, but no ignorance-bases exist for known unknowns. What novel connections and insights are in the unknowns? Using a knowledge graph, we created the first ignorance-base for prenatal nutrition to help find pertinent questions that could affect mothers and offspring globally.

10:40 am

Knowledge Graphs: Complex Reasoning in Clinical Research and Clinical Decision Support

Michael Liebman, PhD, Managing Director, IPQ Analytics, LLC

Knowledge graphs can provide an ideal environment for representation and reasoning about the complexities of the patient, of the disease, and of clinical practice. It is critical, however, to incorporate temporal considerations of processes as well as the continuing evolution of data and and concepts, e.g., clinical protocols for diagnosis and treatment. The application of knowledge graphs as a "healthcare learning environment" that addresses health equity issues in infant/maternal morbidity and mortality as well as hypertensive disorders of pregnancy will be presented.

11:00 am PANEL DISCUSSION:

Getting Started with Knowledge Graphs

PANEL MODERATOR:

Michael Liebman, PhD, Managing Director, IPQ Analytics, LLC

PANELISTS:

Mayla R. Boguslav, PhD, Postdoctoral Research Fellow, Colorado State University

Tiffany Callahan, PhD, Postdoctoral Research Scientist, IBM

Peter V. Henstock, PhD, Former Machine Learning & AI Lead, Software Engineering & Statistics & Visualization, Pfizer Inc.

Ray Lukas, Principal Emerging Technologies Engineer, MITRE National Labs

Sebastian Scharf, PhD, Data Scientist, Roche Pharma

Transition to Lunch11:30 am

11:40 am LUNCHEON PRESENTATION:Harnessing AI to Bridge the Gap Between your Data and Global Research Knowledge

Sebastian Schmidt, CEO, metaphacts

The data needed to create new opportunities and drive decisions is abundant, but it is distributed across heterogeneous sources and lacks the context needed to deliver insights. The Dimensions KG powered by metaphactory combines the power of symbolic AI and neural AI to transform data into knowledge, connect internal data with global research knowledge, and augment and scale business decisions. Customers benefit from actionable and explainable insights following a human-in-the-loop approach. 

Session Break12:40 pm

TAKING THE NEXT STEP: BUILDING ADVANCED GRAPHING CAPABILITIES

12:55 pm

Chairperson's Remarks

Janice McCallum, Managing Director, Health Content Advisors

1:00 pm

Generative AI and Knowledge Graphs for Frictionless Information Access

Nick Brown, Executive Director, Imaging & Data Analytics, AstraZeneca

In the vast world of biology data, finding the right information can feel like searching for a needle in a haystack. This talk introduces an exciting solution: using AI chatbots paired with knowledge graphs. These tools can dive deep into large datasets, unveiling crucial insights that might otherwise stay hidden. We'll explore how this blend of technology helps, its safety benefits, and the ways it promises to revolutionize biology research. Imagine a powerful magnifying glass for data—this is the future of scientific research. Join us to see how AI is reshaping our understanding of the biological world. 

1:20 pm

Ask ARCH: LLM Question Answering over Large-Scale Knowledge Graphs

Matt Docherty, Associate Principal, ZS Associates

Jon Stevens, PhD, AI Language Capability Lead, AbbVie, Inc.

Knowledge graphs provide a vehicle for grounding LLM answers in harmonized structured data, reducing hallucinations and allowing easy fact-checking. In turn, LLMs provide a natural way for end users to query knowledge graph data, without requiring a query language or deep understanding of database structure. We present our integration of AbbVie's 30-million-node R&D knowledge graph, the ARCH Graph, with GPT-based LLMs to create a scientific question-answering system. The ARCH Graph is a Neo4J graph that harmonizes and connects molecules, drugs, genes, health conditions, and other entities from a variety of data sources, allowing scientists to make connections between disparate data points. However, querying the graph can be challenging for end users without a natural language interface. The new Ask ARCH Graph provides such an interface, allowing users to ask questions in natural language (e.g., "What genetic markers are associated with acute myeloid leukemia?") and receive natural language answers ("Some genetic markers associated with acute myeloid leukemia include PICALM (ENSG00000073921), CEBPA (ENSG00000245848), ...") along with the underlying data and the Cypher query used to retrieve it. To achieve this, the system utilizes a combination of vector search, Cypher query generation and validation, and LLM-based summarization of the Cypher output. The process of accurately retrieving information from a large-scale knowledge graph is more complex and less researched than simpler RAG methods on document corpora. We discuss the evolution of our approach and evaluate its accuracy and performance. The integration of LLMs with knowledge graphs helps reduce hallucinations, improve reliability in specialized domains, enhance reasoning with context, and enable dynamic and interactive knowledge discovery.

1:40 pm

Insights through Knowledge Graphs, Quantum Computing, and Machine Learning in the NIH's Bridge2AI Initiative

Wade L. Schulz, MD, PhD, Assistant Professor; Director of Informatics, Laboratory Medicine; Director, CORE Center for Computational Health, Center for Outcomes Research & Evaluation (CORE), Yale School of Medicine

The Bridge2AI initiative, initiated by the National Institutes of Health (NIH), aims to broaden the application of artificial intelligence in both biomedical and behavioral research. This talk discusses a project we are engaged in that integrates knowledge graphs with quantum computing and machine learning, as a component of the NIH's Bridge2AI program.

Networking Refreshment Break2:00 pm

2:20 pm  Accelerating Drug Discovery with Comprehensive Curated Knowledge

Kyle Nilson, PhD, Senior Field Application Scientist, QIAGEN Digital Insights

In the rapidly evolving landscape of drug discovery, you need the ability to integrate high-quality research findings into knowledge graphs. For >20 years, our curated knowledge has been the foundation of QIAGEN Ingenuity Pathway Analysis (IPA). Now, you can tap into these data outside of IPA with QIAGEN Biomedical KB-HD. Discover the transformative role of knowledge graphs in predicting adverse outcomes and toxicity, understanding mechanisms of action and accelerating pharmaceutical R&D. 

2:50 pm CO-PRESENTATION:

Enhancing Decision-Making and Drug Portfolio Workflows at Takeda Pharmaceuticals: The Role of Information Architecture and Digital Tools

Seth Earley, CEO, Earley Information Science

Giovanni Piazza, Head of Knowledge Management Services, Takeda

This case study examines the redesign of the drug portfolio evaluation process. The existing workflow for Takeda Pharmaceutical’s Portfolio Review Committee (PRC) was cumbersome, reliant on manual updates, and lacked effective searchability, significantly impeding the efficiency of executive decision-making. Recognizing the critical need for a streamlined process, the organization embarked on a comprehensive overhaul involving the integration of workflows, enhancement of user experience, and implementation of an information architecture aligned with the corporate ontology and knowledge graph. The solution leveraged the existing SharePoint infrastructure but bore little resemblance to traditional SharePoint applications to create a unified information environment that facilitated efficient portfolio management, improved searchability, and provided executives with componentized views of submissions. This transformation not only optimized the drug portfolio evaluation process but also ensured secure and strategic management of sensitive information, demonstrating a significant leap in operational efficiency and decision-making speed for the organization.  A Large Language Model (LLM) PoC was also developed utilizing the PRC application information architecture which showed the improved recall and accuracy and virtually eliminated hallucinations using a Retrieval Augmented Generation (RAG) mechanism.

3:30 pm

Knowledge Graphs in Translational Research: Unleashing New Discoveries

Ewy A. Mathe, PhD, Director Informatics, Preclinical Innovation, NIH NCATS

Translational research is significantly enriched by public knowledge sources that provide annotations across various levels, such as drugs/treatments, diseases, targets, and metabolism. The value of these sources has surged in the last decade due to the broader availability and adoption of standardized identifiers, processes, and metadata capture. These enhancements have led to the creation of detailed graphs (networks) that amalgamate complementary data, offering new insights or revealing previously overlooked or unconsidered connections. Additionally, machine learning and artificial intelligence (AI/ML) methods can be utilized to predict outcomes, address a range of questions in translational research, and formulate innovative hypotheses. In this presentation, we highlight innovations within the National Center for Translational Sciences in generating graphs to identify repurposed drugs, novel biological mechanisms and to discover new associations between genes, targets and diseases.  Examples highlighted will include the NCATS Translator (https://ui.ci.transltr.io/demo) and the Rare Disease Alert System (https://rdas.ncats.nih.gov/) and more.

3:50 pm PANEL DISCUSSION:

Advanced Capabilities

PANEL MODERATOR:

Janice McCallum, Managing Director, Health Content Advisors

PANELISTS:

Nick Brown, Executive Director, Imaging & Data Analytics, AstraZeneca

Seth Earley, CEO, Earley Information Science

Ewy A. Mathe, PhD, Director Informatics, Preclinical Innovation, NIH NCATS

Giovanni Piazza, Head of Knowledge Management Services, Takeda

Wade L. Schulz, MD, PhD, Assistant Professor; Director of Informatics, Laboratory Medicine; Director, CORE Center for Computational Health, Center for Outcomes Research & Evaluation (CORE), Yale School of Medicine

Jon Stevens, PhD, AI Language Capability Lead, AbbVie, Inc.

Close of Symposium4:20 pm

Transition to Plenary Keynote4:20 pm

PLENARY KEYNOTE PROGRAM

4:30 pm

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

4:35 pm Plenary Keynote Introduction

Greg Mazzu, Regional Sales Manager, WEKA

4:45 pm PLENARY KEYNOTE PRESENTATION:

Unleashing the Power of Advanced Computing in Biomedical Informatics: A Vision for Transformative Collaboration

Daniel Stanzione, PhD, Executive Director, Texas Advanced Computing Center (TACC)

In the dynamic intersection of life science and computing, our mission at the Texas Advanced Computing Center (TACC) is to propel biomedical informatics into a new era of discovery and innovation. As computational leaders, we are dedicated to harnessing the potential of high-performance computing (HPC), machine learning (ML), and data analytics to revolutionize medicine. In this visionary pursuit, we prioritize the development of user-friendly interfaces and intuitive platforms. This approach ensures accessibility for executives and leaders in the life sciences industry, promoting seamless interaction with computational tools and fostering an environment where scientific and technological advancements coalesce. This presentation shares our vision for shaping the future of biomedical informatics where innovation, collaboration, and cutting-edge technologies converge to redefine the boundaries of what is possible in the realm of medicine.

Welcome Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)6:00 pm

Close of Day7:15 pm






Ways to Participate

Conference Tracks

Data Platforms & Storage Infrastructure