Data Management

Leverage Data for Value Creation in Life Sciences

April 16 - 17, 2024 ALL TIMES EST

With the growing demand for computing power among life science researchers and those addressing big data challenges, it is essential for data storage infrastructure to possess scalability to efficiently manage billions of data points and files. The primary challenge lies in the effective administration of data, ensuring its integration, accessibility, sharing, linkage, analysis, and maintenance to drive transformative change within the organization. Are data mesh and data fabric the true solutions to these challenges? How can one extract meaningful insights from data and its contents to generate value? The Data Management track delves into these inquiries and explores various topics, including FAIR data principles, data reuse, governance, literacy, data federation, and standards, as well as data curation and harmonization.

Monday, April 15

Recommended Pre-Conference Workshops and Symposia*8:00 am

On Monday, April 15, 2024, Cambridge Healthtech Institute is pleased to offer eight pre-conference Workshops scheduled across three time slots (8:00–10:00 am, 10:30 am–12:30 pm, and 2:00–4:00 pm) and six Symposia from 8:00 am–4:20 pm. All are designed to be instructional, and interactive and provide in-depth information on a specific topic. They allow for one-on-one interaction and provide a great way to explain more technical aspects that would otherwise not be covered during the main conference tracks that take place Tuesday–Wednesday.

*Separate registration required. See details on the Symposia here and details on the Workshops here.

PLENARY KEYNOTE PROGRAM

4:30 pm

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

4:35 pm Plenary Keynote Introduction

Greg Mazzu, Regional Sales Manager, WEKA

4:45 pm PLENARY KEYNOTE PRESENTATION:

Unleashing the Power of Advanced Computing in Biomedical Informatics: A Vision for Transformative Collaboration

Daniel Stanzione, PhD, Executive Director, Texas Advanced Computing Center (TACC)

In the dynamic intersection of life science and computing, our mission at the Texas Advanced Computing Center (TACC) is to propel biomedical informatics into a new era of discovery and innovation. As computational leaders, we are dedicated to harnessing the potential of high-performance computing (HPC), machine learning (ML), and data analytics to revolutionize medicine. In this visionary pursuit, we prioritize the development of user-friendly interfaces and intuitive platforms. This approach ensures accessibility for executives and leaders in the life sciences industry, promoting seamless interaction with computational tools and fostering an environment where scientific and technological advancements coalesce. This presentation shares our vision for shaping the future of biomedical informatics where innovation, collaboration, and cutting-edge technologies converge to redefine the boundaries of what is possible in the realm of medicine.

Welcome Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)6:00 pm

Close of Day7:15 pm

Tuesday, April 16

Registration and Morning Coffee7:00 am

PLENARY KEYNOTE PROGRAM

8:00 am

Organizer's Remarks

Allison Proffitt, Editorial Director, Bio-IT World and Clinical Research News

8:05 am Plenary Keynote Introduction

Josh Bond, Head of Product Management, Product Management, Revvity Signals

8:15 am PLENARY KEYNOTE PRESENTATION:

Unveiling Tomorrow's Possibilities: Embrace the Power of Digital Twins in Cancer Care and Research

Caroline Chung, MD, MSc, FRCPC, CIP, Vice President, Chief Data Officer, Director of Data Science Development & Implementation, Institute for Data Science in Oncology, MD Anderson Cancer Center

Explore the transformative potential of digital twins in revolutionizing cancer care and research. Gain insights into how digital twins can help deepen biological understanding, accelerate drug discovery, and personalize therapeutic strategies to optimize treatment outcomes for every individual. Amidst the exciting opportunities are the challenges that must be tackled to harness the power of digital twins to advance precision oncology, empower researchers and clinicians with unprecedented insights, and improve patient outcomes.

Coffee Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)9:30 am

Organizer's Welcome Remarks10:15 am

DATA SHARING AND KNOWLEDGE MANAGEMENT: PLATFORMS, FRAMEWORKS, AND TOOLS FOR COLLABORATING, INTEGRATING, ANALYZING, AND INTERPRETING

10:20 am

Chairperson's Remarks

Brigitte Ganter, PhD, Senior Director, Product Marketing, L7 Informatics, Inc.

10:25 am

Target to Lead: A Platform for Early Discovery Data Management

Rachana Ananthakrishnan, Executive Director, University of Chicago

A growing number of computationally intensive research activities require commensurate large-scale data management. In particular, high resolution imaging instruments such as cryogenic electron microscopes require automation of data flows to increase throughput and researcher productivity, as well as to ensure the instrument remains highly utilized. Globus was initially established as cyberinfrastructure for managed file transfer and secure data sharing. We have grown Globus into a comprehensive platform for research data management that includes services for data description and discovery, protected data management, and automation. The Globus platform-as-a-service is increasingly used to easily build and execute automated data flows in this context. We will describe how the platform facilitates end-to-end automation of complex research flows, and will present scenarios from research universities and national facilities that illustrate implementation of common use cases. 

10:55 am

How Generate is Managing Assay Data, and Our Assay Data Working Group

William Buchwald, Senior Software Engineer, Generate Biomedicines

Explore the role of data management in biomedical research at Generate Biomedicines. This talk delves into our strategic approach to managing assay data, showcasing the pivotal role of our Assay Data Working Group. Gain insights into the collaborative efforts shaping innovative data management practices. Discover how Generate optimizes the handling of assay data, ensuring efficiency, integrity, and accelerated advancements in biomedical research. Join us in navigating the complexities of data management for transformative breakthroughs.

11:25 am

Unleashing the Power of Material Properties: Accessing Pharmaceutical Manufacturability Driven by Data

Malte Bogdahn, PhD, Lead Scientist, Global CMC Development, Merck KGaA

This presentation explores the transition from raw data sheets from various departments to a centralized database, enabling the integration of previously disassociated data sets. Emphasizing a data-driven approach, this system empowers formulation and manufacturing decisions in the drug product development process. The application of low-code dashboarding facilitates comprehensive visualization and reveals valuable insights hidden in a scattered data landscape. Join us as we delve into this remarkable journey of scientific discovery and technological advancement.

11:55 am A Review of Uses of LLMs in Discovery Bioinformatics, the Role of Data Management & Lessons from the Field

Misha Kapushesky, PhD, CEO & Founder, Genestack Ltd.

LMM/AI promises to be the next paradigm shift in life science research however the theory and the reality are very different things. In this presentation we will discuss how we got powerful LLM models up and running quickly using Open Data Manager and some lessons learned along the way. 

12:10 pm Quilt: The Open Solution to Data Chaos

Ernest Prabhakar, PhD, Director of Product, Quilt Data, Inc.

The vast majority of Life Sciences companies fail to capture the full value of their data investment due to the organizational friction caused by data silos. We examine the underlying causes of this “data chaos” and explore how an open-source universal data abstraction--data packages--can help organizations rewrite the tradeoff between individual productivity and organizational velocity.

12:25 pm Harnessing AI to Bridge the Gap Between your Data and Global Research Knowledge

Sebastian Schmidt, CEO, metaphacts

The data needed to create new opportunities and drive decisions is abundant, but it is distributed across heterogeneous sources and lacks the context needed to deliver insights. The Dimensions KG powered by metaphactory combines the power of symbolic AI and neural AI to transform data into knowledge, connect internal data with global research knowledge, and augment and scale business decisions. Customers benefit from actionable and explainable insights following a human-in-the-loop approach.

Session Break & Transition to Lunch12:55 pm

1:05 pm LUNCHEON PRESENTATION:Escaping the Data Swamp! Case Studies in Building an AI-Ready Scientific Data Strategy

Nathan Clark, Founder, Ganymede

AI holds immense promise for scientific breakthroughs, but its success hinges on having high quality data. Join us for an informative and interactive session with audience participation as we explore how you can avoid the “data swamp” phenomenon that is plaguing our industry. Through real-world examples, we’ll navigate strategies for implementing FAIR principles in labs at every size, and review how robust data foundations can help you achieve effective AI deployment.

Refreshment Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)1:35 pm

CHALLENGES AND STRATEGIES OF DATA MANAGEMENT, ACCESSIBILITY, INTEROPERABILITY, AND LEGAL CONSIDERATIONS IN SCIENTIFIC RESEARCH

Chairperson's Remarks (Sponsorship Opportunity Available)2:25 pm

2:30 pm CO-PRESENTATION:

Empowering Users and Driving Adoption of Research Data Management Best Practices

Mark Jackson, Data Engineer, Data Science, Johnson & Johnson Innovative Medicine

Aleksandar Stojmirovic PhD, Director, Data Science, Johnson & Johnson Innovative Medicine

A core challenge when implementing scientific research data management workflows in large organizations is ensuring widespread adoption of best practices for organizing, annotating, and storing data. We developed the processes and supporting applications to promote user autonomy in managing translational data ingestion from diverse sources. Findability, Accessibility, Interoperability, and Reuse of ingested digital assets is ensured through automated pipelines that process and enrich their metadata and index it into integrated catalogs.

3:00 pm

Future-Proofing through Open Source and Data Management Policy

Terrell Russell, PhD, Executive Director, iRODS Consortium, Renaissance Computing Institute, University of North Carolina at Chapel Hill

The data management platforms being sold into the bio and pharmaceutical industries are expensive and incentivized to vertically integrate and capture the customer. Long-Term Data Management is best executed when policies are clear and infrastructure is abstracted and swappable. iRODS provides an open-source example of how this approach can be implemented to sustain FAIR data practices, consistency, and cost-savings across your enterprise.

3:30 pm

Decoding EU Privacy Legislation: Data Compliance Challenges and Solutions for Life Sciences

Robert Masson, CEO, The DPO Centre

The constantly evolving world of privacy and data protection is becoming increasingly complex. The laws about data sharing, gaining appropriate consent, and data ownership can create many challenges for organizations and cause project delays or even lead to clinical trial failure. This talk will draw on The DPO Centre’s extensive privacy experience across the spectrum of life sciences and healthcare, offering a comprehensive overview of these challenges and providing invaluable information with practical solutions for navigating the complexities of EU privacy regulations. Learn important insights and knowledge about current EU and UK privacy legislation and the implications for data processing, especially for research projects, whether for a pharmaceutical clinical trial, medical device or a biotech genomics study. This knowledge can help prevent project delays, avoid legal complications, and ultimately contribute to successful outcomes. Having a deeper understanding of the importance of data protection will not only benefit the audience but also contribute to the broader scientific community by promoting ethical and legally compliant research practices.

4:00 pm Building Data Management & Collaboration Technologies for Large Biomedical Research Organizations

John Wilbanks, Head of Product, Data Sciences Platform, Broad Institute

The Data Sciences Platform at the Broad Institute tackles the challenges of data discovery, data access, and data sharing, and has developed a suite of products that unify the research ecosystem. In this talk, we'll share how one solution, Terra on Azure, co-developed by the Broad Institute, Microsoft, and Verily, brings together the entire lifecycle of biomedical data science and helps to address these challenges.

Best of Show Awards Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)4:30 pm

Close of Day5:45 pm

Wednesday, April 17

Registration and Morning Coffee7:30 am

PLENARY KEYNOTE PROGRAM

8:00 am

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

8:05 am

Innovative Practices Awards

Joseph Cerro, Independent Consultant

John Conway, Chief Visioneer Officer, 20/15 Visioneers

Chris Dwan, Independent Consultant, Dwan, LLC

Allison Proffitt, Editorial Director, Bio-IT World and Clinical Research News

Since 2003, Bio-IT World has hosted an elite awards program with the goal of highlighting outstanding examples of how technology innovations and strategic initiatives are being applied to advance life sciences research. The 2024 Innovative Practices Awards winners represent excellence in innovation in the areas of informatics, pre-competitive collaboration, clinical and health IT, and genomics. Companies driving the winning entries include AstraZeneca, DNAnexus, Pistoia Alliance, Regeneron, Tempus, and UK Biobank.

8:20 am Plenary Keynote Introduction

Kshitij Kumar, Founder and CEO, Clovertex

8:30 am PLENARY KEYNOTE PRESENTATION:

Lights, Camera, Science: Film and Social Media Influence on Real-World Scientific Progress and Innovation

David Hewlett, Actor/Writer/Director; Creator, The Tech Bandits

Now, more than ever, life sciences are subject to misinterpretation, reduction, and inaccuracies at the hands of social media and Hollywood. And while it might be tempting to ignore the fake science streaming on YouTube and TikTok, there’s a generation of would-be investigators for whom those platforms might be their primary introduction to research and discovery. David Hewlett has had his share of big screen roles representing science—and science fiction—and he believes it’s imperative that the scientific and technology communities take back the narrative, filling gaps between what’s real and what could be real soon! He’s meeting this future generation where they are in schools, on YouTube, and on Twitch, championing real science in all its iterative, messy, exploratory glory, to recruit bright, diverse minds to lead the next generation of real scientists. He’s got our report from the front lines.

Coffee Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)9:45 am

Organizer's Remarks10:30 am

SCALABLE SOLUTIONS FOR MANAGING DIVERSE MULTIMODAL DATA

Chairperson's Remarks (Sponsorship Opportunity Available)10:35 am

10:40 am The Future of Decentralized Data

Karl Gutwin, Principal Consultant, BioTeam, Inc.

The web is experiencing a renewal of innovative, open platforms that are replacing the "walled gardens" that defined user experiences over the past two decades. This talk explores these trends in the context of scientific data, uncovering assumptions that have hindered the free flow of data. We will discuss both time-tested and novel approaches to building decentralized data mesh architectures that promote interoperability, reusability, and provenance within a data ecosystem.

11:10 am PANEL DISCUSSION:

Data Readiness for AI

PANEL MODERATOR:

Santha Ramakrishnan, PhD, Vice President, Head, R&D Data Strategy and Governance, Bayer Pharmaceuticals

Everyone is gearing up for AI but is their data ready for it? This panel will explore the key aspects to plan for in making data ready for AI.

PANELISTS:

Jesse Johnson, PhD, Founder/Principal, Merelogic

Shameer Khader, PhD, Executive Director, Global Head of Data Science, Data Engineering and Computational Biology, Sanofi

Gian Prakash, Director, Data Engineering, Information Research, AbbVie, Inc.

Jay Schuren, PhD, Chief Customer Officer, GM Generative AI, DataRobot, Inc.

Siping Wang, Founder, President, & Chief Technology Officer, Tetra Science, Inc.

12:10 pm Catalyzing Data Management in Life Sciences: Empowering R&D through Innovative Unified Digital Platforms

Brandon Varela, Principal - Scientific Software Products, Product Development, L7 Informatics

Improvements in instrument proficiency from high frequency measurements to automated high throughput capabilities have led to an exponential increase in the data volumes generated in life sciences. However, leveraging data into actionable insight is impossible without contextualization. The presentation will discuss how unified digital platforms break down data silos to streamline R&D processes, accelerate research, drug discovery, and decision-making, optimizing efficiency and cost-effectiveness.

12:40 pm Enhancing Precision Health Data Discoverability & Usability: The Collaborative Efforts of DNAnexus and Panomics

Nirav Amin, PhD, Director, Solutions Science, Solutions Science, DNAnexus

Explore how to empower researchers by streamlining data cataloging and ensuring adherence to FAIR (Findable, Accessible, Interoperable, and Reusable) principles and GxP compliance. Through case studies, we demonstrate the transformative impact of technologies on data management, emphasizing enhanced accessibility, interoperability, and innovation in biopharmaceutical R&D.
Delve into the collaborative efforts of DNAnexus and Panomics, showcasing their solutions for managing, analyzing and collaborating on multimodal data nested under the safeguards of TREs.

Session Break & Transition to Lunch1:10 pm

1:20 pm LUNCHEON PRESENTATION:Empowering Scientific Excellence: Revolutionizing Lab Workflows with ZONTAL Operations

Christof Gaenzler, PhD, Director PreSales and Product Marketing, ZONTAL GmbH

Introducing ZONTAL Operations, the latest innovation built upon the robust foundation of the ZONTAL platform. Designed to streamline repetitive lab operations, ZONTAL Operations offers a comprehensive solution that simplifies experiment integration, enhances data management, and ensures compliance with regulatory standards. Enrichment with metadata from third party systems and the establishment of a FAIR data catalog adds to secondary data and project analysis.

Refreshment Break in the Exhibit Hall with Last Chance Poster Viewing (Sponsorship Opportunity Available)1:50 pm

TRENDS FROM THE TRENCHES

Chairperson's Remarks (Sponsorship Opportunity Available)2:30 pm

2:35 pm

Trends from the Trenches

Ari E. Berman, PhD, CEO, BioTeam, Inc.

Laura Boykin Okalebo, PhD, Senior Scientific Consultant, BioTeam, Inc.

Since 2010, “Trends from the Trenches” has been one of the most popular annual traditions in the Bio-IT program. The intent of the talk is to deliver a candid (and occasionally blunt) assessment of the best, the most worthwhile, and the most overhyped information technologies (IT) for life sciences. Learn about computing, storage, data transfer, networks, cloud, data science, machine learning, and more that are involved in supporting data-intensive science.

Close of Conference4:05 pm






Ways to Participate

Conference Tracks

Data Platforms & Storage Infrastructure