Modern Data Platforms and Storage Infrastructure

Design, Deploy, and Oversee Data Storage Solutions Optimized for Speed, Performance, and Cost Efficiency

April 16 - 17, 2024 ALL TIMES EST

Is the challenge of managing your data growing daily? Do you possess a flexible and robust data management framework to effectively store, process, analyze, transfer, and safeguard vast data volumes in alignment with your organizational guidelines? Are you familiar with balancing availability and interoperability? What strategies exist for scalable distributed and federated data analytics? How are you navigating the discussions surrounding speed, performance, and cost trade-offs? Which vendors are the most suitable for your needs? How do you assess the strengths and weaknesses of technological solutions? Which data storage methods and types prove to be efficient? Considerable strides have been taken by pioneering organizations in advancing large-scale data management, encompassing storage platforms, integration and migration strategies, and governance. The Modern Data Platforms and Storage Infrastructure track explores these inquiries and shares best practices derived from these endeavors.

Monday, April 15

8:00 am Recommended Pre-Conference Workshops and Symposia*

On Monday, April 15, 2024, Cambridge Healthtech Institute is pleased to offer eight pre-conference Workshops scheduled across three time slots (8:00–10:00 am, 10:30 am–12:30 pm, and 2:00–4:00 pm) and six Symposia from 8:00 am to 4:20 pm. All are designed to be instructional and interactive and to provide in-depth information on a specific topic. They allow for one-on-one interaction and provide a great way to explain more technical aspects that would otherwise not be covered during the main conference tracks, which take place Tuesday–Wednesday.

*Separate registration required for the Symposia and the Workshops.

PLENARY KEYNOTE PROGRAM

4:30 pm

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

4:35 pm Plenary Keynote Introduction

Greg Mazzu, Regional Sales Manager, WEKA

4:45 pm PLENARY KEYNOTE PRESENTATION:

Unleashing the Power of Advanced Computing in Biomedical Informatics: A Vision for Transformative Collaboration

Daniel Stanzione, PhD, Executive Director, Texas Advanced Computing Center (TACC)

In the dynamic intersection of life science and computing, our mission at the Texas Advanced Computing Center (TACC) is to propel biomedical informatics into a new era of discovery and innovation. As computational leaders, we are dedicated to harnessing the potential of high-performance computing (HPC), machine learning (ML), and data analytics to revolutionize medicine. In this visionary pursuit, we prioritize the development of user-friendly interfaces and intuitive platforms. This approach ensures accessibility for executives and leaders in the life sciences industry, promoting seamless interaction with computational tools and fostering an environment where scientific and technological advancements coalesce. This presentation shares our vision for shaping the future of biomedical informatics where innovation, collaboration, and cutting-edge technologies converge to redefine the boundaries of what is possible in the realm of medicine.

6:00 pm Welcome Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

7:15 pm Close of Day

Tuesday, April 16

7:00 am Registration and Morning Coffee

PLENARY KEYNOTE PROGRAM

8:00 am

Organizer's Remarks

Allison Proffitt, Editorial Director, Bio-IT World and Clinical Research News

8:05 am Plenary Keynote Introduction

Josh Bond, Head of Product Management, Product Management, Revvity Signals

8:15 am PLENARY KEYNOTE PRESENTATION:

Unveiling Tomorrow's Possibilities: Embrace the Power of Digital Twins in Cancer Care and Research

Caroline Chung, MD, MSc, FRCPC, CIP, Vice President, Chief Data Officer, Director of Data Science Development & Implementation, Institute for Data Science in Oncology, MD Anderson Cancer Center

Explore the transformative potential of digital twins in revolutionizing cancer care and research. Gain insights into how digital twins can help deepen biological understanding, accelerate drug discovery, and personalize therapeutic strategies to optimize treatment outcomes for every individual. Amidst the exciting opportunities are the challenges that must be tackled to harness the power of digital twins to advance precision oncology, empower researchers and clinicians with unprecedented insights, and improve patient outcomes.

9:30 am Coffee Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

10:15 am Organizer's Welcome Remarks

FOUNDATIONS OF MODERN DATA PLATFORMS: ARCHITECTURE AND DESIGN PRINCIPLES

10:20 am Chairperson's Remarks (Sponsorship Opportunity Available)

10:25 am

The Promise and Evolution of an Integrated Data Platform

Peng Cheng Zhang, PhD, Technical Associate Director Scientific Products, Integrated Data and Insights/SDP/NX, Novartis Institutes for BioMedical Research, Inc.

While the promise of an all-encompassing and integrated data platform may be clear to the community, execution and value generation never follow a straight line. Faced with decisions from the past and rapid changes in data and technology, a modern data platform must continue to evolve and adapt based on the needs of its users.

10:55 am CO-PRESENTATION:

The IDMP Ontology: Collaborative Implementation of the IDMP Standards in Pharma

J Christian Baber, PhD, Chief Portfolio Officer, Pistoia Alliance

Sheila Elz, Master Data Manager, Bayer AG

Vada A. Perkins, DrSc, MSc, Vice President, Global Head of Regulatory Intelligence & Policy, Boehringer Ingelheim

The Identification of Medicinal Products (IDMP) Ontology is a cross-industry collaboration that shifts the model from reactive to proactive, pharma-driven development of industry standards, working together with ISO standards authors and regulatory agencies. It provides a universal product data model as the backbone for the pharmaceutical industry, enabling patient safety. Thus, the IDMP Ontology bridges the gap between diverse perspectives on medicinal products through an innovative, ontological approach.

11:55 am Revolutionizing the Scientific Experience through Enhanced Connectivity

William Goodman, Senior Director, Product Management, Digital Solutions, Thermo Fisher Scientific

The scientific landscape is continuously evolving, with researchers and scientists seeking innovative ways to accelerate their discoveries and drive productivity. In this digital age, enhanced connectivity has emerged as a powerful tool to revolutionize the scientific experience. By harnessing the potential of advanced technologies and innovative software, researchers can unlock discovery, accelerate progress and drive productivity to unprecedented levels. 

12:25 pm How Are AI Fabrics Different from Data Center Fabrics?

Paul Gilbert, AI/HPC Technical Lead, Arista Networks

The impact AI is having and will continue to have on life sciences is undeniable. AI-driven applications are revolutionizing the life sciences industry, with real-world examples of accelerated drug discovery, clinical trial matching systems, and disease prediction technology among the successes we are seeing today. How are the AI networks that support these applications being built? What is needed to build and run them? When do you build your own AI system rather than continue with cloud? What do you need to look out for? We will give a comprehensive overview of how these AI network fabrics are built, their capabilities, and what is required to get started.

12:55 pm Session Break & Transition to Lunch

1:05 pm LUNCHEON PRESENTATION: Unified Data Access, Management, and Orchestration across Any Storage, Any Data Center, Any Cloud, Anywhere

Adam Marko, Field CTO-Life Sciences, Sales, Hammerspace

Effectively managing data across multiple locations and clouds, while ensuring performant access, security, and compliance is a big challenge facing life science organizations. Distributed and decentralized environments coupled with the convergence of HPC and AI require a new approach to data and storage architectures. This presentation will focus on how Hammerspace enables life sciences organizations to break down data silos, streamline collaboration, and unlock the full potential of their data assets.

1:35 pm Refreshment Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

SCALABILITY AND PERFORMANCE OPTIMIZATION STRATEGIES IN CLOUD COMPUTING

2:25 pm

Chairperson's Remarks

John Damask, Vice President, Data & Systems Engineering, Flagship Pioneering

2:30 pm

Life Science Organizations in the Cloud—SNAFUs, FUBARs, and OMG Moments

John Damask, Vice President, Data & Systems Engineering, Flagship Pioneering

This talk will provide some balance to the abundance of cloud adoption success stories. We’ll explore real-world examples from start-ups and big pharma of things that didn’t go quite as expected. Some stories may be familiar, some not, but they all contribute to our understanding of how life science organizations can make the best use of the cloud. The experiences presented are with AWS but generalizable to other cloud providers.

3:00 pm PANEL DISCUSSION:

Digital Leadership Lessons: Reflecting and Correcting

PANEL MODERATOR:

John Damask, Vice President, Data & Systems Engineering, Flagship Pioneering

This discussion will convene executive leaders to share their experiences about technology, data, and cultural decisions that led to surprising outcomes. From unanticipated obstacles to valuable lessons learned, we'll delve into the real stories that shaped the journey of start-ups and big companies alike. Join us for a light-hearted conversation about serious topics.

PANELISTS:

Parul Bordia Doshi, Chief Data Officer, Cellarity

Rodney Marable, Executive Director, Scientific Computing & Informatics, Flare Therapeutics

Rania Khalaf, PhD, Chief Information and Data Officer, Inari

Eric Zimmerman, Principal Healthcare & Life Sciences BD, Venture Capital & Startups, Amazon Web Services, Inc. (AWS)

4:00 pm How NaaS Drives Multi-Cloud Healthcare IT Infrastructure

Neil Velandria, Associate Vice President of Sales - Americas Enterprise, Sales, Console Connect

The public internet is a mishmash of networks from different operators, and service is best-effort rather than guaranteed, meaning you have no control over latency, jitter, or pathing.
Console Connect provides private, secure, and direct connectivity to all major cloud providers and gives you greater visibility into your network traffic.

4:30 pm Best of Show Awards Reception in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

5:45 pm Close of Day

Wednesday, April 17

7:30 am Registration and Morning Coffee

PLENARY KEYNOTE PROGRAM

8:00 am

Organizer's Remarks

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

8:05 am

Innovative Practices Awards

Joseph Cerro, Independent Consultant

John Conway, Chief Visioneer Officer, 20/15 Visioneers

Chris Dwan, Independent Consultant, Dwan, LLC

Allison Proffitt, Editorial Director, Bio-IT World and Clinical Research News

Since 2003, Bio-IT World has hosted an elite awards program with the goal of highlighting outstanding examples of how technology innovations and strategic initiatives are being applied to advance life sciences research. The 2024 Innovative Practices Awards winners represent excellence in innovation in the areas of informatics, pre-competitive collaboration, clinical and health IT, and genomics. Companies driving the winning entries include AstraZeneca, DNAnexus, Pistoia Alliance, Regeneron, Tempus, and UK Biobank.

8:20 am Plenary Keynote Introduction

Kshitij Kumar, Founder and CEO, Clovertex

8:30 am PLENARY KEYNOTE PRESENTATION:

Lights, Camera, Science: Film and Social Media Influence on Real-World Scientific Progress and Innovation

David Hewlett, Actor/Writer/Director; Creator, The Tech Bandits

Now, more than ever, life sciences are subject to misinterpretation, reduction, and inaccuracies at the hands of social media and Hollywood. And while it might be tempting to ignore the fake science streaming on YouTube and TikTok, there’s a generation of would-be investigators for whom those platforms might be their primary introduction to research and discovery. David Hewlett has had his share of big screen roles representing science—and science fiction—and he believes it’s imperative that the scientific and technology communities take back the narrative, filling gaps between what’s real and what could be real soon! He’s meeting this future generation where they are in schools, on YouTube, and on Twitch, championing real science in all its iterative, messy, exploratory glory, to recruit bright, diverse minds to lead the next generation of real scientists. He’s got our report from the front lines.

9:45 am Coffee Break in the Exhibit Hall with Poster Viewing (Sponsorship Opportunity Available)

10:30 am Organizer's Remarks

BEST PRACTICES IN TECHNOLOGY INNOVATION

10:35 am Chairperson's Remarks (Sponsorship Opportunity Available)

10:38 am

Innovative Practices Awards: Excellence in Technological Innovation

Cindy Crowninshield, Executive Event Director, Cambridge Healthtech Institute

Winners of the Innovative Practices Awards will present their solutions, revealing the keys to their programs' success. This exclusive session provides a unique opportunity to gain insights into why our judges selected these entries as the best of the best.

10:40 am

Identification of Medicinal Products (IDMP) Ontology (Innovative Practices Award Winner: Pre-competitive Collaboration Category)

J Christian Baber, PhD, Chief Portfolio Officer, Pistoia Alliance

Sheila Elz, Master Data Manager, Bayer AG

Identification of Medicinal Products (IDMP) implementation varies greatly across different organizations and regulatory jurisdictions, impacting drug safety and pharmacovigilance. The Pistoia Alliance has built an IDMP Ontology to enable deep, semantic interoperability based on FAIR (Findable, Accessible, Interoperable, Reusable) data principles, augmenting existing ISO IDMP standards set by the European Medicines Agency. Using its framework for open innovation, the Pistoia Alliance brought together 11 pharmaceutical companies (Bayer, Novartis, GSK, Roche, Merck KGaA, Boehringer Ingelheim, Johnson & Johnson, AstraZeneca, Amgen, AbbVie, and Pfizer) and representatives from EDM Council, ACCURIDS, OSTHUS, and Chemantics. This collaborative innovation has reduced duplicated efforts and has led to the development of a new IDMP Ontology (IDMP-O Release 1), freely available under an open-source license. The project makes it possible for everybody to benefit from IDMP standardization, improving pharmacovigilance, enabling cross-border prescriptions, and helping prevent medication shortages through interoperability with manufacturers.

10:58 am

500k Whole-Genome Sequencing Data Release on UK Biobank Research Analysis Platform (Powered by DNAnexus) (Innovative Practices Awards Winner: Global Impact Award Category)

Asha Collins, PhD, Senior Vice President & General Manager, Biobanks, DNAnexus

In November 2023, UK Biobank unveiled incredible new whole genome sequencing data from all its 500,000 participants – and, in a landmark for medical research, it has made the data globally available via a purpose-built, cloud-based platform. After five years and over £200 million of investment, this was the most ambitious genetic sequencing project of its kind ever undertaken. The abundance of genomic data is unparalleled, but its real value comes from being combined with the existing wealth of data UK Biobank has collected from its participants over the past 15 years on their health and lifestyle, and from whole body imaging scans and proteins found in the blood. Securely sharing this amount of complex health data, over 30 petabytes, had not been done before. DNAnexus and UK Biobank built an online research analysis platform allowing approved, global researchers to access the secure data, and also gave them the required tools to analyze the de-identified data. Today, over 30,000 researchers from more than 90 countries are registered to use UK Biobank. 

11:16 am

Automated High-Throughput Flow Cytometry (HTFC) Data Processing Pipeline (Innovative Practices Awards Winner: Informatics to Achieve Operational Excellence Category)

Ronald Realubit, Principal Business Analyst (Therapeutic Ab Development), Regeneron Pharmaceuticals, Inc.

Regeneron created a high-throughput screening platform that allows our scientists to test multiple conditions simultaneously with a highly sensitive assay technology for antibody characterization. The Automated High-Throughput Flow Cytometry (HTFC) Data Processing Pipeline supports this platform by replacing manual scientist data workflows with a combination of engineering, automation, and self-service. Built on a foundation of existing applications and tools in our IT ecosystem, this pipeline is driven by configuration files created automatically by the scientists themselves through interactive dashboards. As a result, our scientists focus on the biological relevance of experiments while modified data processing scripts and metadata tagging workflows run in the background. The pipeline automates complex data processing steps while also delivering a more pleasant and effective quality control experience. This project allows Regeneron to choose a complex data-rich assay for routine screenings of our antibodies and accelerate lead antibody discovery for Regeneron to find new medicines.

11:34 am

TIDES: Transforming Information with Digital Experimental Solutions (Innovative Practices Awards Winner: Informatics to Achieve Operational Excellence Category)

Kristian Kolakowski, Scientific Business Analyst, Regeneron Pharmaceuticals, Inc.

Patrick Leblanc, Director Business Relationship Management, Research & Preclinical Development IT, Regeneron Pharmaceuticals, Inc.

Regeneron scientists partnered with IT engineers to implement a voice-to-text (VTT) solution, LabVoice, to solve an industry-wide challenge—digitizing scientific information in lab environments—and went a step further by creating custom analytics dashboards that provide live, actionable insights based on the data. We began by implementing custom low-code workflows on our scientists’ mobile devices, eliminating the need for memorization and transcription, while standardizing and optimizing our processes for data quality and searchability. As of today, LabVoice has effectively captured more than 174,000 data points, both automated and manual. The data is used to understand the number of new litters, test subject attributes, and natural breeding trends across facilities. Ultimately, our solution has delivered hands-free data capture for our research scientists while providing key insights from the data.

11:52 am

Regeneron Protein 3D Structure Prediction (Protein Folding) (Innovative Practices Awards Winner: Informatics to Achieve Operational Excellence Category)

Cuie Hu, Director, Digital Automation and Innovations (DA&I), Digital Transformation and Engineering (DTE), Regeneron Pharmaceuticals

Because proteins are involved in so many of our physiological processes (including tissue repair, virus-fighting, and nutrient transportation), understanding their 3D structures or how they interact with each other is critical in developing new drugs with specific targets. Regeneron computational scientists and engineers partnered to transform how we predict and understand protein folding and protein-protein interactions by leveraging our internal cloud computational platform. Now, our researchers use AI to deliver 3D renderings of protein-protein interactions, in parallel, and at large scale—effectively empowering us to speed up drug discovery.

12:10 pm Building Data Factories to Accelerate Discovery

Michael Hopkins, Principal Product Manager, Product Management, HighRes Biosolutions

Harnessing the power of modern data platforms is key to scientific efficiency and innovation. In this presentation, we will outline the technical aspects of building a modern data factory to accelerate scientific discovery through the examination of case studies and industry trends. By combining optimal workflows, enhanced collaboration, and high-quality data assets, laboratories with automation can truly become ecosystems of innovation.

12:40 pm Accelerating Cancer Research with Powerful AI Infrastructure: Fireside Chat with Eikon Therapeutics and Quantum

Andy LeSage, Vice President WW Solutions Engineering, Quantum Corporation

John Leonardini, Principal Storage Engineer, Eikon Therapeutics

Eikon Therapeutics conducts groundbreaking research that generates massive amounts of data, which is then leveraged by powerful AI workflows. This data requires extreme performance to capture and process insights and a large-scale archive to store the outcomes and raw data for future use. Eikon and Quantum will discuss how technology plays a critical role in Eikon’s research, and the requirements for an end-to-end AI infrastructure to further their mission.

12:55 pm Bringing Together Best of Breed Technologies to Revolutionize Science

Christopher Botka, CTO, Healthcare & Life Sciences, Unstructured Data Solutions, Dell Technologies

Artificial intelligence is revolutionizing healthcare and life sciences – enhancing and accelerating medical imaging, genomics, and drug discovery. NVIDIA's advanced GPUs and powerful software solutions accelerate AI applications, enabling faster analysis of complex medical and research data, while Dell Technologies’ powerful IT infrastructure ensures seamless integration, performance, and security. Join us to learn how together we can drive breakthroughs in personalized medicine, disease detection, and treatment and advance scientific research.

1:10 pm Session Break & Transition to Lunch

1:20 pm LUNCHEON PRESENTATION: Performing Genomics & Privacy Preserved Analytics with Snowflake, a View from Illumina, Immuta and Gatehouse Bio

Harini Gopalakrishnan, Field CTO, Healthcare & Life Sciences, Snowflake

Rami Mehio, Head of Global Software and Informatics, Illumina

Neal Foster, Co-Founder & Chief Business and Technology Officer, Gatehouse Bio

Lisa Arbogast, Industry Principal, Life Sciences, Snowflake

Matthew Carroll, Co-Founder & CEO, Immuta

How does Gatehouse Bio analyze millions of small noncoding RNA features to understand their roles in diseases? How does Illumina manage its end-to-end data transformations within Snowflake, and what are its ambitions for the future? What does Immuta offer in terms of privacy preservation, and how does this complement Snowflake? This panel brings Illumina, Gatehouse Bio, and Immuta together to discuss how Snowflake is leveraged in managing scientific data analysis.

1:50 pm Refreshment Break in the Exhibit Hall with Last Chance Poster Viewing (Sponsorship Opportunity Available)

TRENDS FROM THE TRENCHES

2:30 pm Chairperson's Remarks (Sponsorship Opportunity Available)

2:35 pm

Trends from the Trenches

Ari E. Berman, PhD, CEO, BioTeam, Inc.

Laura Boykin Okalebo, PhD, Senior Scientific Consultant, BioTeam, Inc.

Since 2010, “Trends from the Trenches” has been one of the most popular annual traditions in the Bio-IT program. The intent of the talk is to deliver a candid (and occasionally blunt) assessment of the best, the most worthwhile, and the most overhyped information technologies (IT) for life sciences. Learn about computing, storage, data transfer, networks, cloud, data science, machine learning, and more that are involved in supporting data-intensive science.

4:05 pm Close of Conference





