|Articles|September 29, 2017

How 100K Chest X-Ray Images Could Improve AI and Patient Health

The National Institutes of Health says it’s one of the largest data sets ever made public.

In the 24 hours since it went live this week, one of the largest-ever publicly available data sets had racked up dozens of downloads. Each click represented not only a curious researcher, but also a chance to unlock the data’s potential.

Comprising more than 100,000 chest X-ray images, the collection could improve artificial intelligence, diagnoses, and global heath, Ronald M. Summers, who led the effort, told Healthcare Analytics News™. The project came out of a radiology department at the National Institutes of Health’ Clinical Center, where Summers works as a senior investigator in an imaging and computer-aided diagnosis lab.

“Those folks who are trying to us AI for healthcare, they are starved for data sets,” Summers said. “We need really big data sets to train these latest deep-learning systems that are all the rage.”

A decade or 2 ago, impressive data sets consisted of several thousand images, he said. While valuable, groups of that size offer little to AI compared to sets with more than 100,000.

With that sort of bulk, AI can both learn and teach. Summers pointed to two similar endeavors—one on retinal photographs and the other on skin lesions—that broke ground on preventing blindness in people with diabetes and identifying skin cancer, respectively.

Hope for similarly lofty goals exist here.

As academic and research institutions get their hands on the data, they will teach computers to read and process the data, according to the NIH.

Then it may be used to pinpoint slow changes over a series of X-rays, which could otherwise go unnoticed, Summers said. AI may also help patients in developing countries, where the technology is available, but the radiologists who know how to read these images aren’t, he added. The effort could even spur the establishment of a “virtual radiology resident” that might be taught to read other types of images down the road.

It took more than a year for Summers and his team to get to this point.

They compiled the X-ray images from more than 30,000 patients, including many with advanced lung disease, at the NIH Clinical Center. Then Summers used natural language processing to extract information from corresponding radiology reports, he said.

Privacy was a big concern. The researchers removed each header, which contain patient information, and then two people manually reviewed every image, Summers said.

“I really needed to feel confident that the data were properly scrubbed,” he said.

Stay ahead of the evolving healthcare landscape with expert insights on leadership, operations, policy, innovation, and workforce strategy. Subscribe to Chief Healthcare Executive today.

How 100K Chest X-Ray Images Could Improve AI and Patient Health

Related Content

Changes: The hospital’s mission has gone beyond medicine

Intermountain Health plans $1.15B deal to expand presence in Idaho

Chief Healthcare Executive Roundtable: The struggle to retain staff

The workforce: Burnout, retention and where AI fits

Six ways to augment payer, pharmacy benefit manager and drug manufacturer partnerships | Viewpoint

Latest CME

Breast Cancer Tumor Board: Targeting TROP2 – Innovations in Triple-Negative Breast Cancer Treatment

Expert Guidance on Frequently Asked Questions Regarding the Use of ADCs in TNBC

Evaluating the Latest Data and Ongoing Trials for Novel ADC Approaches in TNBC

Establishing the Rationale for ADC and ICI Combinations in TNBC

Breaking Down the Rationale for Targeting TROP2 in TNBC

Dissecting Clinical Trial and Real-World Data for ADCs in TNBC

Breaking Down the Latest Clinical Data for First-line Maintenance and R/R SCLC

Cross-Disease Integration of Immunotherapy Innovations

Broadening the Frontline—Studies Informing the Use of Immunotherapy in Hepatocellular Carcinoma

Optimizing Treatment for Biliary Tract Cancers

PER Resource Center: Integrating Novel Approaches in TNBC – New Avenues for TROP2-Targeting ADCs and Beyond – Nursing

Practical Considerations and Future Directions for New Treatment Strategies in SCLC

Expert Roundtable and Panel Discussions: Current and Future Landscape of TNBC

Show Me the Data®: New and Emerging Roles for Oral SERD Therapy in the Treatment of ER+/HER2– Breast Cancer

Navigating Treatment Gaps in SCLC: Relapse, Resistance, and Need for New Options

Medical Crossfire® in Adjunctive Testing: Charting a New Course in Prostate Cancer Risk Assessment

BURST CME™ Resource Center: Integrating Novel PSMA-Directed Radioligand Approaches for Diagnosis and Management of Prostate Cancer

Radioligand Therapy 101: The Science Behind the Strategy

Ready for Radioligand Therapy? Patient Selection and Sequencing Simplified

Working Together: Overcoming Barriers to Optimize Outcomes in Patients Treated With Radioligand Therapy Through Multidisciplinary Care

Imaging Matters: Decoding PSMA PET for Better Decision-Making

A New Era of Targeted Therapy for Advanced NSCLC: Exploring Future Directions for Bispecific Antibodies and ADCs

Community Practice Connections™: Enhancing Melanoma Outcomes With Intratumoral Oncolytic Immunotherapy–Strategies for the Multidisciplinary Team

Advances in Managing EGFR-Mutant NSCLC: Applying Evidence Across the Disease Continuum

Navigating Advances in Neovascular Retinal Disease: Translating Evidence to Practice in AMD, DME, and RVO

Enhancing Prostate Cancer Outcomes – The Role of PSMA and Targeted Treatment Strategies

(CME Track) Antibody–Drug Conjugates in Oncology: The Essentials of AE Management for Better Patient Outcomes

Community Practice Connections™: Optimizing SCLC Treatment Strategies and Managing Adverse Events Across Disease Stages

Personalized Approaches in NSCLC: Early Detection, Molecular Testing, and Targeted Therapies

9th Annual School of Nursing Oncology™

Community Practice Connections™: DLL3-Targeting Bispecific Antibodies for Small Cell Lung Cancer—From Innovation to Practice

Hot Seat: How Experts Are Integrating the Latest Practice-Changing Data Into Their Breast Cancer Clinics

Cases and Conversations™: Transforming Small Cell Lung Cancer Treatment Through Emerging Evidence and Expert Insights

Biomarker Testing in HER2+ GEA: Diagnosis and Treatment Implications

Navigating the Adverse Event Landscape in HER2+ GEA Therapy

Hot Seat: Converging Lines in the Management of RAS-Altered Cancers

(CME Track) Tackling Oncologic Emergencies in Patients Treated With High-Dose Methotrexate

Cases & Conversations™: Unmasking Epithelioid Sarcoma – Enhancing Early Diagnosis and Multidisciplinary Care

Expert Illustrations & Commentaries: Translating the Science of Bispecific Antibodies in Solid Tumors – From Mechanisms to Emerging Data

SimulatEd™: A Roadmap to Personalized Care Plans and Shared Decision-Making in Low-Grade Serous Ovarian Cancer

The Rise of Novel HER2-Targeting Therapies in GEA: Mechanisms and Clinical Data

Show Me the Data™: Personalizing First-Line and Maintenance Therapy in HER2+ Metastatic Breast Cancer to Extend Survival and Elevate Quality of Life

Medical Crossfire®: The Who, When, and How of TROP2-Targeting ADCs, ICIs, and PARP inhibition in Triple-Negative Breast Cancer

Optimizing Multidisciplinary Care in TGCT

Revolutionizing TGCT Care with Multidisciplinary Perspectives and Cutting-Edge Targeted Therapies

From Frontline to Heavily Pretreated HR+/HER2- Metastatic Breast Cancer: Expert Perspectives on Optimizing the Expanding Treatment Armamentarium

Beyond Primary End Points: Digging Into Randomized and Real-World Data to Guide Challenging Treatment Decisions in HR+/HER2− Metastatic Breast Cancer

Diagnosis and Management of TGCT

Trending on Chief Healthcare Executive

Changes: The hospital’s mission has gone beyond medicine

Intermountain Health plans $1.15B deal to expand presence in Idaho

Chief Healthcare Executive Roundtable: The struggle to retain staff

The workforce: Burnout, retention and where AI fits

Six ways to augment payer, pharmacy benefit manager and drug manufacturer partnerships | Viewpoint