|Articles|October 31, 2018

How Machine Learning Could Detect Medicare Fraud

Researchers found that a random forest learning algorithm was most effective at detecting possible Medicare fraud.

^{Machine learning could become a new weapon in the fight against Medicare fraud.}

Machine learning could become a useful tool in helping to detect Medicare fraud, according to a new study, potentially reclaiming anywhere from $19 billion to $65 billion lost to fraud each year.

Researchers from Florida Atlantic University’s College of Engineering and Computer Science recently published the world’s first study using Medicare Part B data, machine learning and advanced analytics to automate fraud detection. They tested six different machine learners on balanced and imbalanced data sets, ultimately finding the RF100 random forest algorithm to be most effective at identifying possible instances of fraud. They also found that imbalanced data sets are more preferable than balanced data sets when scanning for fraud.

>> READ: In the Digital World, Health Insurance Cards Remain Analog

“There are so many intricacies involved in determining what is fraud and what is not fraud, such as clerical error,” Richard A. Bauder, senior author and a Ph.D. student at the school, said. “Our goal is to enable machine learners to cull through all of this data and flag anything suspicious. Then we can alert investigators and auditors, who will only have to focus on 50 cases instead of 500 cases or more.”

In the study, Bauder and colleagues examined Medicare Part B data from 2012 to 2015, which held 37 million cases, for instances such as patient abuse, neglect and billing for medical services that never occurred. The team narrowed the data set to 3.7 million cases, a number that would still represent a challenge for human investigators who are typically charged with pinpointing Medicare fraud.

The authors used the National Provider Identifier — a unique ID number issued by the government to healthcare providers — to match fraud labels to Medicare Part B data, which comprised provider details, payment and charge information, procedure codes, total procedures performed and medical specialty.

When researchers matched the NPI to the Medicare data, they flagged potentially fraudulent providers in a separate database. How?

“If we can predict a physician’s specialty accurately based on our statistical analyses, then we could potentially find unusual physician behaviors and flag these as possible fraud for further investigation,” Taghi M. Khoshgoftaar, Ph.D., co-author and a professor at the school, said.

So, if a cardiologist were incorrectly labeled a neurologist, that could be a sign of fraud.

Still, the data set itself remained a challenge. The small number of fraudulent providers and the large number of above-board providers made the data set imbalanced, which can fool machine learners. So, using random undersampling, investigators whittled down the set to 12,000 cases, with seven class distributions ranging from severely imbalanced to balanced.

From there, they unleashed their learners and reached their results regarding random forest and class distribution.

Surprisingly, researchers found that keeping the data set 90 percent normal and 10 percent fraudulent was the “sweet spot” for machine-learning algorithms tasked with identifying Medicare fraud. They thought the ratio would need to include more fraudulent providers for the learners to be effective.

A dean at the college of engineering said these machine-learning detection tools could become a “game changer” for Medicare fraud detection.

The journal Health Information Science and Systems published the study.

Get the best insights in healthcare analytics directly to your inbox.

Does Your Healthcare Organization Have the Chops for Machine Learning?

Threats to Health Data Often Come from Inside, Report Finds

Subscribe Now!

Latest CME

In-Person Event

20th Annual New York Lung Cancers Symposium®

November 15, 2025

How Machine Learning Could Detect Medicare Fraud

Newsletter

Related Content

As shutdown reaches one month, health systems see impact

Encouraging patients with breast cancer: ‘Don’t steal hope’

States vie for funds in $50B rural health program, and the deadline is coming

Strategies for the $50B rural health fund

Three Pennsylvania hospitals are close to being sold, again

Latest CME

20th Annual New York Lung Cancers Symposium®

PER® Liver Cancer Tumor Board: How Do Evolving Data for Immune-Based Strategies in Resectable and Unresectable HCC Impact Multidisciplinary Patient Management Today… and Tomorrow?

Community Practice Connections™: 6th Annual Precision Medicine Symposium – An Illustrated Tumor Board

Advances In™: Taking R/R B-Cell ALL Management to the Next Level With New CAR T Approval

Navigating Low-Grade Serous Ovarian Cancer – Enhancing Diagnosis, Sequencing Therapy, and Contextualizing Novel Advances

Cases & Conversations™: Integrating Novel Approaches to Treatment in First-line ALK+ mNSCLC – Enhancing Patient Outcomes with Real World Multidisciplinary Strategies

Burst CME™: Implementing Appropriate Recognition and Diagnosis of Low-Grade Serous Ovarian Cancer

Burst CME™: Understanding Novel Advances in LGSOC—A Focus on New Mechanisms of Action and Clinical Trials

Burst CME™: Stratifying Therapy Sequencing for LGSOC and Evaluating the Unmet Needs of the Standard of Care

Burst CME™: How is the Newly Approved CAR T-Cell Therapy Impacting R/R B-Cell ALL Management?

Community Practice Connections™: Case Discussions in TNBC… Navigating the Latest Advances and Impact of Disparities in Care

Epithelioid Sarcoma: Applying Clinical Updates to Real Patient Cases

Collaborating Across the Continuum®: Identifying and Treating Epithelioid Sarcoma

Mastering Epithelioid Sarcoma: Enhancing Diagnostic Precision and Tailoring Treatment Strategies

Clinical Showcase™: Selecting the Best Next Steps for a Patient with Epithelioid Sarcoma

Brain Mets: Brain & Spine Metastases Research and Emerging Therapy Conference

2nd Annual Hawaii Cancer Conference

Medical Crossfire®: Bridging Evidence to Practice in AML…Updates on FLT3, IDH1/2, Maintenance, Combos, and Clinical Trials

A Breath of Strength: Managing Cancer Associated LEMS and Lung Cancer as One

Show Me the Data™: Bridging Clinical Gaps Along the Continuum From Resectable, Early Stage to Advanced Gastric/Gastroesophageal Junction Cancers

Striking the Right Nerve: Managing Cancer Associated LEMS in Lung Cancer Patients

19th Annual New York GU Cancers Congress™

Medical Crossfire®: Expert Interpretations of the Latest Data in CLL Management – Understanding the Impact of Optimal Treatment Selection on Patient Outcomes

Virtual Testing Board: Digging Deeper on Your Testing Reports to Elevate Patient Outcomes in Advanced Non–Small Cell Lung Cancer

11th Annual School of Gastrointestinal Oncology® (SOGO®)

Addressing Unmet Needs in HER2+ Metastatic BTC

Community Practice Connections™: Tailored Treatment Approaches for Older Patients With Advanced HR+/HER2– Breast Cancer

Community Practice Connections™: Optimizing Treatment Outcomes and Preserving Fertility in Premenopausal HR+ Breast Cancer

From Bench to Bedside: Paradigm Shifts in HER2+ Metastatic BTC Treatment

Proactive Adverse Event Management for HER2+ BTC Treatments

Community Practice Connections™: Empowering Interventional Radiologists in the Emerging Era of Oncolytic Immunotherapies for Melanoma

A Case-Guided Discussion on Managing Immune Thrombocytopenic Purpura (ITP)

GI Tumor Board—Applying Recent Advances in Biomarker Testing and Treatment in Metastatic Colorectal Cancer

Evolving Treatment Strategies in Pancreatic Cancer: Current Standards, Emerging Targets, and the Role of Molecular Testing

Medical Crossfire®: Precision Medicine in Glioma Treatment — Integration of Molecular Profiling to Inform Targeted Therapies

Cases and Conversations™: Sorting Through the Expanding Treatment Options for Patients with Relapsed/Refractory Multiple Myeloma

PER Tumor Board®: Applying Recent Advances to Transform the Treatment Paradigm in SCLC—Expert Perspectives on New Approvals and Emerging Strategies

Medical Crossfire®: Harnessing the Power of Modern Therapies in Newly Diagnosed Multiple Myeloma

Medical Crossfire®: Improving Patient Outcomes in Myeloproliferative Neoplasms With Novel Therapeutic Approaches

Tumor Board: Expert Insights on Managing Classical 𝘌𝘎𝘍𝘙 Mutations, 𝘌𝘎𝘍𝘙 Exon 20 Insertions, and Atypical 𝘌𝘎𝘍𝘙 Mutations in Metastatic NSCLC

Medical Crossfire®: Expert Perspectives on Targeting c-Met Overexpression and 𝘔𝘌𝘛 Genomic Alterations in NSCLC – Unveiling the Complexities of 𝘔𝘌𝘛 Dysregulation

Cases & Conversations™: Transforming AML Care—Precision Strategies, Evolving Therapies, and Clinical Insights

Medical Crossfire®: Integrating Next-Generation Endocrine Targeting Therapies to Improve Outcomes for Patients With HR+/HER2- Breast Cancer

Medical Crossfire® in Adjunctive Testing: Charting a New Course in Prostate Cancer Risk Assessment

Trending on Chief Healthcare Executive

As shutdown reaches one month, health systems see impact

Encouraging patients with breast cancer: ‘Don’t steal hope’