Modified on March 5, 2026 to remove the “subscribe” option. This blog has been retired and replaced by the S.P.I.R.I.T. newsletter.
Good morning, good afternoon, and good evening IRBers, clinical research educators, and investigators from around the world!
I hope everyone had a great first full week back to work! Now I can officially say I'm back to the grind. I should be used to how busy the start of the semester is with outreach training efforts. I was also busy reviewing submissions; we typically see an increase in submissions around this time since it is the start of the Spring semester.
Besides being busy with work, I was also working on the Show-and-Tell Series of blog posts. You can read about the series here:
Well, I wasn't the only one busy this week. The FDA went on a guidance-posting frenzy! Below is a list of guidance documents relevant to IRBs and clinical research that were issued between Monday, January 6, and Friday, January 10, 2025:
- Study of Sex Differences in the Clinical Evaluation of Medical Products
- Considerations for Complying with 21 CFR 211.110
- Considerations for the Use of Artificial Intelligence To Support Regulatory Decision-Making for Drug and Biological Products: Draft Guidance for Industry and Other Interested Parties
- Evaluation of Sex-Specific and Gender-Specific Data in Medical Device Clinical Studies: Draft Guidance for Industry and Food and Drug Administration Staff
- Artificial Intelligence-Enabled Device Software Functions: Lifecycle Management and Marketing Submission Recommendations: Draft Guidance for Industry and Food and Drug Administration Staff
- Developing Drugs for Optical Imaging
- Considerations for Including Tissue Biopsies in Clinical Trials
- Accelerated Approval and Considerations for Determining Whether a Confirmatory Trial is Underway
Though all of these guidance documents should be reviewed, I plan to deep-dive only into the following two:
- Considerations for the Use of Artificial Intelligence To Support Regulatory Decision-Making for Drug and Biological Products: Draft Guidance for Industry and Other Interested Parties
- Artificial Intelligence-Enabled Device Software Functions: Lifecycle Management and Marketing Submission Recommendations: Draft Guidance for Industry and Food and Drug Administration Staff
You may be wondering why I’m solely focusing on these two guidance documents.
I’ll tell you why I’m into AI!
- AI can optimize productivity. I can't tell you how many times this has saved me. For people like me who have trouble reading long documents, AI is great for summarizing key concepts. Of course, I will still read all documents in their entirety. However, it's nice to have a general idea of what I plan to read. That way, if there are any concepts or terms I'm unfamiliar with, I can look them up ahead of time. Then, when I'm reading the document in full, I won't have to waste time looking up terms and concepts.
- AI can help you be creative. I can't wait to share a post related to this! Before I think about automating a task, I ask ChatGPT if it's possible. Then, with the skills I've acquired over the years, I can attempt to act on my efficiency idea. I also love the DALL-E aspect, which is great for visual folks. I especially love to have DALL-E create flowcharts or diagrams. This helps me understand complex topics (such as the ethical codes in human subjects research).
- AI can be used in almost any field. Whether you're a writer, an artist, in IT, or even in compliance, AI can be helpful anywhere! I will say that it's important to make your audience aware when AI was used. I always love to give credit where credit is due. I feel that AI can make even the least creative person…a creator.
Though I sound pro-AI, I do see there are downsides.
- Privacy and confidentiality are major concerns. This likely goes without saying, but I will say it anyway. Compliance personnel such as myself know you shouldn't place any personal information into ChatGPT. Others may not know this. And what about ways to withdraw your data? Can you do that in ChatGPT? I know you can export data that was entered into the tool via your ChatGPT settings, but can ChatGPT unlearn data that has been withdrawn? I'm not sure, but I hope to learn more about this.
- Machine learning bias is real. For those who may not be familiar with it, machine learning is a subset of AI. Rather than being explicitly programmed with rules (e.g., in Python), the model learns from datasets and makes inferences based on patterns; that is why it is called "machine learning": the tool learns over time. I feel this is GREAT for a highly specific function (such as a customer service chatbot). But what about when AI is being used for biomarker analysis or drug development? How can we ensure that we are applying the Belmont principle of Justice (subjects must be fairly selected)? How do we ensure we are selecting datasets that are representative of the population of interest?
- Compliance professionals are in an arms race to regulate the rapid use of AI in research. Again, this likely goes without saying. You can probably start calling me "Captain Obvious" now. Even as I learn about AI, it is hard to keep track of everything that's going on. I follow federal agencies for guidance documents and for strategic plans discussing the ethical use of AI. To me, the problem is that everyone is developing their own guidance documents and best practices; something like this should have a standard that agencies can adopt. I plan to learn about the EU AI Act in further detail, as it seems like a great start. I recently completed GDPR trainings and felt that regulation really covered everything. We desperately need something like this in the United States.
In this post, I plan to highlight key takeaways from these lengthy AI guidance documents (90 pages total).
Then, I plan to analyze the documents even further as the request for comment is due Monday, April 7, 2025.
If you are interested in leaving a public comment with me, please email me at tmohseni@renovationinirbeducation.com.
Let’s make our voice count TOGETHER!
Considerations for the Use of Artificial Intelligence to Support Regulatory Decision-Making for Drug and Biological Products
Per the FDA, this guidance provides recommendations to sponsors and other interested parties on the use of AI to produce information or data intended to support regulatory decision-making regarding the safety, effectiveness, or quality of drugs. Though I am not an expert in drug development, I do have an intermediate understanding of AI. Let's see what the FDA has to say.
The guidance provides a risk-based credibility assessment framework that may be used for establishing and evaluating the credibility of an AI model for a particular context of use (COU).
The COU defines the specific role and scope of the AI model in addressing a specific question. As a former auditor, I also appreciate that the FDA has defined the word "should": it means the FDA recommends the actions within this guidance, but they aren't required. I remember carefully reviewing policies for words like "should," "shall," or "must." It's important for institutions to define these as well. That way, when folks are reviewing their institution's policy or guidance, they know what is required versus what is recommended.
A Risk-Based Credibility Assessment Framework
This is a 7-step process:
- Step 1: Define the question of interest that will be addressed by the AI model.
- Step 2: Define the COU for the AI model.
- Step 3: Assess the AI model risk.
- Step 4: Develop a plan to establish credibility of AI model output within the COU.
- Step 5: Execute the plan.
- Step 6: Document the results of the credibility assessment plan and discuss deviations from the plan.
- Step 7: Determine the adequacy of the AI model for the COU.
Okay, so we know the steps. What do we do for each of these steps?
Step 1 should describe the specific question, decision, or concern being addressed by the AI model. For step 2, the description of the COU should describe in detail what will be modeled and how model outputs will be used. It should also note whether other information, such as animal studies and/or clinical human research studies, will be used in conjunction with the AI model's output to answer the question of interest from step 1.

In step 3, model risk is assessed by two factors: model influence and decision consequence. Model influence, as it sounds, reflects how much the evidence derived from the AI model contributes, relative to other evidence, to answering the question of interest from step 1. Decision consequence is the significance of an adverse outcome resulting from an incorrect decision concerning that question. To appropriately assess these components of model risk, subject-matter expertise is strongly advised.
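To make the step 3 idea concrete, here is a minimal sketch of model risk as a function of model influence and decision consequence. This is my own illustration, not the FDA's: the three levels and the lookup table are simplifying assumptions, and a real assessment would be a documented, expert-driven judgment rather than a lookup.

```python
# Hypothetical illustration of step 3: model risk as a combination of
# model influence and decision consequence. The levels and the table
# below are my own simplification, not taken from the FDA guidance.

RISK_MATRIX = {
    # (model_influence, decision_consequence): model_risk
    ("low", "low"): "low",
    ("low", "medium"): "low",
    ("low", "high"): "medium",
    ("medium", "low"): "low",
    ("medium", "medium"): "medium",
    ("medium", "high"): "high",
    ("high", "low"): "medium",
    ("high", "medium"): "high",
    ("high", "high"): "high",
}

def assess_model_risk(model_influence: str, decision_consequence: str) -> str:
    """Look up a qualitative model risk level for a given COU."""
    key = (model_influence.lower(), decision_consequence.lower())
    if key not in RISK_MATRIX:
        raise ValueError("levels must be 'low', 'medium', or 'high'")
    return RISK_MATRIX[key]

# Example: the AI model is the primary evidence (high influence) and an
# incorrect decision could directly affect subjects (high consequence).
print(assess_model_risk("high", "high"))  # -> high
```

The takeaway is the intuition, not the table: the more the decision leans on the model and the worse the consequence of getting it wrong, the more rigorous the credibility assessment plan in step 4 needs to be.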
Step 4 describes what information should be in your credibility assessment plan. Below is a summarized list of information that should be considered:
- Describe the datasets used for training and tuning the AI model and which model development activities were performed using these datasets
- Describe how the development data have been or will be collected, processed, annotated, stored, controlled, and used for training and tuning the AI model
- Describe how the development data are fit for the COU
- Describe whether the development data are centralized
- Describe how the AI model was trained
- Specify if a pre-trained model was used
- Describe the use of ensemble methods
- Explain any calibration of the AI model
- Describe the quality assurance and quality control procedures for computer software and how version changes were tracked (as well as code verification)
- Describe the applicability of the test data to the COU to minimize data drift
- Describe the agreement between the model prediction and the observed data
- Provide rationale for the chosen model evaluation method
- Describe any model limitations and biases
For step 5, discussing the credibility assessment plan with the FDA prior to execution may be helpful; the last section of the document describes early engagement options with the FDA. Step 6 should involve documenting the results of the plan and any deviations from steps 1-4. Once this is complete, you can proceed to step 7, where you determine whether the AI model is adequate for the COU. Finally, the document concludes with life cycle maintenance of the credibility of the AI model output in certain COUs. This can be described as the management of changes to an AI model (whether incidental or deliberate).
Artificial Intelligence-Enabled Device Software Functions: Lifecycle Management and Marketing Submission Recommendations
Per the FDA:
This draft guidance, when finalized, will represent the current thinking of the FDA on this topic.
Though this document will represent FDA's thinking, I appreciate the FDA's flexibility in its approach to these recommendations: so long as the applicable statutes and regulations are met and it has been discussed with the FDA, you can use an alternative approach. The guidance provides recommendations on the contents of marketing submissions for devices that include AI-enabled device software functions, including documentation and information that will support FDA's review. Similar to the previous guidance, the FDA defines the word "should" as "suggested" or "recommended." Now…let's get into this document!
The FDA promotes a total product life cycle (TPLC) approach to the oversight of medical devices. You can read more about TPLC here: Total Product Life Cycle for Medical Devices. The guidance also discusses recent efforts such as the 10 tenets of Good Machine Learning Practice (GMLP). It further distinguishes terminology used by the FDA from that of the general AI community. For example, using the term "validation" to mean "training" or "tuning" should be avoided in medical device marketing submissions; the word "development" should be used instead. The FDA Digital Health and Artificial Intelligence Glossary – Educational Resource provides a compilation of commonly used AI terms and how the FDA defines them.
The next few sections of the guidance cover what the FDA recommends including in marketing submissions. Each section explains why the content should be included, what should be included, and where to include it. Below is a general outline of what is recommended for submission:
General Outline for Marketing Submissions
- Device description
- A statement that AI is used in the device
- A description of device inputs and outputs
- An explanation of how AI is used to achieve the deviceโs intended use
- A description of the intended users, their characteristics, and the level and type of training they are expected to have and/or receive
- A description of its intended use environment(s)
- A description of the intended workflow for the use of the device
- A description of installation and maintenance procedures
- A description of any calibration and/or configuration procedures
- If the device can be configured by a user, then the submission should include information about:
- All configurable elements of the AI-enabled device
- How these elements and their settings can be configured
- The potential impact of the configurable elements on user decision-making
- If a device contains multiple connected applications with separate interfaces, then the device description should address all these applications
- User Interface
- A graphical representation of the device and its user interface
- A written description of the device user interface
- An overview of the operational sequence of the device and the userโs expected interactions with the user interface
- Examples of the output format
- A demonstration of the device
- Labeling
- The following should be included at the age-appropriate reading level for the intended user:
- Inclusion of AI
- Model input
- Model output
- Automation
- Model architecture
- Model development data
- Performance data
- Device performance metrics
- Performance monitoring
- Limitations
- Installation and use
- Customization
- Metrics and visualization
- Patient and caregiver information
- Risk assessment
- Risk management file
- Data management for both training and testing data
- Data collection
- Data processing and cleaning
- Reference standard
- Data annotation
- Data storage
- Management and independence of data
- Representativeness
- Model description and development
- Performance validation
- Device performance monitoring
- Cybersecurity
- Cybersecurity risk management report
- How cybersecurity testing addresses the risks in the report
- A security use case view(s) that covers the AI-enabled considerations for the device
- A description of controls
- Public submission summary
- A statement that AI is used in the device
- An explanation of how AI is used as part of the deviceโs intended use
- A description of the class of model and its limitations
- A description of development and validation datasets
- A description of the statistical confidence level of predictions
- A description of how the model will be updated and maintained over time
I hope you found this content enlightening and useful! I strive to provide my readers “food for thought”. What did you think of my interpretation of the guidance documents? Please leave a comment below and let’s get this discussion started!
