Why drug repurposing requires biomedical NLP and AI

Learn how NLP and generative AI are enhancing drug discovery and repurposing by extracting insights from complex biomedical data.
Published
Written by
Picture of Molly Bookner
Content Marketing Manager
Key Takeaways

Natural language processing (NLP) and deep learning are becoming core tools in the life sciences space – just this past January, the FDA issued its first guidance on the use of artificial intelligence (AI) in drug development. But making sense of biomedical text isn’t easy. It requires understanding how diseases, drugs, genes, and pathways connect – something that general-purpose language models, such as ChatGPT, often struggle to do independently. 

A prime example of this is drug repurposing, where NLP must identify mechanisms of action and therapeutic potential, as well as treatment relationships and off-label applications scattered across scientific literature, clinical data, and real-world evidenceThis is where advanced clinical AIdomain-specific NLP, and biomedical data science expertise become essential. 

Our white paper, NLP and generative AI in life sciences and precision medicine, explores the nuances of four key applications, demonstrating how specialized biomedical NLP and generative AI help extract more reliable insights for drug discovery and precision medicine. 

Only have time for an excerpt? Continue reading below. Otherwise, click the button to download. 

WHITE PAPER

NLP and generative AI in life sciences and precision medicine

Only have time for an excerpt? Continue reading below. Otherwise, click the button to download.

Drug discovery and drug repurposing

A critical step in drug discovery and drug repurposing is identifying evidence from massive and rapidly growing biomedical literature to help generate hypotheses. The use of systematic literature review and data mining support this work to build a knowledge base, assist research gap analysis, synthesize evidence, and direct research. Effective literature review also includes details that support traceability requirements for FDA regulatory submission.

The process of literature review poses many challenges. It is:

  • Labor intensive due to the volume of articles in sources such as PubMed Central® (PMC), MEDLINE®, and Online Mendelian Inheritance in Man (OMIM)
  • Prone to errors as a highly manual process
  • Difficult to stay current with sources that grow and evolve rapidly

Generative AI and NLP for drug discovery and drug repurposing

Generative AI can significantly improve literature analysis for drug discovery and drug repurposing. The combined use of NLP and generative AI supports each step of the process from study protocol setting and literature retrieval, to abstract screening, full-text screening, data element extraction from full-text articles, results summary, and data visualization. Unique NLP tasks predict articles’ relevance based on their title, abstract, and other metadata. Named entity recognition parses full-length articles and extracts data elements from both text and tables and highlights supporting information. With AI-automated literature review and mining, one can specify protocol with natural language, speed the process to define fine-tuning tasks, and create a “living” system that proactively and continuously updates relevant literature in a timely manner.

While the steps in each literature review are the same and can be more efficient with AI, the NLP requirements to address different business goals, diseases, and compounds vary in different contexts. Even work in the same disease space requires unique knowledge to produce reliable results. When augmented with domain expertise, data scientists can fine-tune and specify a protocol to customize and refine models that extract and summarize information unique to each study. Ultimately, scientists can dedicate more time to ensuring data quality and synthesizing evidence, while staying current.

Unique NLP tasks predict articles’ relevance based on their title, abstract, and other metadata.1

An IMO Health example

Accelerating drug repurposing with AI-drive framework

A recent study developed a framework to apply generative AI for drug repurposing studies. IMO Health scientists used NLP to extract biomedical entities and relations from 35 million PubMed abstracts. Using deep learning- based models, they built a knowledge graph of 20,000 entities (drugs, diseases, genes, etc.) and 10 million relations (“inhibits,” “treats,” “stimulates,” etc.) and scoring systems to predict the “treats” relations for each drug-disease pair. The evaluation module applied link prediction for 15 successful pairs of drugs and their new indications and found that all are ranked in the top 0.5% across all diseases.2

To learn more about the applications of biomedical NLP and generative AI in life sciences – including clinical trials, adverse drug reactions, and disease phenotyping – click below. 

1Soysal E, Warner J, Wang J, et al. Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP. Stud Health Technol Inform. Aug 2019. Accessed via: https://pubmed.ncbi.nlm.nih.gov/31438083/

2Huang LC, Li Y, Lee K, et al. Knowledgesphere: An Automated and Integrative Framework for Drug Repurposing Empowered By Knowledge Graph and AI. Value in Health, Volume 26, Issue 6, S2. June 2023. Accessed via: https:// www.ispor.org/heor-resources/presentations-database/presentation/intl2023-3668/127231 

Related Content

Latest Resources​

Funding for the RHT Program will be allocated this year, empowering rural health systems to improve healthcare access, quality, and outcomes.
For the second year in a row, Built In has deemed IMO Health one of the best midsize companies to work for...
Specialized terminologies help clinicians document with the precision complex care demands, supporting analytics, research, and care delivery.
ICYMI: BLOG DIGEST

The latest insights and expert perspectives from IMO Health

In your inbox, twice per month.