Dr Vladimir Makarov, AI Community lead, The Pistoia Alliance, explains that despite the willingness of the life sciences R&D industry to adopt artificial intelligence (AI), data integrity is a key area that must be addressed to ensure successful integration of this fast-evolving technology.
As the life sciences and healthcare sectors increasingly integrate artificial intelligence (AI) into their infrastructure, research by GlobalData found that one-third of healthcare professionals consider data privacy the primary obstacle, with “robust safeguards, workflow integration, and stakeholder acceptance [being] essential to ensure AI’s ethical implementation and long-term success”.1
These concerns are held alongside the promise of this technology and its potential for “improving efficiency, enhancing patient outcomes, and addressing critical challenges like staffing shortages. However, the road to successful implementation is fraught with concerns around data privacy, workflow integration, and acceptance by both patients and physicians,” stated Sachin Gharat, Associate Project Manager, Pharma at GlobalData.1
What does the AI hype cycle mean for life sciences R&D?
AI use in life sciences R&D has now passed through most major stages of the Gartner Hype Cycle2 and 2025 will see the industry enter the ‘plateau of enlightenment’. This means that AI technology is widely used, and that at the same time, best practices and applications are still being discovered and recorded. Strengths and weaknesses of AI are generally well understood by the life sciences industry, meaning we are now seeing more rational investments in AI and specific use cases, rather than impulsive spending. Yet, AI remains the top area for investment over the next two years, according to 62 percent of respondents in Pistoia’s 2024 Lab of the Future report.3
How are attitudes towards AI changing among corporate researchers in industries including pharmaceuticals and life sciences?
There is a widespread willingness among researchers to use AI, with 68 percent3 of life science professionals using it in 2024, compared to 54 percent in 2023. However, 28 percent of researchers still perceive AI as not trustworthy, reliable, or responsible. This perception is driven partly by challenges around the quality of content used to train AI. Low-quality and poorly curated datasets are the number one barrier to AI implementation (cited by 52 percent).
Researchers are very aware that poor-quality inputs can lead to inaccurate and biased AI outputs, which can have huge consequences in a field such as drug development, where patient health is at stake. In 2025, the industry must collaborate on these data challenges to enable the remaining third of researchers who are not using AI to do so safely and confidently.
What about the ongoing need for trusted AI systems and data in a world where everyone has access to AI tools?
Trustworthy AI4 centres on transparency about what data is input to models, explainability of results, and confidentiality of sensitive data. These ideas are central to emerging AI regulations, such as the 2023 US AI Executive Order,5 the 2024 Memorandum,6 and the EU AI Act, which will require high-risk applications of AI to file conformity assessments to prove their transparency and trustworthiness.
Meanwhile, FAIR data is the backbone which underpins any good AI model. Adhering to the FAIR principles ensures data can more freely move through the research environment, so greater value can be unlocked over longer periods of time, including enabling more effective secondary reuse.
However, from speaking to our members, it is clear there is still some way to go before the industry ticks both the FAIR and transparency boxes. Indeed, 38 percent still cite data that is not FAIR as a barrier to AI.
In 2025, organisations must focus on addressing data integrity challenges. Learning from each other’s mistakes and successes will be key to advancing AI adoption. The Pistoia Alliance has long been a pioneer of FAIR data, and we invite organisations to access some of the resources we have developed with the industry, such as our 2024 paper on good machine learning practices7 and our FAIR Toolkit.8
About the interviewee
Dr Vladimir Makarov is a consultant and project lead at The Pistoia Alliance. He is Programme Manager for the Alliance’s AI and ML Centre of Excellence, a hub for pre-competitive research for the pharmaceutical industry. His past experience centres mainly on informatics, including roles at Illumina, Pfizer, and BT Global Services. He has a PhD in computational biology from Baylor College of Medicine and is former faculty at California State University and the University of Maryland.
References
1. One Third Of HCPs Prioritize Data Privacy As The Top Challenge For AI In Clinical Practice, Says GlobalData. [Internet] GlobalData. 2024. Available from: https://www.globaldata.com/media/pharma/artificial-intelligence-in-clinical-practice-physician-perspective-2024/
2. Gartner Hype Cycle – Interpreting Technology Hype. [Internet] Gartner. Available from: https://www.gartner.com/en/research/methodologies/gartner-hype-cycle
3. Lab of the Future Survey 2024. [Internet] Pistoia Alliance. 2024. Available from: https://www.pistoiaalliance.org/blog/lab-of-the-future-2024-global-survey/
4. AI/ML Webinar – Trustworthy AI. [Internet] Pistoia Alliance. 2022. Available from: https://www.pistoiaalliance.org/pistoia-webinars/trustworthy-ai/
5. Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence. [Internet] US Federal Office. 2023. Available from: https://www.federalregister.gov/documents/2023/11/01/2023-24283/safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence
6. Memorandum on Advancing the United States’ Leadership in Artificial Intelligence; Harnessing Artificial Intelligence to Fulfill National Security Objectives; and Fostering the Safety, Security, and Trustworthiness of Artificial Intelligence. [Internet] wh.gov. 2024. Available from: https://www.whitehouse.gov/briefing-room/presidential-actions/2024/10/24/memorandum-on-advancing-the-united-states-leadership-in-artificial-intelligence-harnessing-artificial-intelligence-to-fulfill-national-security-objectives-and-fostering-the-safety-security/
7. Makarov V, Chabbert C, Koletou E et al. Good Machine Learning Practices: Learnings From The Modern Pharmaceutical Discovery Enterprise. Computers in Biology and Medicine. 2024; 177: 108632.
8. FAIR Implementation. [Internet] Pistoia Alliance. 2024. Available from: https://www.pistoiaalliance.org/projects/current-projects/fair-implementation/
This article highlights critical considerations for integrating AI in life sciences R&D and healthcare, shedding light on challenges like data integrity, privacy, and stakeholder acceptance. Dr. Vladimir Makarov’s emphasis on data integrity aligns with the industry’s increasing reliance on high-quality data for effective AI deployment. It’s essential to address these challenges proactively to realize AI’s potential for improving efficiencies and patient outcomes.