With companies acknowledging the value of their data, it becomes increasingly important that data integrity be assured – and data stewardship implemented. In this article, Ian Harrow and Thomas Liener, Consultant Project Managers at The Pistoia Alliance explain the importance of the FAIR (Findable, Accessible, Interoperable, Reusable) data principles, factors currently limiting their implementation and the potential benefits should these challenges be overcome.
TODAY, MANY LIFE SCIENCE companies have shifted to essentially become big data enterprises. They have come to understand the value of their data as a powerful corporate asset. However, the industry also has one of the biggest varieties of data sources from which to extract insight, continuously being generated in large volumes. This results in a ‘data deluge’. Much of the data used by life science companies are siloed in widely differing formats and locations – making it difficult to retrieve, query and share – rendering it essentially unusable.
Organisations understand this needs to change, that data integrity must be ensured and that principles of data stewardship need to be implemented with clear practical guidance. Especially as the industry shifts towards greater adoption of artificial intelligence (AI) and automation, without robust data management strategies for the entire lifecycle of data these efforts will fail to deliver. The EU revealed that this lack of FAIR (Findable, Accessible, Interoperable, Reusable) research data and associated metadata is costing the European economy at least €10.2bn every year.1
Furthermore, by drawing a rough parallel with the European open data economy, it concluded that the downstream inefficiencies arising from not implementing FAIR could account for a further €16bn in losses annually.
This report addresses the key factors shaping pharmaceutical formulation, including regulation, QC and analysis.
Access the full report now to discover the techniques, tools and innovations that are transforming pharmaceutical formulation, and learn how to position your organisation for long-term success.
What you’ll discover:
Key trends shaping the pharmaceutical formulation sector
Innovations leading progress in pharmaceutical formulation and how senior professionals can harness their benefits
Considerations and best practices when utilising QbD during formulation of oral solid dosage forms
The FAIR guiding principles contribute to better scientific R&D of new medical therapies for the efficacious and safe treatment of disease and health conditions. The principles place “…specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals”.2 They were published in 2016 with the goal of emphasising machine-actionability (the idea that computational systems can find, access, interoperate and reuse data with minimal human intervention) as humans increasingly need computational support to manage the vast volume and complexity of data generated daily.
FAIR can enable future innovation
Currently, many data are not standardised or easily retrievable. This makes it difficult or, in some cases, impossible to use for algorithms without scientists spending extended amounts of time wrangling data. However, the Lab of The Future (LoTF) will be digitally driven3 and will rely on high levels of automation. Both AI and machine learning (ML) systems will play a key role, both of which will also need quality data to ‘learn’ from. When AI is used to aid decision‑making regarding health, then harmonised, comparable and trusted data is paramount. In the LoTF, clear data management is required to continue to modernise lab environments and help the industry continue to make breakthroughs. FAIR implementation strategies must also, for example, provide for technology and format obsolescence to ensure data remains reusable.
What is holding back implementation?
For most life science companies, there are still many barriers to overcome during the implementation of FAIR. The biggest is often the financial and time investment required to begin the change process. Senior leadership teams need to be convinced that FAIR implementation will lead to long-term return on investment (ROI). As discussed above, FAIR can act as an enabler. AI thrives on accurate, reliable and machine-readable data to function; by making data machine-readable and AI-ready, it can usher in a new level of efficiency and effectiveness in biopharma R&D. FAIR can also provide ROI, as it greatly reduces the need for data wrangling and enhances data integration, meaning researchers and scientists can spend more time on innovative activities.
Workplace culture also needs to change. There must be a shift from the traditional approach of “my lab, my data” to a data-centric culture. It needs to be company-wide data at a minimum – and a clear strategy on how to store and reference data and metadata is needed. Often today, big breakthroughs are made through cross-company and cross-discipline collaboration and through sharing pre-competitive data. Take the collaborations and data sharing seen in the last 18 months to tackle COVID-19, which has resulted in the creation of life-saving treatments, diagnostics and vaccines in record time. If strong data management infrastructure and FAIR principles were adopted industry wide, we could see a repeat of such breakthrough efforts in other therapeutic areas.
However, the life science industry often has to protect sensitive data, especially given the highly regulated nature of the industry, which has made many hesitant to change their perspective to a data-centric culture. We need to educate employees and leaders on the value of FAIR implementation. This should be done through industry-wide training, development and education programmes. The implementation of FAIR principles requires an incentivised cultural change underpinned by clear and frequent communications. Awareness of the needs and challenges of all stakeholders in the early planning phase of FAIR implementation will lead to better solutions and better acceptance. The designation of data stewards in each company, charged with ensuring that data management and governance is embedded and maintained within the organisation, can catalyse this change.
Realising productivity gains
Adopting FAIR implementation can unlock the long-term potential of data for future research, improve collaboration, enable scientific queries to be answered more rapidly and support alignment with compliance requirements. All these factors can help to create productivity gains, which are now essential in biopharma as the time and cost of creating new therapies continues to skyrocket. In the same vein, the implementation of a FAIR mindset can reduce the need for work to be repeated, as it enables more data to be used for secondary purposes. In addition, the implementation of FAIR principles can also aid the drive towards precision medicine, by enabling the development of more segmented or personalised real-world data to match best treatment to relevant patient cohorts.
A crucial aspect of implementing the FAIR guiding principles is the measurement of the level of FAIRness. This is achieved using various metrics or maturity indicators that quantify FAIRness to seek improvements in an iterative manner to optimal levels, determined by business needs.
The Pistoia Alliance facilitates the implementation of the FAIR guiding principles with its FAIR Implementation Project.4 It shares best practices through the ongoing development of the FAIR Toolkit5 and through the development of a guide for the implementation of FAIR for Clinical (FAIR4Clin) trial and healthcare data. We welcome involvement from any stakeholder who would benefit from FAIR implementation for clinical data and those who are keen to share their knowledge and learnings.
About the authors
Ian Harrow (left) and Thomas Liener (right) are independent consultants and managers of The Pistoia Alliance6 FAIR implementation projects. Ian has been an active member of the Pistoia Alliance since its inception 10 years ago, previously working as a senior principal scientist in Animal and then Human Health at Pfizer for 27 years. Prior to this he undertook postdoctoral research in neurobiology at Columbia University and a PhD in neuropharmacology and electrophysiology at the University of Cambridge. Thomas Liener has ample experience with semantic web technologies like RDF and SPARQL, with ontologies and FAIR data management. He is an EMBL-EBI alumnus.
Wilkinson M, Dumontier M, Aalbersberg I, et al. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data. 2016;3(1). Available from: nature.com/articles/sdata201618
Harrow I. The FAIR Toolkit and FAIR Life Science Industry Implementation Project [Internet]. Pistoia Alliance. 2021 [cited 10 June 2021]. Available from: https://www.pistoiaalliance.org/projects/…
FAIR Toolkit – The FAIR Toolkit by Pistoia Alliance – A FAIR Toolkit for Life Science Industry [Internet]. Fairtoolkit.pistoiaalliance.org. 2021 [cited 10 June 2021]. Available from: https://fairtoolkit.pistoiaalliance.org/
Pistoia Alliance | Collaborate to Innovate Life Science R&D [Internet]. Pistoia Alliance. 2021 [cited 10 June 2021]. Available from: https://www.pistoiaalliance.org/
This website uses cookies to enable, optimise and analyse site operations, as well as to provide personalised content and allow you to connect to social media. By clicking "I agree" you consent to the use of cookies for non-essential functions and the related processing of personal data. You can adjust your cookie and associated data processing preferences at any time via our "Cookie Settings". Please view our Cookie Policy to learn more about the use of cookies on our website.
This website uses cookies to improve your experience while you navigate through the website. Out of these cookies, the cookies that are categorised as ”Necessary” are stored on your browser as they are as essential for the working of basic functionalities of the website. For our other types of cookies “Advertising & Targeting”, “Analytics” and “Performance”, these help us analyse and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these different types of cookies. But opting out of some of these cookies may have an effect on your browsing experience. You can adjust the available sliders to ‘Enabled’ or ‘Disabled’, then click ‘Save and Accept’. View our Cookie Policy page.
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Cookie
Description
cookielawinfo-checkbox-advertising-targeting
The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertising & Targeting".
cookielawinfo-checkbox-analytics
This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Analytics".
cookielawinfo-checkbox-necessary
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance
This cookie is set by GDPR Cookie Consent WordPress Plugin. The cookie is used to remember the user consent for the cookies under the category "Performance".
PHPSESSID
This cookie is native to PHP applications. The cookie is used to store and identify a users' unique session ID for the purpose of managing user session on the website. The cookie is a session cookies and is deleted when all the browser windows are closed.
viewed_cookie_policy
The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
zmember_logged
This session cookie is served by our membership/subscription system and controls whether you are able to see content which is only available to logged in users.
Performance cookies are includes cookies that deliver enhanced functionalities of the website, such as caching. These cookies do not store any personal information.
Cookie
Description
cf_ob_info
This cookie is set by Cloudflare content delivery network and, in conjunction with the cookie 'cf_use_ob', is used to determine whether it should continue serving “Always Online” until the cookie expires.
cf_use_ob
This cookie is set by Cloudflare content delivery network and is used to determine whether it should continue serving “Always Online” until the cookie expires.
free_subscription_only
This session cookie is served by our membership/subscription system and controls which types of content you are able to access.
ls_smartpush
This cookie is set by Litespeed Server and allows the server to store settings to help improve performance of the site.
one_signal_sdk_db
This cookie is set by OneSignal push notifications and is used for storing user preferences in connection with their notification permission status.
YSC
This cookie is set by Youtube and is used to track the views of embedded videos.
Analytics cookies collect information about your use of the content, and in combination with previously collected information, are used to measure, understand, and report on your usage of this website.
Cookie
Description
bcookie
This cookie is set by LinkedIn. The purpose of the cookie is to enable LinkedIn functionalities on the page.
GPS
This cookie is set by YouTube and registers a unique ID for tracking users based on their geographical location
lang
This cookie is set by LinkedIn and is used to store the language preferences of a user to serve up content in that stored language the next time user visit the website.
lidc
This cookie is set by LinkedIn and used for routing.
lissc
This cookie is set by LinkedIn share Buttons and ad tags.
vuid
We embed videos from our official Vimeo channel. When you press play, Vimeo will drop third party cookies to enable the video to play and to see how long a viewer has watched the video. This cookie does not track individuals.
wow.anonymousId
This cookie is set by Spotler and tracks an anonymous visitor ID.
wow.schedule
This cookie is set by Spotler and enables it to track the Load Balance Session Queue.
wow.session
This cookie is set by Spotler to track the Internet Information Services (IIS) session state.
wow.utmvalues
This cookie is set by Spotler and stores the UTM values for the session. UTM values are specific text strings that are appended to URLs that allow Communigator to track the URLs and the UTM values when they get clicked on.
_ga
This cookie is set by Google Analytics and is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. It stores information anonymously and assign a randomly generated number to identify unique visitors.
_gat
This cookies is set by Google Universal Analytics to throttle the request rate to limit the collection of data on high traffic sites.
_gid
This cookie is set by Google Analytics and is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visited in an anonymous form.
Advertising and targeting cookies help us provide our visitors with relevant ads and marketing campaigns.
Cookie
Description
advanced_ads_browser_width
This cookie is set by Advanced Ads and measures the browser width.
advanced_ads_page_impressions
This cookie is set by Advanced Ads and measures the number of previous page impressions.
advanced_ads_pro_server_info
This cookie is set by Advanced Ads and sets geo-location, user role and user capabilities. It is used by cache busting in Advanced Ads Pro when the appropriate visitor conditions are used.
advanced_ads_pro_visitor_referrer
This cookie is set by Advanced Ads and sets the referrer URL.
bscookie
This cookie is a browser ID cookie set by LinkedIn share Buttons and ad tags.
IDE
This cookie is set by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
li_sugr
This cookie is set by LinkedIn and is used for tracking.
UserMatchHistory
This cookie is set by Linkedin and is used to track visitors on multiple websites, in order to present relevant advertisement based on the visitor's preferences.
VISITOR_INFO1_LIVE
This cookie is set by YouTube. Used to track the information of the embedded YouTube videos on a website.