Digital Data Flow

Published Date: 27 June 2023

CDISC is collaborating with TransCelerate as a part of TransCelerate’s Digital Data Flow Project to develop a Study Definition Reference Architecture that will serve as a standard model for the development of conformant study definition technologies, including the TransCelerate-developed Study Definitions Repository Reference Implementation. Additional information can be found on the TransCelerate Digital Data Flow Solutions page.

Digital Data Flow - Phase 1 (July 2021 - July 2022)

The Reference Architecture (RA) will provide solution architects with a common vocabulary, reusable designs, industry best practices, standards, and general implementation guidance. The RA is not a solution architecture and is not implemented directly but constrains concrete solution architectures with the purpose of promoting the development of conformant solution architectures that enable interoperability across multiple systems in a clinical study, improve efficiency and data quality, and reduce cycle times. 

Please refer to the Project Deliverables tab for more information on the standards developed during Phase 1.

Digital Data Flow - Phase 2 (September 2022 - July 2023)

Continuing the collaboration with TransCelerate, CDISC has further developed the Reference Architecture in a second phase. The anticipated focus of Phase 2 was to update the RA to:

  • enable greater population of study set-up elements and represent structured study design information for more complex trials
  • support electronic data capture (EDC) automation
  • demonstrate population of the TransCelerate Common Protocol Template (CPT)

Digital Data Flow - Phase 3 (Starting July 2023)

Phase 3 of the Digital Data Flow Project is due to commence in July 2023. Additional information on the activities for Phase 3 will be made available shortly.

How to Participate in Phase 3

We invite your organization to participate in Phase 3 of this exciting project. Please visit the DDF Project Participate tab to learn more.

DDF Project Call-for-Participation Webinar 

We are looking for targeted reviewers of the draft deliverables throughout the development phase with particular expertise in the following areas:

  • CDISC Protocol-related Controlled Terminology
  • UML model review in relation to study/protocol concepts
  • Technical skills to review API specifications in relation to study/protocol concepts
  • Experience of this type of project within your own organization

Anticipated Timelines for Phase 3:

  • Development Activities begin July 2023
  • 60-day Public Review is planned for Q1 2024.

Public Review time commitment is estimated at 10-20 hours depending on experience with the four areas mentioned above. We encourage reviewers to reach out to colleagues who may also have experience in reviewing these types of deliverables or have business experience with this type of project. Please get in touch using the sign-up instructions below.

Participate

  • If you would like to participate in this exciting effort, please follow the instructions below: 
    • Targeted Public Reviewers: Sign up on the CDISC Volunteer page and indicate DDF as the Standards Development team. Please enter Targeted Internal and Public Reviewer in the box "Specify in which capacity you want to participate."
    • Participation in Standards Development: Sign up on the CDISC Volunteer page and indicate DDF as the Standards Development team. In the specify box enter in what capacity you feel you can contribute to the team and the amount of time you can contribute to the project.

DDF Phase 3 will focus on updating the Unified Study Definition (USDM) reference architecture, which will serve as a standard model for the development of a Study Definitions Repository. The Repository is a novel central component aimed at facilitating the exchange of structured study definitions across clinical systems using technical and data standards.

DDF Phase 3 Webinars

Listen to the DDF Phase 3 Public Informational Webinar

DDF Phase 2 Webinars

Listen to the DDF Phase 2 Public Review Webinar

DDF Phase 1 Webinars

CDISC held a public review information webinar on 22 March 2022. We invite you to watch and learn more about the public review process.





CDISC and TransCelerate held a Project Information and Call for Volunteers webinar on 5 October 2021. We invite you to watch and learn more about this important initiative.
 

Published Date: 27 June 2023

DDF Reference Architecture v2.0 Final

The Digital Data Flow Reference Architecture v2.0 has been released and is available via the Files tab on this page. Key elements include the Unified Study Definition Model (USDM) Class Diagram, the Application Programming Interface (API) Specification, associated CDISC Controlled Terminology, and, new for this release, the Unified Study Definition Model Implementation Guide (USDMIG).

In version 2.0, the USDM has been updated to accommodate more complex study designs, demonstrate population of an electronic protocol template, and expand the model to include a biomedical concept layer to aid in CRF automation.

The USDMIG provides users with sufficient information to understand the USDM and its potential implementations with the study design process by showing examples of the types of study definition information that can be represented in the USDM.

The Reference Architecture v2.0 has completed the CDISC Standards Development Process and is released as Final; the CDISC Controlled Terminology may not be final at the time of release. Additionally, DDF Phase 3 will start July 2023 and build on v2.0. To keep informed of Reference Architecture future enhancements, please attend the informational webinars. Please visit the CDISC RA GitHub Site for ongoing development of Phase 3 of the Digital Data Flow Reference Architecture.

CDISC and TransCelerate have collaborated to develop the USDM Reference Architecture in conjunction with development of a Study Definitions Repository (a reference Implementation of the USDM architecture). For more information on the Study Definitions Repository, please visit the TransCelerate DDF GitHub site and the SDR Github Site.

For any questions related to this release please get in touch using the Contact Us tab. Also if you are interested in becoming part of the DDF team please see the Participate tab for more information.

The DDF Reference Architecture contains the following:

Unified Study Definition Model (USDM) Class Diagram

The USDM Class Diagram is a UML Class diagram that functions as the Study Definitions Logical Data Model (LDM). The class diagram as LDM depicts the classes and attributes that represent data entities and the relationships between those entities that are needed to support a conformant Study Definitions application. The class diagram includes a data dictionary that defines each of the classes and attributes to provide the semantics needed to implement the model consistently.

The purpose of the LDM is to provide guidance to the solution implementer. It does not attempt to be inclusive of every class, attribute, and relationship implemented by any given solution. Instead, the LDM describes all elements for a conformant solution based on the Minimal Viable Product. It provides input into the solution architecture with the expectation that every effort will be made to ensure the solution architecture aligns with the RA LDM. The LDM represented by the class diagram may be implemented in a variety of physical data stores, including relational databases, property graph databases, RDF triple stores, document databases, and others. The semantics represented in the LDM and data dictionary support a uniform understanding of the Study Definition data elements to improve interoperability. The LDM will be used to define the resources available in the API specification; these resource definitions represent the data exchange format for applications implementing the API.

The class diagram references industry standards and terminology systems where relevant. For example, the Study Definition class diagram includes references to CDISC Biomedical Concepts,  Foundational Standards, and Controlled Terminology. The data dictionary accompanying the class diagram will include references to the CDISC Library for retrieval of all CDISC content. Content from other standards and terminology systems (HL7, LOINC, CPT, ICH M11, et al) may be incorporated into the LDM, with the intention of extending interoperability to data exchange standards, such as FHIR. The class diagram also depicts essential elements of a study protocol including study objectives, endpoints, and the schedule of activities.  

CDISC Controlled Terminology to Support USDM

The use of standardized terminology is an essential element of interoperability. To support the Study Definition LDM, CDISC will develop new Protocol Controlled Terminology, including code lists and terms as well as changes to existing terms, if needed. Terminology for SDTM Trial Design domains will also be covered in this project.

Application Programming Interface (API) Specification

A standardized API provides a standard interface for a common set of core services that must be supported by all conformant solutions. A standard API will enable application developers to build solutions that can use the core services of any conformant Study Definition. Data standards, controlled terminology, data exchange standards, and APIs work together to create an optimal environment for interoperability between conformant systems.

A Study Definition REST API will be specified using the OpenAPI Specification (OAS) version 3.0. As a machine-readable format, OAS can be used to generate not only interactive documentation but also source code to aid developers creating software implementing the specification. The CDISC Library also publishes API specifications using these formats. The CDISC Library, HL7 FHIR, and the DDF Study Definition API form a growing set of standard APIs to improve interoperability and automation in clinical trials. Much like the CDISC Library API, the Study Definition API will support a range of media types, including JSON and XML.

The Study Definition API enables clients to create, update, and remove Study Definition content assuming the user’s role permissions support the activity. Accordingly, the API provides access to Study Definition services so that alternative user interfaces and applications can be developed to extend or exchange information with any given Study Definition tool.

Unified Study Definition Model Implementation Guide (USDMIG)

A new deliverable for the Digital Data Flow Reference Architecture v2.0 is a corresponding Implementation Guide. The Unified Study Definitions Model Implementation Guide (USDMIG) is intended for companies and individuals involved in the set-up of clinical studies—sponsors or stakeholders involved in upstream (protocol and content authoring tools)—and downstream consumers of system (e.g., electronic data capture, clinical trial management, trial master file) and document (e.g., protocol, clinical study reports, statistical analysis plans) standardized digitized study definitions.

This document provides users with sufficient information to understand the USDM and also its potential implementations with the study design process by showing examples of the types of study definition information that can be represented in the USDM.


Volunteering

I work with CDISC Foundational Standards, but am unfamiliar with deliverables for this project apart from Controlled Terminology, can I still get involved?
  • Yes, we welcome a variety of feedback from the user community.
  • The deliverables for this project are technical automation standards. CDISC is providing workshops during Internal and Public Review to assist in review of the standards.
  • We also recommend reaching out to colleagues within your organization who may have expertise in this area from a business and technical perspective.
What are the volunteer activities you're looking for besides review?
  • This project uses a strict agile methodology. As such, involvement as part of the development team requires significant time commitment (FTE 40% plus). 
  • Some organizations may already have experience in this type of project and individuals may want to volunteer in a supportive capacity to the standards developers (e.g., performing weekly reviews of the deliverables from each sprint).
  • If you are interested in serving on the development team, please visit the Participate tab and follow the sign-up instructions. 
What is the time commitment for volunteers?
  • Targeted Reviewer (Internal/Public): 10-15 hours per review.
  • Supporting the development team (in a review or protocol SME capacity): FTE 20% or more.
  • Participate as a developer: FTE 40% or more.
Is it expected that volunteers should have any specific technology/programming skills as well?
  • Targeted Reviewers do not require specific technical skills.
  • Supporting roles for the development team do not require specific technical skills but do require experience working in trial design.
  • The Development Team require specific technical skills in the area (for example UML diagram creation in Enterprise Architect, API specification development using SwaggerHub) and preferable experience working in the trial design area.
When I signed up for volunteering at CDISC, I had initially picked ADaM as the interest area. Can I change it to DDF for participating in this?
  • Yes, please follow the instructions on the Participate tab.
Where can I go if I have questions about DDF outside of the CDISC standards work?
  • For questions about the TransCelerate DDF initiative outside of the work that CDISC is doing, you should contact TransCelerate using the Digital Data Flow Feedback Form.
Do I have to be a CDISC member to volunteer?
  • No, volunteers on all CDISC teams are welcome from both member and non-member organizations.

Relationship to other Standards

Where do these new standards fit into the current CDISC standards landscape?
  • These standards support the CDISC vision of supporting the implementation of standards.
  • These technical standards will be open source and available for all users with alignment with current CDISC standards.
Is the Unified Study Definition Model (USDM) using an existing or new standard?
  • The USDM is a new technical automation standard, re-using existing standards where applicable.
Will this impact the Foundational Standards (CDASH, SDTM, ADaM)?

The new standards will support the development of conformant study definition technologies. Existing standards will be reused where appropriate; the new standards will not replace existing Foundational Standards.

Are the standards using the terminology developed by the CDISC Protocol Entities team?
  • Yes. The Controlled Terminology developers for this project are also part of the CDISC Protocol Entities team.
  • Existing terminologies are leveraged where applicable.
  • New terminology developed for this project will follow the standard CDISC Controlled Terminology process.
What is the relationship of the Unified Study Definition Model (USDM) to the BRIDG model?
  • Both BRIDG and the USDM are logical data models.
  • The USDM is a more practical and implementable model. CDISC is delivering a reference architecture that can immediately be implemented in a reference implementation.
Is this project linked to the CDISC 360 initiative? What are the connections?
  • There are team members involved in the development of the USDM that were also part of the CDISC 360 project.
  • The USDM will provide links to future biomedical concept development.
Does the scope of DDF also cover development of biomedical taxonomies and ontologies as well?
  • The USDM will be supported by CDISC Controlled Terminology and external ontologies where appropriate (e.g., study indication).
What is the plan to ensure that the study definition will encompass all study designs, including oncology and immunology?
  • The first release of the USDM will focus on the general cross-cutting concepts related to study definitions to be as flexible and useful for all study types.
Will standard CRFs be created as a result of this project?
  • No, discussions are ongoing about how the conformant study definition technologies may interact with company-specific eCRFs.
  • Independent of this project, CDISC offers the eCRF portal that is freely available for organizations to download eCRF content. eCRFS are added on an ongoing basis.
Interoperability is the ultimate goal. Is the integration of clinical studies with Electronic Health Records (EHRs) planned in the future with enhanced data exchange security?
  • Integration with EHRs will be discussed for future iterations, including alignment with Reference Architecture.
Are you all using the PRM in any way?
  • Yes, a list of CDISC, including PRM, and non-CDISC assets related to study design were evaluated as inputs during the scoping process.


Scope

Where can I find more information on TransCelerate's DDF project?
  • Additional information on the DDF Project is available on the TransCelerate DDF website.
Is this project for clinical data only or will it include nonclinical studies?
  • The main focus of the initial version will be related to clinical study designs; additional study design aspects, such as nonclinical, will be discussed for future iterations of the USDM.
Is the USDM intended for RCTs only, or are you considering observational studies within the initial, or later, scope?
  • Discussions on the different types of studies are currently ongoing.
Would this project deliver a standard protocol template which can be machine readable?
  • The intent is that the USDM will hold those common protocol elements that can be used by other systems or tools such as a digital protocol creation tool and the TransCelerate Common Protocol Template (which is part of the TransCelerate Clinical Content & Reuse Initiative).
How will contents of the individual elements within the USDM be authored/reviewed/approved? (e.g., Endpoints or Inclusion Criteria)
  • The development of the standards will follow the CDISC Standards Development Process and the governance process defined in the process. CDISC will work with TransCelerate and volunteer Protocol subject matter experts to develop the concepts for the CDISC deliverables.

 

Technology

What kind of technology recommendations is the effort going to publish? Is there a concern that any technology recommendation will bias towards one implementation or another and ultimately hurt the ability to innovate?
  • The Study Definition Repository Reference Implementation will be cloud agnostic and open-source; the aim is not to select a particular technology to influence users. The Study Definition Repository will act as a template for use in other technologies. It is a functioning, example approach to connect systems that produce protocol information with those that consume it to drive study operations. Various implementation patterns exist and organizations may or may not choose to implement a stand-alone SDR.
  • Upstream and downstream connecting vendors will be involved to provide their input into the deliverables so they can be implemented with connections to many upstream and downstream systems.
Has the Open Source License been picked?
  • The open-source license for the Study Definitions Repository Reference Implementation is Apache 2.0.
Is there any specific link to check on what is actually happening or any standards available as a part of TransCelerate?
  • The standards developed will be using the CDISC WIKI; volunteers will have access to this area.
  • Additional information on the DDF Project is available on the TransCelerate DDF webpage.
What about something like RESTFUL versus GraphQL? That selection has particular implementation implications.
  • The original rationale for choosing REST over GraphQL is that we're creating a standard API specification that all conformant systems must implement. We anticipate REST will be simpler to produce and govern as a standard specification. It should also allow for the broadest range of implementation support since REST is mature and widely supported across most technology stacks. The decision to use REST also came from early discussions with TransCelerate. CDISC isn’t suggesting that a server can't implement GraphQL, just that the standard API is RESTful. There are pros and cons to each alternative.
Will the model or the reference implementation be tested through some form of connectathon, such as the TransCelerate DDF hackathon which ran two years back now?
  • CDISC will deliver a set of conformance tests to ensure that any given Study Definition solution conforms to the DDF Study Definition RA.
  • Anyone in the technology community interested in becoming involved in the reference implementation testing should contact TransCelerate using the Digital Data Flow Feedback Form.
Please contact us using the form below if you have any questions or comments about the DDF project or any of the project's standards that have been developed. If you are interested in becoming part of the DDF volunteer team, please navigate the Participate tab.

Many thanks,
The CDISC DDF Team

CDISC, in collaboration with TransCelerate’s Digital Data Flow Project, is developing a reference architecture, which will serve as a standard model for the development of a Study Definitions Repository. The Repository is a novel central component aimed at facilitating the exchange of structured study definitions across clinical systems using technical and data standards.

Deliverables will include a logical data model, supporting Controlled Terminology, API specifications and related conformance tests. For more information, please read the press release.

CDISC invites you to provide comments on the draft DDF Project Deliverables during Public Review.

  • Please visit the DDF Public Review Dashboard page for instructions, timelines, links to the JIRA commenting tool and to deliverables. Please note that this page contains a link to a Public Review Workshop taking place on Monday 4th April 10:00-11:00 EDT. No invites will be sent so please use the Zoom details provided to join this workshop.
  • The recording of the Public Review webinar is available at this link .

You will need to log in or register for the CDISC Wiki to provide comments.

  • Register for the Wiki. If you already have an account on Wiki or JIRA, our issue-tracking system, simply log in to your account; Wiki and JIRA use the same login credentials. CDISC Wiki is a different login from www.cdisc.org.

Public review is a key quality step in our Standards Development Process. CDISC relies on your input to ensure neutral, consensus-based data standards are developed and adopted by a diverse global community interested in improving research processes and quality for the benefit of all.

Thank you for contributing your time and expertise.

CDISC is collaborating with TransCelerate and Accenture as a part of TransCelerate’s Digital Data Flow Project to develop a Study Definition Reference Architecture that will serve as a standard model for the development of a Study Definitions Repository. The Repository is a novel central component aimed at facilitating the exchange of structured study definitions across clinical systems using technical and data standards. Additional information can be found on the TransCelerate Digital Data Flow Solutions page.

Digital Data Flow - Phase 1 (July 2021 - July 2022)

The Reference Architecture (RA) will provide solution architects with a common vocabulary, reusable designs, industry best practices, standards, and general implementation guidance. The RA is not a solution architecture and is not implemented directly but constrains concrete solution architectures with the purpose of promoting the development of conformant solution architectures that enable interoperability across multiple systems in a clinical study, improve efficiency and data quality, and reduce cycle times. 

Please refer the the Project Deliverables tab for more information on the standards developed during Phase 1.

Digital Data Flow - Phase 2 (September 2022 - July 2023)

Continuing the collaboration with TransCelerate and Accenture, CDISC will further develop the Reference Architecture in a second phase. The anticipated focus of Phase 2 will be updates the RA to allow:

  • Population of structured  data elements from the TransCelerate eCPT
  • Enablement of eCRF automation

Phase 2 will also evaluate the inputs received during the first Public Review that were deferred to a future version, along with feedback from the TransCelerate Connectathon where vendors are invited to connect to the Reference Implementation Study Definitions Repository (SDR). Please see below for additional information on the TransCelerate Connectathon.

How to Participate in Phase 2

We invite your organization to participate in Phase 2 of this exciting project. Please visit the DDF Project Participate tab to learn more.