Data Modeling Team

Helping researchers connect the dots

Mission

To empower researchers with findable, reusable and interoperable data achieved by providing a common data model, tools and services to support strategic data management in Terra.

What We Do

Terra Interoperability Model (TIM)

TIM is a data model that captures a common set of concepts and relationships for biological research intended to facilitate and encourage data sharing and reuse. Its purpose is to enable researchers to find highly connected data in a federated search space and support interoperability among datasets.

We begin with a core model focused on findability across key genomic analysis processes, diagnoses, biosample and donor characteristics. This core subset will be extended to include other analysis types, patient treatments, lab tests and other medical record information as necessary to support our research datasets. Our consulting work informs these extension.

The Core Data Model Team, a dedicated group of researchers, data stewards, data owners, and data modelers, gives us our real superpower! External collaborations and maintaining expertise with other standards are important to our success here.

Consulting

We provide data modeling consulting for researchers and project teams to define or extend a data model for new use cases, new data types, new research methods or to better integrate with other data sources. Consulting for UX designers or engineers developing search or cohort-building tools where data modeling can help guide the design is also available.

Curation & Data Mapping

To support ingest and transformation work for datasets, the Data Modeling team supports the development of mapping specifications to support ingest and transformation work for datasets, leveraging available tools.

Tools

Tools make the data model accessible to researchers and engineers and support scaling and automating curation to generate mapping specifications for datasets. Tools may be 3rd party, developed with collaborators or in-house. See the Tools Page for more details.

  • CENtree Client - UI to explore the Terra Interoperability Model (TIM) and recommended vocabularies.

  • Vocabulary Server - An API to programmatically query TIM for synonyms and subclass/superclass relationships.

  • Data Model Exporter - Exports JSON Schema version of the TIM.

  • Curation & Mapping Tools - Tools to generate machine-readable mapping specifications.

  • Metadata Workbench - Generates data dictionary or template spreadsheets based on TIM.


Who we are

The Data Modeling Team consists of data modelers, bioinformaticians, data scientists and software engineers.

Kathy

Manager, Team vision/objectives, semantic data modeling, consulting.

Semantic data modeling, consulting, curation and data mapping.

 

 

?

to be hired…

Senior Software Engineer

Tools, curation and data mapping.