About RAPTOR 

RAPTOR (Research Assets Provisioning and Tracking Online Repository) is a secure cloud-based data repository and analytics platform that enables approved researchers to access and analyze the SG10K_Health data for approved research projects.

The RAPTOR platform is secure by design and incorporates the elements essential for data governance, following the “5 Safes” framework: Safe Purpose, Safe People, Safe Settings, Safe Data and Safe Output.

Getting started

Accessing the SG10K_Health data through RAPTOR

The SG10K_Health data is a controlled dataset. To gain access to it through RAPTOR, please follow these steps:

  1. The researcher submits the SG10K_Health data access request form to the NPM Data Access Committee (NPM DAC) for review.

  2. Upon approval, your institute will be required to sign a non-negotiable Data Access Agreement (DAA).

  3. Approved researchers will need to agree to the Terms of Use for the RAPTOR platform and attend a mandatory onboarding training session before using the platform.

  4. A RAPTOR account will be created by the RAPTOR admin for the approved researchers.

Log in to RAPTOR

If you already have a RAPTOR account, use the link below to access the platform.

Cost and Billing

Researchers are billed based on their usage, plus a nominal fee for platform management and administration. The costs incurred by platform users include:

  • Compute resources: Charges related to the utilization of computational resources on the platform.

  • Data storage: Costs apply to data storage beyond the SG10K_Health data provided by NPM. This includes data uploaded by users or data generated during their work on the platform.

  • Data egress: Expenses associated with the transfer of data out of the platform.

  • Platform management and administration fee: A fee to cover the administration and maintenance of the RAPTOR platform.
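As a rough illustration of how the four cost components above add up, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the `RateCard` class, the `estimate_monthly_cost` function, the billing units (hours, GB-months, GB), the flat monthly administration fee, and all numeric rates are hypothetical. Actual pricing and the way the administration fee is calculated are defined by the RAPTOR rate card, not by this example.

```python
from dataclasses import dataclass


@dataclass
class RateCard:
    """Hypothetical per-unit rates; real values come from the RAPTOR rate card."""
    compute_per_hour: float       # charge per compute hour (assumed unit)
    storage_per_gb_month: float   # charge per GB stored per month (assumed unit)
    egress_per_gb: float          # charge per GB transferred out (assumed unit)
    admin_fee: float              # assumed here to be a flat monthly fee


def estimate_monthly_cost(rates, compute_hours, stored_gb, egress_gb):
    """Return a per-component breakdown and the total for one month.

    stored_gb covers only user-uploaded or user-generated data, since
    the SG10K_Health data provided by NPM is not charged for storage.
    """
    breakdown = {
        "compute": rates.compute_per_hour * compute_hours,
        "storage": rates.storage_per_gb_month * stored_gb,
        "egress": rates.egress_per_gb * egress_gb,
        "admin": rates.admin_fee,
    }
    breakdown["total"] = sum(breakdown.values())
    return breakdown


# Made-up rates and usage figures, purely for illustration.
example_rates = RateCard(
    compute_per_hour=2.0,
    storage_per_gb_month=0.5,
    egress_per_gb=0.25,
    admin_fee=10.0,
)
example = estimate_monthly_cost(
    example_rates, compute_hours=10, stored_gb=100, egress_gb=4
)
print(example)
```

Keeping the components separate in the returned breakdown makes it easier to see which activity dominates the bill before scaling up an analysis.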

FAQ for RAPTOR

  • RAPTOR is a secure cloud-based data analytics platform that lets authorized researchers access the NPM SG10K_Health data and conduct their analyses securely without downloading the data.

  • There is no cost for accessing the SG10K_Health dataset. However, researchers will incur costs for computation, data storage and data egress performed on RAPTOR. There will also be an administration fee for using the RAPTOR platform.

  • See the rate card to estimate the cost of genomic data analysis on RAPTOR. The rates are subject to change from time to time.

  • The RAPTOR platform supports R, Python, Hail/Spark and EMR/Notebook.

  • Researchers are allowed to import their own custom code or tools into RAPTOR for their analysis.

  • Researchers can import their own data into RAPTOR. There is no restriction on file format or file size, but researchers will need to bear the cost of importing and hosting the data on RAPTOR.

  • The de-identified SG10K_Health research phenotypes, whole genome joint-call variant files (.VCF), phased SG10K_Health imputation reference panel and DNA methylation data (.idat) are available for access by authorized researchers via RAPTOR.

  • Researchers will only be allowed to export aggregated research results from RAPTOR, subject to review and approval by the RAPTOR Data Concierge team.
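To make the difference between individual-level data and aggregated results more concrete, the sketch below reduces per-individual genotypes at a single biallelic site to summary counts and an alternate allele frequency. It is only an illustration under stated assumptions: the `summarize_site` function, the 0/1/2 genotype encoding and the toy input are hypothetical, and whether any particular output can be exported is decided by the RAPTOR Data Concierge team's review, not by this example.

```python
from collections import Counter


def summarize_site(genotypes):
    """Aggregate per-individual genotypes at one biallelic site.

    genotypes: iterable of alternate-allele counts per individual
    (0, 1 or 2), with None for a missing call. This encoding is an
    assumption made for the example.
    """
    called = [g for g in genotypes if g is not None]
    counts = Counter(called)
    n_called = len(called)
    # Each called individual contributes two alleles at a diploid site.
    alt_freq = sum(called) / (2 * n_called) if n_called else None
    return {
        "n_called": n_called,
        "n_hom_ref": counts[0],
        "n_het": counts[1],
        "n_hom_alt": counts[2],
        "alt_allele_frequency": alt_freq,
    }


# Toy, made-up genotypes for five individuals (one missing call).
summary = summarize_site([0, 1, 2, None, 0])
print(summary)
```

The returned dictionary contains only counts and a frequency, with no per-individual rows, which is the general shape of a summary-level result.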