About RAPTOR
RAPTOR (Research Assets Provisioning and Tracking Online Repository) serves as a secure cloud-based data repository and analytics platform, enabling approved researchers to securely access and analyze the SG10K_Health data for approved research project.
The RAPTOR platform is designed to be foundationally secure and embedded with all elements essential for data governance, adhering to the “5 Safes†framework: Safe Purpose, Safe People, Safe Settings, Safe Data and Safe Output.
Getting started
Accessing the SG10K_Health data through RAPTOR
The SG10K_Health data is a controlled dataset, to gain access to this resource through RAPTOR, please follow these steps:
-
Researcher submits the SG10K_Health data access request form to the NPM Data Access Committee (NPM DAC) for review.
-
Upon approval, your institute will be required to sign a non-negotiable Data Access Agreement (DAA).
-
Approved researchers will need to agree to the Terms of Use for RAPTOR platform and attend a mandatory onboarding training session before using the platform.
-
A RAPTOR account will be created by the RAPTOR admin for the approved researchers.

Login to RAPTOR
If you already have a RAPTOR account, click on the below link to access the platform.
Cost and Billing
Researchers will be billed based on their usage, which includes a nominal fee for data platform management and administration. The costs incurred by platform users include:
-
Compute resources: Charges related to the utilization of computational resources on the platform.
-
Data storage: Costs apply to data storage beyond the SG10K_Health data provided by NPM. This includes data uploaded by users or data generated during their work on the platform.
-
Data egress: Expenses associated with the transfer of data out of the platform.
-
Platform management and administration fee: A fee to cover the administration and maintenance of the RAPTOR platform.
FAQ for RAPTOR
What is RAPTOR?
-
RAPTOR is a secured cloud-based data analytics platform for authorized researchers to access the NPM SG10K_Health data to conduct their analyses securely without the need to download the SG10K_Health data.
What will be the cost involved in using RAPTOR?
-
There is no cost involved for accessing the SG10K_Health dataset. However, researchers will incur cost for computation, data storage and data egress perform on RAPTOR. There will also be an administration fee for using the RAPTOR platform.
-
See rate card to estimate the cost of genomic data analysis on RAPTOR. The rates are subjected to change from time to time.
What are the available tools available on RAPTOR?
-
RAPTOR platform supports R, Python, Hail/SPARK and EMR/Notebook.
Can custom codes or tools be imported into RAPTOR?
-
Researchers are allowed to import their own custom codes or tools into RAPTOR for their analysis.
Can data be imported into RAPTOR? Any restrictions on the file format and file size?
-
Researchers can import their own data into RAPTOR, there is no restriction on the file format and file size, but Researchers will need to bear the cost incurred for importing and hosting the data on RAPTOR.
What data will be available on RAPTOR?
-
The de-identified SG10K_Health research phenotypes, whole genome joint-call variant files (.VCF), phased SG10K_Health imputation reference panel and DNA methylation data (.idat) are available for access by authorized researchers via RAPTOR.
Can I egress data from RAPTOR?
-
Researchers will only be allowed to export aggregated research results from RAPTOR, which will be subjected to RAPTOR Data Concierge team review and approval.