scRNA-seq_v3 User Manual
scRNA-seq_v3 User Manual
1. Scope of application
scRNA-seq_v3 can handle the high-throughput sequencing data analysis for the following kits:
DNBelab C Series High-throughput Single-cell RNA Library Preparation Set V3.0.
2. Product Introduction
The scRNA-seq is an automated analysis process developed based on the DCS Cloud, which includes data filtering and quality control, reference genome alignment and annotation, effective cell selection, downstream analysis, and report output.
2.1 Analysis Workflow Diagram

2.1.1 Data Filtering and Quality Control
Perform fastq data quality control according to the quality standards filled in the configuration file, removing low average base quality reads, trimming reads to specified lengths, and retaining high-quality sequences for subsequent analysis.
2.1.2 Reference genome alignment and annotation
Use STAR software to align the filtered cDNA fastq files with the reference genome, perform UMI correction, and provide annotation results.
2.1.3 Effective cell selection
Use the EmptyDrops method to select effective cells from the raw matrix, and merge cells using cosine similarity to generate a standard matrix file for downstream analysis.
2.1.4 Downstream analysis
Perform cell quality control and doublet filtering based on the generated matrix, conduct dimensionality reduction and clustering, and annotate cell types.
2.1.5 Report output
Compile and summarize the analysis results into an HTML report.
3. User Manual
scRNA-seq standard analysis workflow managed through the DCS Cloud for sample input and report output. The following provides a detailed guide on how to use scRNA-seq standard analysis workflow based on the DCS Cloud.
3.1 Guide Overview
This chapter introduces how to use scRNA-seq standard analysis workflow for analysis. Before using, please read and understand the content carefully to ensure correct usage of scRNA-seq.
3.2 Usage Scenario 1: Manual Submission
The operation consists of four steps: upload data, construct reference genome (optional), sample information entry, and start analysis. After completing the sample information entry, run the task, when the client sees the task status as completed, it indicates that the task has been completed, and the report section can be viewed (see section 2.4 for details).
3.2.1 Step One: Upload data
1) Click on the navigation bar [Data] to enter the Data Management page, navigate to the target folder, and click the upper right corner [+Add files]-[Tool upload] to upload data.

- Click [Upload] browse and select the required file (Figure 3-2), after uploading, the file will be displayed in the target folder (if this is the first upload, you need to click [Install and start the transport client] to install the required tools) .

3.2.2 Step Two: Construct Reference Genome (Optional)
- Click the navigation bar [Workflow ] to enter the Process Analysis page, enter scRNA-seq-build-index in the search box, and click [Run] .

- Select Run workflow, enter Entity ID, and click [Next ].

- Enter the reference genome information, and after completion click [Next].

::: warnig
Explanation of Variables for Entering Reference Genome Information :
refName:The species name of the reference genome, which will be displayed in the analysis report;
GTF:Genome annotation GTF file;
chrM:Mitochondrial name;
FASTA:Genome FASTA file;
Outdir:Output file path;
Cpu:CPU required for running;
Mem:Memory size required for running.
:::
- Click [Run] to start the analysis.

- After the task is completed, the Status will display as completed. Copy the Task ID (Figure 3 -7 ), click on the navigation bar [Data], and enter the Task ID to search (Figure 3 -8 ). The star_index file will serve as the reference genome file for the standard analysis process of single-cell RNA (Figure 3 -9 ).



3.2.3 Step Three: Sample Information Entry
- Click on the navigation bar [Workflow] to enter the Process Analysis page. In the search box, enter sc RNA-seq _ v3, and click [Run].

- Select Run workflow, enter Entity ID, click [Next].

- Enter sample information, and after completion click [Next].

Note
Explanation of Sample Information Entry Variables :
sampleID: Sample name, defaults to be consistent with Entity ID;
cDNA: cDNA library fastq format R1, R 2 sequences, note that for each pair of fastq, select R 1 first, then select R 2;
Oligo: oligo library fastq format R1, R 2 sequences, note that for each pair of fastq, select R 1 first, then select R 2;
genomeDir: Reference genome file, such as the star_index file in the Task ID folder if constructed independently in step two ( 3.2.2 Construct Reference Genome );
Outdir:Output file path;
expectcells: Expected cell count, can be selected based on experience;
forcecells: Force selection of bead count, forcibly cut off bead count based on drop chart; If it is 0, retain all cells;
Cpu: Required CPU size for operation, the minimum required CPU value is 8, recommended value is 16;
Mem: Required memory size for operation.
3.2.4 Step Four: Start Analysis
Click [Run] to start the analysis.

3.3 Usage Scenario 2: Table Submission
The operation includes five steps: upload data, construct reference genome (optional), download sample template, fill in and import sample template, start analysis. After completing the sample template import, tasks can be run in batches, when the customer sees the task status as completed, it indicates that the task is complete, and the report section can be viewed (see section 3.4 for details).
3.3.1 Step One: Upload data
Step one is consistent with usage scenario one (manual submission) (see 3.2.1 Step One: Upload Data for details).
3.3.2 Step Two: Construct Reference Genome (Optional)
Step two is consistent with usage scenario one (manual submission) (see 3.2.2 Step Two: Construct Reference Genome for details).
3.3.3 Step Three: Sample Information Entry Table Download
- Click Navigation Bar [Data], select [Table] -[Download] (as shown in Figure 3 -14 ), click [Data model template], select scRNA-seq _ v3.2.0 template for download.

- After opening scRNA-seq_v3.2.0 sample template Excel as shown in Figure 3-15.

3.3.4 Step Four: Sample Information Import
- Under this usage scenario, the sample import table must be filled out Work Table (Figure 3-15 ). This scenario indicates that after importing the sequenced sample data into the table, it directly proceeds to analysis.
Note
Excel Notes:
[ 1 ] The file path for import must already exist on the cloud platform.
[ 2 ] In the template, all content is mandatory, and no field may be left empty.
[ 3 ] The SampleID in Excel must be unique; if the SampleID is the same, it is assumed to be the same task.
[ 4 ] Cells in Excel cannot be merged, and there should be no spaces or special characters before or after the cell content.
[5 ] Sample Entry for Analysis (Figure 3-16):
sampleID: Sample name, defaults to be consistent with Entity ID ;
cDNA: cDNA library fastq format R1, R 2 sequences, note that for each pair of fastq, select R 1 first, then select R 2;
Oligo: oligo library fastq format R1, R 2 sequences, note that for each pair of fastq, select R 1 first, then select R 2;
genomeDir: Reference genome file; if constructed independently in step two (3.3.2 Build Reference Genome), select the star_ index file within the Task ID folder.
Outdir: Output file path;
expectcells: Expected cell count, can be selected based on experience;
forcecells: Force selection of bead count, forcibly cut off bead count based on drop chart; If it is 0, retain all cells;
Cpu: Required CPU size for operation, the minimum required CPU value is 8, recommended value is 16 ;
Mem: Required memory size for operation.

- After configuring the sample template analysis sample entry work table, return to [Data] interface, click [Table] - [+Add table] (Figure 3 -17).

Click [Click to upload/Drop here] browse and select the completed sample information table, then click [confirm] (Figure 3-18),after the upload is complete, the file will be displayed in the target folder.
Click on the navigation bar [Workflow] to enter the Process Analysis page, enter scRNA-seq_v3 in the search box, and click [Run] (Figure 3 -19).

- Select Run workflow(s), click on Please select table, choose the table imported in subsection 3), select the required rows, and click [Next] (Figure 3 -20 ).

- Click on Values and select the corresponding values, such as cDNA selection of ${cDNA1} and ${cDNA2}, noting to select in order (as shown in Figure 3-21); Select Oligo ${Oligo1} and ${Oligo2}, noting to select in order (as shown in Figure 3-22).

- Enter sample information, and after completion click [Next], ensuring that the parameter settings are correct (as shown in Figure 3-23).

3.3.5 Step Five: Start Analysis
Click [Run] to start the analysis (Figure 3 -24 ).

3.4 View Report and Download Result Files
- Click on the navigation bar [Task], when the task status shows as completed, it indicates that the task is complete, and you can view the report section (as shown in Figure 3-25 ).

- Click on the navigation bar [Data], enter the Data Management page, search according to the task's Task ID, and click to enter the Task ID folder (as shown in Figure 3 -26 ).

- Click to enter the Entity ID folder (as shown in Figure 3 -27 ).

- Click to enter 04. report folder (as shown in Figure 3 -28 ).

- Select Entity ID_report.html folder, click [Download] -[Raysync download] (as shown in Figure 3 -29 ).

- Click [Transfer]-[Download]-[Confirm], select the target directory and download the report (as shown in Figure 3 -30 ).

4. FAQ
- What are the official single-cell processes and what are the corresponding versions of the kits?
scRNA-seq_v3 and scRNA-seq-3.1.5 are the officially maintained processes.

scRNA-seq-3.1.5, corresponding to the kit DNBelab C Series High-throughput Single-cell RNA Library Preparation Set V2.0.
scRNA-seq_v3 corresponds to dnbc4tools v2.1.2 of MGI, and corresponds to the kit: DNBelab C Series High-throughput Single-cell RNA Library Preparation Set V3.0.
- What are the requirements for the gtf file format after the Reference is constructed?
The chromosome name in the gtf file should be the same as the chromosome name in the genome file.