2 Functions

This chapter documents the function-level modules in IOBRportal (stepwise, modular analysis).

2.1 Data Preparation

2.1.1 Counts to TPM

Convert raw gene-level read counts to TPM (Transcripts Per Million) with gene-length normalization and library-size scaling. Optional log2 transformation can be applied after TPM conversion.

Example input

                  TCGA-BR-6455  TCGA-BR-7196  TCGA-BR-8371  TCGA-BR-8380
ENSG00000000003      8006          2114          767           1556
ENSG00000000005      1             0             5             5
ENSG00000000419      3831          2600          1729          1760
ENSG00000000457      1126          745           1040          1260

Parameters

  • Organism (org)
    • Human (hsa)
    • Mouse (mmus)
  • ID Type (idType) β€” depends on the selected organism
    • Human: Ensembl / Entrez / Symbol
    • Mouse: Ensembl / MGI / Symbol
  • Source (source) for gene-length annotation
    • local: use built-in local annotation resources
    • biomart: retrieve annotation from Ensembl BioMart
  • Log2
    • True: apply log2 transformation after TPM conversion
    • False: keep TPM values on the original scale

Steps

  1. Upload a count matrix.
  2. Select the Organism.
  3. Select the appropriate ID Type.
  4. Choose the Source for gene-length annotation.
  5. Choose whether to apply Log2 transformation.
  6. Click Run Analysis.

Example output

           TCGA-BR-6455   TCGA-BR-7196   TCGA-BR-8371   TCGA-BR-8380
MT-CO1     14.61531657    14.55416750    15.85331284    15.61442176
MT-ND4     13.82638047    13.86929727    15.55664588    14.97415153
MT-CO3     13.60573394    13.85689319    15.53842439    15.07683124
IGKC       13.27294092    16.26343514    9.480865354    12.09469208

Download

  • Results can be exported from the Download panel.

2.1.2 Annotate ExpressionSet

Annotate an expression matrix with gene symbols using built-in annotation resources. The module maps probe or gene identifiers to gene symbols, removes invalid entries, resolves duplicated genes, and optionally applies log2 transformation.

Example input

                  Sample_1   Sample_2   Sample_3   Sample_4
ENSG00000121410      12.5       10.8       14.2       13.1
ENSG00000175899       5.6        6.2        4.9        5.4
ENSG00000256069       0.0        0.3        0.1        0.0
ENSG00000111640       8.1        7.5        9.0        8.4

Parameters

  • Annotation (annotation)
    • RNA-seq (Human) β†’ anno_grch38
    • Affymetrix (Human) β†’ anno_hug133plus2
    • Illumina (Human) β†’ anno_illumina
    • RNA-seq (Mouse) β†’ anno_gc_vm32
  • Method (method) β€” used to summarize duplicated gene symbols
    • Mean
    • Sum
    • Sd
  • Log2
    • True: apply log2 transformation after annotation
    • False: keep values on the original scale

Steps

  1. Upload an expression matrix.
  2. Select the appropriate Annotation.
  3. Choose the Method for duplicated genes.
  4. Choose whether to apply Log2 transformation.
  5. Click Run Analysis.

Example output

         Sample_1   Sample_2   Sample_3   Sample_4
MT-CO1      14.62      14.55      15.85      15.61
MT-ND4      13.83      13.87      15.56      14.97
IGKC        13.27      16.26       9.48      12.09
EPCAM       10.42      10.10      11.03      10.88

Download

  • Results can be exported from the Download panel.

2.1.3 Detect Outlier Samples

Identify outlier samples from an expression matrix and remove them from the dataset. The module displays the cleaned matrix, shows the detected outlier samples, and provides a diagnostic plot for quality assessment.

Example input

                  TCGA-BR-6455  TCGA-BR-7196  TCGA-BR-8371  TCGA-BR-8380
MT-CO1               14.62         14.55         15.85         15.61
MT-ND4               13.83         13.87         15.56         14.97
MT-CO3               13.61         13.86         15.54         15.08
IGKC                 13.27         16.26          9.48         12.09

Steps

  1. Upload an expression matrix.
  2. Click Run Analysis.
  3. View the cleaned expression matrix in the Data_clean tab.
  4. View the diagnostic plot in the Plot tab.
  5. Check the detected outlier samples in the Outliers tab.

Example output

                  TCGA-BR-6455  TCGA-BR-7196  TCGA-BR-8380
MT-CO1               14.62         14.55         15.61
MT-ND4               13.83         13.87         14.97
MT-CO3               13.61         13.86         15.08
IGKC                 13.27         16.26         12.09

Outliers

  • Example: TCGA-BR-8371

Download

  • Cleaned data can be exported from the Download panel.
  • The diagnostic plot can be exported from the Plot tab.

2.1.4 Remove Duplicate Genes

Aggregate duplicated gene symbols in an expression matrix using the selected method. The module keeps one value per gene symbol and summarizes duplicate rows by Mean, SD, or Sum.

Example input

                  id        TCGA-BR-6455  TCGA-BR-7196  TCGA-BR-8371  TCGA-BR-8380
1                 IGKC          13.27         16.26          9.48         12.09
2                 EPCAM         10.42         10.10         11.03         10.88
3                 IGKC          12.95         15.80          9.12         11.74
4                 KRT18         11.63         11.20         12.01         11.56

Parameters

  • Symbol Column (column_of_symbol)
    • Enter the column name containing gene symbols
    • Example: id
  • Method (method)
    • Mean
    • SD
    • Sum

Steps

  1. Upload an expression matrix containing one column of gene symbols.
  2. Enter the gene symbol column name in Symbol Column.
  3. Select the aggregation Method.
  4. Click Run Analysis.

Example output

                  TCGA-BR-6455  TCGA-BR-7196  TCGA-BR-8371  TCGA-BR-8380
IGKC                 13.11         16.03          9.30         11.92
EPCAM                10.42         10.10         11.03         10.88
KRT18                11.63         11.20         12.01         11.56

Download

  • Results can be exported from the Download panel.

2.1.5 Correct Batch Effect

Integrate multiple expression datasets and correct batch effects for downstream analysis. The module accepts two required datasets and one optional dataset, performs batch correction, returns the adjusted expression matrix, and displays a PCA plot for quality assessment.

Example input

Dataset 1

                  Sample_1   Sample_2   Sample_3   Sample_4
ENSG00000000003      8006       2114        767       1556
ENSG00000000005         1          0          5          5
ENSG00000000419      3831       2600       1729       1760
ENSG00000000457      1126        745       1040       1260

Dataset 2

                  Sample_5   Sample_6   Sample_7   Sample_8
ENSG00000000003      7560       1980        845       1498
ENSG00000000005         0          2          3          4
ENSG00000000419      4012       2505       1810       1695
ENSG00000000457      1203        802        998       1184

Dataset 3 (optional)

                  Sample_9   Sample_10  Sample_11  Sample_12
ENSG00000000003      7902       2056        801       1520
ENSG00000000005         1          1          4          6
ENSG00000000419      3925       2588       1765       1712
ENSG00000000457      1154        768       1024       1211

Parameters

  • ID Type (id_type) β€” select the gene identifier format used in the uploaded datasets
    • Ensembl: Ensembl gene IDs
    • Symbol: gene symbols
  • Data Type (data_type) β€” select the expression data format of the uploaded datasets
    • Array: microarray expression data
    • Count: raw count matrix
    • Tpm: TPM-normalized expression matrix

Steps

  1. Upload Dataset 1.
  2. Upload Dataset 2.
  3. Optionally upload Dataset 3.
  4. Select the appropriate ID Type.
  5. Select the appropriate Data Type.
  6. Click Run Analysis.
  7. View the adjusted expression matrix in the Data tab.
  8. View the PCA plot in the Plot tab.

Example output

                  Sample_1   Sample_2   Sample_3   Sample_4   Sample_5   Sample_6
ENSG00000000003    13.42      12.11      10.67      11.52      13.30      12.03
ENSG00000000005     0.25       0.00       0.58       0.61       0.00       0.32
ENSG00000000419    12.68      12.11      11.54      11.60      12.75      12.02
ENSG00000000457    11.21      10.63      11.08      11.34      11.30      10.78

Download

  • Batch-corrected data can be exported from the Download panel.
  • The PCA plot can be exported from the Plot tab.

2.1.6 Mouse-to-Human Genes

Convert mouse gene symbols in an expression matrix to human homolog gene symbols. The module supports matrix-formatted input or data frames with a dedicated gene symbol column.

Example input

                  Sample_1   Sample_2   Sample_3   Sample_4
Cd3d                 8.21       7.95       8.44       8.10
Epcam                5.42       5.18       5.63       5.37
Col1a1               9.87      10.12       9.65       9.94
Mki67                6.30       6.05       6.88       6.41

Parameters

  • Source (source) β€” the source used for mouse-to-human homolog mapping
    • Ensembl: retrieve homolog information from Ensembl
    • Local: use built-in local annotation resources
  • Matrix (is_matrix) β€” whether the uploaded data is a pure expression matrix
    • True: row names are treated as gene symbols
    • False: gene symbols are stored in a separate column
  • Symbol Column (column_of_symbol) β€” the column containing mouse gene symbols when Matrix is False
    • Example: symbol

Steps

  1. Upload a mouse expression dataset.
  2. Select the homolog mapping Source.
  3. Choose whether the input is a pure Matrix.
  4. If Matrix is False, enter the gene symbol column name in Symbol Column.
  5. Click Run Analysis.

Example output

                  Sample_1   Sample_2   Sample_3   Sample_4
CD3D                 8.21       7.95       8.44       8.10
EPCAM                5.42       5.18       5.63       5.37
COL1A1               9.87      10.12       9.65       9.94
MKI67                6.30       6.05       6.88       6.41

Download

  • Results can be exported from the Download panel.

2.2 SigScore Calculation

2.2.1 Calculate SigScores

Compute signature scores from an expression matrix using predefined gene signature collections. The module supports four scoring methods, including PCA, ssGSEA, z-score, and integration, and returns a sample-level score matrix for downstream analysis.

This module is designed for transcriptome-based functional characterization. It can be used to quantify immune, metabolic, tumor-related, and pathway-associated signatures across samples. The output is typically a matrix in which each row represents one sample and each column represents one signature.

Example input

                  TCGA-2F-A9KO  TCGA-2F-A9KP  TCGA-2F-A9KQ  TCGA-2F-A9KR
CD8A                  8.421         7.952         6.884         8.103
GZMB                  7.116         6.208         5.774         6.945
EPCAM                 5.238         4.992         5.641         5.310
MKI67                 6.803         7.126         6.542         6.918

Parameters

  • Method (method) β€” the algorithm used to calculate signature scores
    • PCA: calculates signature scores based on principal component analysis
    • ssGSEA: calculates enrichment scores for each sample and signature
    • Z-score: calculates standardized scores based on the expression pattern of signature genes
    • Integration: integrates multiple scoring strategies into a combined score
  • Signature (signature) β€” the signature collection used for scoring
    • TME: tumor microenvironment-related signatures
    • Metabolism: metabolism-related signatures
    • Tumor: tumor biology-related signatures
    • Collection: integrated signature collection provided in IOBR
    • Go_bp: Gene Ontology biological process signatures
    • Go_cc: Gene Ontology cellular component signatures
    • Go_mf: Gene Ontology molecular function signatures
    • KEGG: KEGG pathway signatures
    • Hallmark: MSigDB hallmark gene sets
    • Reactome: Reactome pathway signatures
  • Mini gene count (mini_gene_count) β€” the minimum number of matched genes required to calculate a valid score for a signature
    • Example: 3
    • Signatures with fewer matched genes than this threshold may be skipped or filtered
  • Adjust eset (adjust_eset) β€” whether to internally adjust the expression matrix before score calculation
    • True: apply internal adjustment before scoring
    • False: use the input matrix directly

Steps

  1. Upload an expression matrix.
  2. Select the signature scoring Method.
  3. Select the desired Signature collection.
  4. Set the Mini gene count threshold.
  5. Choose whether to Adjust eset.
  6. Click Run Analysis.
  7. View the score matrix in the Data tab.

Example output

                   ID          CD_8_T_effector       DDR           APM    Immune_Checkpoint  CellCycle_Reg
1         TCGA-2F-A9KO         4.7093          -4.3653       3.1724          4.5259           -1.3468
2         TCGA-2F-A9KP        -1.6480           5.0614      -1.3928         -1.4447            3.2313
3         TCGA-2F-A9KQ        -2.1915         -11.1568      -1.8568         -1.7691            0.6771
4         TCGA-2F-A9KR         0.0528           3.2845       1.6877         -0.2206           -1.3867
5         TCGA-2F-A9KT        -0.9226           7.1762      -1.6106         -1.0915           -1.1749

Output interpretation

  • Each row represents one sample.
  • Each column represents one signature score.
  • Positive values generally indicate relatively higher activity or enrichment of that signature in the sample.
  • Negative values generally indicate relatively lower activity or enrichment relative to other samples or the scoring framework.
  • The exact scale and interpretation may vary depending on the selected Method.

Notes

  • The input expression matrix should contain genes as rows and samples as columns.
  • Gene identifiers should be compatible with the selected signature resource and the preprocessing workflow used upstream.
  • Different methods may produce score matrices with different value distributions, so results from different methods should not be mixed without caution.
  • A larger Mini gene count threshold is usually more stringent, but may reduce the number of retained signatures.
  • This module is often used after preprocessing steps such as annotation, duplicate-gene handling, and expression normalization.

Download

  • Results can be exported from the Download panel.

2.3 TME Deconvolution

2.3.1 Deconvolute TME

Perform tumor microenvironment deconvolution from bulk transcriptome data using multiple supported algorithms. The module estimates immune or stromal cell fractions, enrichment scores, or immune-related indices depending on the selected method.

This module supports several commonly used deconvolution methods, including CIBERSORT, EPIC, quanTIseq, xCell, ESTIMATE, TIMER, MCPcounter, IPS, and an Integration mode that combines multiple methods into one result table.

Example input

                  TCGA-BR-6455  TCGA-BR-7196  TCGA-BR-8371  TCGA-BR-8380
CD8A                  8.421         7.952         6.884         8.103
EPCAM                 5.238         4.992         5.641         5.310
COL1A1                9.114         8.773         9.562         9.028
GZMB                  7.116         6.208         5.774         6.945

Parameters

  • Method (method) β€” the deconvolution algorithm used for analysis
    • CIBERSORT: estimates immune cell fractions using the LM22 reference signature
    • EPIC: estimates immune and stromal cell fractions, commonly used for tumor RNA-seq data
    • quanTIseq: estimates immune cell fractions from bulk expression data
    • xCell: calculates enrichment scores for multiple immune and stromal cell types
    • ESTIMATE: calculates stromal score, immune score, and ESTIMATE score
    • TIMER: estimates major immune cell infiltration levels and requires a cancer type
    • MCPcounter: quantifies immune and stromal cell populations using marker genes
    • IPS: calculates immunophenoscore-related features
    • Integration: runs multiple deconvolution methods and merges valid results by sample ID
  • CIBERSORT parameters
    • Array (arrays) β€” whether the input comes from microarray data
      • True: use microarray-optimized mode
      • False: use RNA-seq mode
    • Perm (perm) β€” number of permutations used in CIBERSORT
      • Example: 100
      • Larger values are slower but generally more stable
    • Absolute (absolute.mode) β€” whether to run CIBERSORT in absolute mode
      • True
      • False
    • Absolute Method (abs.method) β€” method used when absolute mode is enabled
      • Sigscore
      • No Sum-to-1
    • Parallel (parallel) β€” whether to enable parallel computation
      • True
      • False
  • EPIC parameters
    • Tumor (tumor) β€” whether the samples are tumor samples
      • True
      • False
    • Scale (scale_mrna) β€” whether to apply mRNA-content scaling when supported
      • True
      • False
  • quanTIseq parameters
    • Array (arrays)
      • True
      • False
    • Tumor (tumor)
      • True
      • False
    • Scale (scale_mrna)
      • True
      • False
  • xCell parameters
    • Array (arrays) β€” whether the input is microarray data
      • True
      • False
  • ESTIMATE parameters
    • Platform (platform) β€” platform type used by ESTIMATE
      • affymetrix
      • agilent
      • illumina
  • TIMER parameters
    • Cancer Type (group_list) β€” tumor type required by TIMER
      • Example: BRCA, STAD, LUAD, LIHC
  • Integration parameters
    • Array (array) β€” whether the input is microarray data
      • True
      • False
    • Permutations (permutation) β€” number of permutations used for CIBERSORT in the integration workflow
    • Cancer Type (for TIMER) (tumor_type) β€” tumor type used by TIMER inside the integration workflow

Steps

  1. Upload a gene expression matrix.
  2. Select the deconvolution Method.
  3. Adjust the method-specific parameters displayed in the left panel.
  4. Click Run Analysis.
  5. View the deconvolution result table in the Data tab.
  6. Export the results from the Download panel.

Example output

           ID         Bcells_EPIC  CAFs_EPIC  CD4_Tcells_EPIC  CD8_Tcells_EPIC
1     TCGA-BR-6455      0.0151       0.0848          0.0451          9.65e-7
2     TCGA-BR-7196      0.0642       0.2541          0.0415          1.26e-7
3     TCGA-BR-8371      0.1562       0.0457          0.0705          0.0078
4     TCGA-BR-8380      0.0202       0.2044          0.0613          0.0141
5     TCGA-BR-8592      0.0575       0.2250          0.0792          0.0065

Output interpretation

  • The first column is the sample identifier.
  • The remaining columns are estimated cell fractions, enrichment scores, or immune-related scores, depending on the selected method.
  • Output column names are usually suffixed by the method name, such as _EPIC, _xCell, _MCPcounter, _TIMER, _quantiseq, _CIBERSORT, or _estimate.
  • Different methods produce different types of values, so results from different methods should be interpreted within the context of that method.

Notes

  • The input matrix should contain genes as rows and samples as columns.
  • Gene symbols are recommended for deconvolution. Ensembl IDs may cause method failure, especially for methods that expect HGNC symbols.
  • Input values should usually be non-log expression values suitable for deconvolution workflows.
  • CIBERSORT may take substantially longer than other methods, especially with larger permutation counts.
  • TIMER requires a valid cancer type selection.
  • Integration runs multiple methods and may take much longer than a single-method analysis.
  • Some methods estimate relative cell abundance, whereas others produce enrichment or scoring values rather than direct fractions.

Download

  • Results can be exported from the Download panel.

2.4 Statistical Analysis

2.4.1 Batch Correlation

Calculate correlations between one target variable and multiple selected features in a batch manner. The module supports Pearson or Spearman correlation and returns a summary table for downstream interpretation.

Example input

                  ID         CD_8_T_effector   DDR      APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO         4.7093        -4.3653   3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP        -1.6480         5.0614  -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ        -2.1915       -11.1568  -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR         0.0528         3.2845   1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT        -0.9226         7.1762  -1.6106        -1.0915           -1.1749

Parameters

  • Target (target) β€” the main variable used as the correlation reference
  • Features (feature) β€” one or more variables to be correlated with the target
  • Method (method) β€” correlation method used for analysis
    • Spearman: rank-based correlation, more robust to non-normal data
    • Pearson: linear correlation for continuous numeric data

Steps

  1. Upload a data matrix with samples in rows and variables in columns.
  2. Select one Target variable.
  3. Select one or more Features.
  4. Optionally click Select All to use all available features except the target.
  5. Choose the correlation Method.
  6. Click Run Analysis.
  7. View the result table in the Data tab.

Example output

Example output

                  sig_names                       p.value     statistic      p.adj    log10pvalue   stars
1                 IFNG_signature_Ayers_et_al      2.56e-21      0.9050    4.54e-19       20.5912    ****
2                 TMEscoreA_CIR                   1.25e-18      0.8784    1.10e-16       17.9045    ****
3                 TIP_Killing_of_cancer_cells_1   5.92e-18      0.8706    3.49e-16       17.2277    ****
4                 CD8_Rooney_et_al                1.39e-17      0.8660    6.16e-16       16.8563    ****

Download

  • Results can be exported from the Download panel.

2.4.2 Batch Partial Correlation

Compute partial correlations between one target variable and multiple selected features while adjusting for a control variable. The module performs batch analysis and returns a summary table of partial correlation results.

Example input

                  ID           CD_8_T_effector   DDR      APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO         4.7093        -4.3653   3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP        -1.6480         5.0614  -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ        -2.1915       -11.1568  -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR         0.0528         3.2845   1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT        -0.9226         7.1762  -1.6106        -1.0915           -1.1749

Parameters

  • Control Variable (interferenceid) β€” the variable used for adjustment in partial correlation analysis
  • Target Variable (target) β€” the main variable to be correlated with all selected features
  • Features (features) β€” one or more variables tested against the target variable
  • Method (method) β€” correlation method used for partial correlation
    • Pearson
    • Spearman
    • Kendall

Steps

  1. Upload a data matrix with samples in rows and variables in columns.
  2. Select one Control Variable.
  3. Select one Target Variable.
  4. Select one or more Features.
  5. Choose the correlation Method.
  6. Click Run Analysis.
  7. View the result table in the Data tab.

Example output

                  sig_names                      p.value     statistic      p.adj    log10pvalue  stars
1                 CD8_c11_Teff_SEMA4A             9.13e-10     -0.7190    9.51e-8        9.0396    ****
2                 Cytokine_Receptors_Li_et_al     1.07e-9      -0.7170    9.51e-8        8.9710    ****
3                 IL_iCAF                         9.56e-9      -0.6872    5.67e-7        8.0195    ****
4                 T_cell_exhaustion_Peng_et_al    1.37e-8      -0.6819    6.12e-7        7.8618    ****

Download

  • Results can be exported from the Download panel.

2.4.3 Batch Survival

Run batch Cox proportional hazards survival analysis for multiple variables in one dataset. The module evaluates each selected feature against survival outcome and returns hazard ratio statistics, confidence intervals, and P values.

This module is designed for integrated datasets that contain both clinical survival information and quantitative variables, such as signature scores, immune infiltration estimates, pathway scores, or other numeric biomarkers.

Example input

          ID           OS_time   OS_status   B_cells_naive   T_cells_CD8   StromalScore   ImmuneScore   ESTIMATEScore
1         TCGA-3M-AB46   58.83     0           0.0264          0.0290        -0.6766        -1.3235       -1.0774
2         TCGA-3M-AB47             1           0.1680          0.0617         0.9613         0.0518        0.5563
3         TCGA-B7-5818   11.87     0           0.0380          0.1248        -0.2425         0.8001        0.2930
4         TCGA-B7-A5TI   19.83     0           0.1163          0.0653         0.4017        -0.4095        0.0027
5         TCGA-BR-4187    4.70     1           0.0641          0.0515         2.2367         1.2632        1.9032

Parameters

  • Variables (variable) β€” one or more quantitative variables tested in Cox survival analysis
    • These usually include signature scores, deconvolution results, pathway scores, or other numeric biomarkers
  • Status (status) β€” the event indicator column used in survival analysis
    • Example: OS_status
    • Typical coding is 0 for censored and 1 for event
  • Time (time) β€” the survival time column
    • Example: OS_time, RFS_time, or PFS_time
  • best_cutoff (best_cutoff) β€” whether to determine an optimal cutoff for grouping before survival analysis
    • True
    • False

Steps

  1. Upload a dataset containing both survival information and numeric variables.
  2. Select one or more Variables for batch testing.
  3. Select the survival Status column.
  4. Select the survival Time column.
  5. Choose whether to enable best_cutoff.
  6. Click Run Analysis.
  7. View the Cox regression result table in the Data tab.

Example output

            ID                 P         HR      CI_low_0.95    CI_up_0.95
1         Neu_05_Peri        0.067993   0.2718      0.0671        1.1010
2         Neu_06_Per         0.081241   0.3054      0.0805        1.1586
3         Mono_like          0.083310   0.2961      0.0747        1.1739
4         Neu_07_APO         0.091479   0.3449      0.1002        1.1874
5         Chemokine          0.092081   0.2723      0.0600        1.2370

Output interpretation

  • ID indicates the tested variable name.
  • P is the Cox regression P value.
  • HR is the hazard ratio.
  • CI_low_0.95 and CI_up_0.95 are the lower and upper bounds of the 95% confidence interval.
  • HR > 1 suggests higher risk associated with higher variable values.
  • HR < 1 suggests a protective association.

Download

  • Results can be exported from the Download panel.

2.4.4 Batch Wilcoxon

Run batch Wilcoxon rank-sum tests for multiple numeric features between two groups. The module compares feature distributions across a selected binary grouping variable and returns summary statistics for each tested feature.

Example input

                  ID     cluster    CD_8_T_effector      DDR       APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO    TME1            4.7093       -4.3653    3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP    TME2           -1.6480        5.0614   -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ    TME2           -2.1915      -11.1568   -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR    TME3            0.0528        3.2845    1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT    TME1           -0.9226        7.1762   -1.6106        -1.0915           -1.1749

Parameters

  • Group(=2) (target) β€” the grouping variable with exactly two categories
    • Example: treatment group, subtype, sex, or response group
  • Features (feature) β€” one or more numeric variables tested between the two groups
    • These usually include signature scores, deconvolution results, pathway scores, or other quantitative features
  • Feature Manipulation (feature_manipulation) β€” whether to perform internal feature processing before statistical testing
    • True
    • False

Steps

  1. Upload a dataset containing one two-group variable and multiple numeric features.
  2. Select the Group(=2) column.
  3. Select one or more Features.
  4. Choose whether to enable Feature Manipulation.
  5. Click Run Analysis.
  6. View the result table in the Data tab.

Example output

                sig_names      p.value      TME1       TME2       TME3      mean       p.adj      log10pvalue   stars
1               CD8_c12_Trm      2.05e-9   -3.3378   -18.1813    21.5191    4.0193     3.51e-8       8.6879      ****
2                CD4_c3_Tfh      2.05e-9    0.1222    -7.4525     7.3304    0.5722     3.51e-8       8.6879      ****
3               DC_03_LAMP3      2.05e-9    0.0961    -6.2516     6.1555    0.4846     3.51e-8       8.6879      ****
4               Neu_05_ELL2      2.19e-9    0.5149    -4.8792     4.3643    0.0608     3.51e-8       8.6603      ****

Download

  • Results can be exported from the Download panel.

2.4.5 Batch Kruskal-Wallis

Run batch Kruskal–Wallis tests for multiple numeric features across three or more groups. The module compares feature distributions among selected categories and returns summary statistics for each tested feature.

Example input

              ID         cluster    CD_8_T_effector      DDR       APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO    TME1            4.7093       -4.3653    3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP    TME2           -1.6480        5.0614   -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ    TME2           -2.1915      -11.1568   -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR    TME3            0.0528        3.2845    1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT    TME1           -0.9226        7.1762   -1.6106        -1.0915           -1.1749

Parameters

  • Group(>2) (group) β€” the grouping variable with three or more categories
    • Example: molecular subtype, stage group, or histological class
  • Features (feature) β€” one or more numeric variables tested across the groups
    • These usually include signature scores, deconvolution results, pathway scores, or other quantitative features
  • Feature Manipulation (feature_manipulation) β€” whether to perform internal feature processing before statistical testing
    • True
    • False

Steps

  1. Upload a dataset containing one multi-group variable and multiple numeric features.
  2. Select the Group(>2) column.
  3. Select one or more Features.
  4. Choose whether to enable Feature Manipulation.
  5. Click Run Analysis.
  6. View the result table in the Data tab.

Example output

                sig_names      p.value      TME1       TME2       TME3      mean       p.adj      log10pvalue   stars
1               CD8_c12_Trm      2.05e-9   -3.3378   -18.1813    21.5191    4.0193     3.51e-8       8.6879      ****
2                CD4_c3_Tfh      2.05e-9    0.1222    -7.4525     7.3304    0.5722     3.51e-8       8.6879      ****
3               DC_03_LAMP3      2.05e-9    0.0961    -6.2516     6.1555    0.4846     3.51e-8       8.6879      ****
4               Neu_05_ELL2      2.19e-9    0.5149    -4.8792     4.3643    0.0608     3.51e-8       8.6603      ****

Download

  • Results can be exported from the Download panel.

2.5 TME Interaction

2.5.1 TME Clustering

Perform tumor microenvironment clustering analysis based on selected numeric features. The module groups samples into clusters across a specified range of cluster numbers, returns the clustering result table, and optionally displays a heatmap grouped by cluster.

Example input

               ID            CD_8_T_effector   DDR      APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO         4.7093        -4.3653   3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP        -1.6480         5.0614  -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ        -2.1915       -11.1568  -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR         0.0528         3.2845   1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT        -0.9226         7.1762  -1.6106        -1.0915           -1.1749

Parameters

  • Features (features) β€” one or more numeric variables used for clustering
    • These usually include TME-related scores, immune infiltration estimates, or other quantitative signatures
  • Min Clusters (min_nc) β€” minimum number of clusters evaluated during clustering
    • Example: 2
  • Max Clusters (max.nc) β€” maximum number of clusters evaluated during clustering
    • Example: 6

Steps

  1. Upload a dataset containing sample IDs and numeric TME-related features.
  2. Select one or more Features for clustering.
  3. Set the Min Clusters value.
  4. Set the Max Clusters value.
  5. Click Run Analysis.
  6. View the clustering heatmap in the Plot tab.
  7. View the clustered data in the Data tab.
  8. View the cluster size summary in the Cluster tab.

Example output

              ID         cluster    CD_8_T_effector      DDR       APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO    TME1            4.7093       -4.3653    3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP    TME2           -1.6480        5.0614   -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ    TME2           -2.1915      -11.1568   -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR    TME3            0.0528        3.2845    1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT    TME1           -0.9226        7.1762   -1.6106        -1.0915           -1.1749

Download

  • The result plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.5.2 Ligand–Receptor Interaction

Quantify ligand-receptor interactions in the tumor microenvironment from bulk transcriptome data. The module calculates ligand-receptor interaction scores and returns a result table for downstream analysis.

Example input

                  TCGA-3M-AB46  TCGA-3M-AB47  TCGA-B7-5818  TCGA-B7-A5TI
ENSG00000112715      120.0         98.0          135.0         142.0
ENSG00000153563       45.0         52.0           39.0          41.0
ENSG00000169429      310.0        288.0          356.0         330.0
ENSG00000125347       22.0         18.0           31.0          27.0

Parameters

  • Data Type (data_type) β€” the expression data format used for ligand-receptor analysis
    • Count
    • TPM
  • ID Type (id_type) β€” the gene identifier format used in the input matrix
    • Ensembl
    • Symbol

Steps

  1. Upload an expression matrix.
  2. Select the Data Type.
  3. Select the ID Type.
  4. Click Run Analysis.
  5. View the ligand-receptor interaction result table in the Data tab.

Example output

              ID           ADM_CALCRL   ADM_MRGPRX2   ADM_RAMP2   ADM2_CALCRL   ADM2_RAMP1
1         TCGA-BR-6455      3.0609        0.0000       3.0609       2.6353        2.6353
2         TCGA-BR-7196      3.8971        0.0000       3.8971       1.2811        1.2811
3         TCGA-BR-8371      2.4770        0.0000       3.2104       1.8195        1.8195
4         TCGA-BR-8380      3.7969        0.0000       3.7969       2.1996        2.1996
5         TCGA-BR-8592      3.8942        0.1866       3.9470       3.1062        3.1062

2.6 Visualization

2.6.1 Heatmap

Draw a customizable heatmap for selected features across samples and group the samples by a chosen annotation column. The module is suitable for visualizing signature scores, immune-related variables, pathway scores, or other numeric features across categorical sample groups.

Example input

              ID         cluster    CD_8_T_effector      DDR       APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO    TME1            4.7093       -4.3653    3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP    TME2           -1.6480        5.0614   -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ    TME2           -2.1915      -11.1568   -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR    TME3            0.0528        3.2845    1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT    TME1           -0.9226        7.1762   -1.6106        -1.0915           -1.1749

Parameters

  • Groups (group) β€” the categorical annotation column used to group samples in the heatmap
    • Example: subtype, treatment group, response group, or stage
  • Features (features) β€” one or more numeric variables displayed in the heatmap
    • These usually include signature scores, deconvolution results, pathway scores, or other quantitative features
  • Auto-select (surv_top_n) β€” automatically select the top N features when this module is linked to an upstream ranking result
    • Example: 20
  • Scale (scale) β€” whether to scale feature values before plotting
    • True
    • False
  • Palette (palette) β€” preset palette index for the heatmap
    • 1
    • 2
    • 3
    • 4
  • Palette Group (palette_group) β€” preset color palette for sample group annotations
    • nrc
    • jama
    • aaas
    • jco
    • paired1
    • paired2
    • paired3
    • paired4
    • accent
    • set2
  • Colors Group (cols_group) β€” custom colors for sample group annotations
    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Palette Group
  • Colors Heatmap (cols_heatmap) β€” custom colors for heatmap low-mid-high values
    • Provide at least 3 colors separated by commas
  • Row Font Size (size_row) β€” font size of feature labels in the heatmap
    • Example: 8

Steps

  1. Upload a dataset containing one grouping column and multiple numeric features.
  2. Select the Groups column.
  3. Select one or more Features.
  4. Optionally set Auto-select if the module is connected to an upstream ranked result.
  5. Choose whether to apply Scale.
  6. Select a preset Palette and Palette Group, or provide custom colors in Colors Group and Colors Heatmap.
  7. Adjust Row Font Size if needed.
  8. Click Run Analysis.
  9. View the heatmap in the Plot tab.

Example output

Download

  • The heatmap can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.2 Box Plot

Create a customizable boxplot for one selected numeric signature or variable across sample groups. The module supports optional scaling, custom colors, jittered points, and P-value display for group comparisons.

Example input

              ID         cluster    CD_8_T_effector      DDR       APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO    TME1            4.7093       -4.3653    3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP    TME2           -1.6480        5.0614   -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ    TME2           -2.1915      -11.1568   -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR    TME3            0.0528        3.2845    1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT    TME1           -0.9226        7.1762   -1.6106        -1.0915           -1.1749

Parameters

  • Groups (variable) β€” the categorical grouping column used on the x-axis
    • Example: subtype, response group, treatment group, or stage
  • Signature (signature) β€” the numeric variable displayed on the y-axis
    • Example: signature score, immune infiltration estimate, or pathway score
  • Scale (scale) β€” whether to scale the selected signature before plotting
    • True
    • False
  • Palette (palette) β€” preset color palette for group display
    • nrc
    • jama
    • aaas
    • jco
    • paired1
    • paired2
    • paired3
    • paired4
    • accent
    • set2
  • Colors (cols) β€” custom group colors
    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Palette
  • Show Jitter (jitter) β€” whether to display sample points on top of boxplots
    • True
    • False
  • Show Pairwise P-value (show_pairwise_p) β€” whether to display pairwise comparison P values
    • True
    • False
  • Show Overall P-value (show_overall_p) β€” whether to display the overall group comparison P value
    • True
    • False
  • P-value Size (size_of_pvalue) β€” font size of displayed P values
    • Example: 6
  • Point Size (point_size) β€” size of jittered sample points
    • Example: 5
  • Font Size (size_of_font) β€” font size of plot text
    • Example: 10
  • X-axis Text Angle (angle_x_text) β€” rotation angle of x-axis labels
    • Example: 0
  • X-axis Text Justification (hjust) β€” horizontal justification of rotated x-axis labels
    • Example: 0.5

Steps

  1. Upload a dataset containing one grouping column and one or more numeric variables.
  2. Select the Groups column.
  3. Select the Signature column.
  4. Choose whether to apply Scale.
  5. Select a preset Palette or provide custom colors in Colors.
  6. Choose whether to show jitter points and P values.
  7. Adjust point size, font size, and x-axis label settings if needed.
  8. Click Run Analysis.
  9. View the boxplot in the Plot tab.

Example output

Download

  • The heatmap can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.3 Percentage Bar Plot

Plot a percentage bar chart for two categorical variables. The module summarizes the distribution of one category within another, supports custom colors, optional frequency labels, summary annotations, and coordinate flipping, and is suitable for comparing proportions across groups.

Example input

          ID           Subtype   Lauren       MSI   EBV   Hpylori
1         TCGA-3M-AB46   GS        Mixed        0     0     0
2         TCGA-3M-AB47   CIN       Mixed        0     0     0
3         TCGA-B7-5818   EBV       Diffuse      0     1     0
4         TCGA-B7-A5TI   MSI       Diffuse      1     0     0
5         TCGA-BR-4187   GS        Mixed        0     0     0

Parameters

  • X Variable (x) β€” the categorical variable displayed on the x-axis

    • Example: subtype, stage, response group, or molecular class
  • Y Variable (y) β€” the categorical variable used for stacked proportion display

    • Example: Lauren type, MSI status, EBV status, or sex
  • Palette (palette) β€” preset color palette for category display

    • nrc
    • jama
    • aaas
    • jco
    • paired1
    • paired2
    • paired3
    • paired4
    • accent
    • set2
  • Colors (color) β€” custom colors for categories

    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Palette
  • Title (title) β€” optional plot title

  • Axis Angle (axis_angle) β€” rotation angle of x-axis labels

    • Example: 0
  • Coord Flip (coord_flip) β€” whether to flip the plot coordinates

    • True
    • False
  • Add Frequency (add_Freq) β€” whether to display percentage or frequency labels on the bars

    • True
    • False
  • Freq Font Size (size_freq) β€” font size of frequency labels

    • Example: 8
  • Legend Text Size (legend.size.text) β€” font size of legend labels

    • Example: 10
  • Add Summary (add_sum) β€” whether to add summary annotations for each bar

    • True
    • False

Steps

  1. Upload a dataset containing categorical annotation variables.
  2. Select the X Variable.
  3. Select the Y Variable.
  4. Choose a preset Palette or provide custom colors in Colors.
  5. Optionally enter a plot Title.
  6. Adjust axis angle, coordinate direction, label size, and legend size if needed.
  7. Choose whether to show frequency labels and summary annotations.
  8. Click Run Analysis.
  9. View the percentage bar plot in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.4 Cell Bar Plot

Visualize cell composition or immune infiltration profiles across samples using a stacked bar plot. The module is suitable for deconvolution results such as CIBERSORT, EPIC, and quanTIseq, and supports custom colors, sample number control, legend position, and coordinate flipping.

Example input

               ID         B_cells_naive   T_cells_CD8   NK_cells_resting   Macrophages_M0   Macrophages_M2   Fibroblasts
1         TCGA-3M-AB46      0.0264          0.0290         0.0455             0.2464           0.1099          -0.3167
2         TCGA-3M-AB47      0.1680          0.0617         0.0601             0.0357           0.1241           1.1450
3         TCGA-B7-5818      0.0380          0.1248         0.0321             0.1035           0.2062          -0.0338
4         TCGA-B7-A5TI      0.1163          0.0653         0.0504             0.0978           0.2043           0.6185
5         TCGA-BR-4187      0.0641          0.0515         0.0530             0.0000           0.2551           1.8778

Parameters

  • ID column (id) β€” the sample identifier column
    • Example: ID
  • Features (features) β€” one or more cell-fraction or infiltration-related columns to display
    • These usually include deconvolution-derived cell fractions or abundance estimates
  • Sample Number (n) β€” number of samples displayed in the plot
    • Example: 10
  • Colors (cols) β€” custom colors for cell types
    • Enter color names or hex codes separated by commas
  • Title (title) β€” plot title
    • Default example: Cell Fraction
  • Legend Position (legend.position) β€” position of the legend
    • bottom
    • top
    • left
    • right
  • Palette (palette) β€” preset palette index
    • 1
    • 2
    • 3
    • 4
  • Coord Flip (coord_filp) β€” whether to flip the plot orientation
    • True
    • False

Steps

  1. Upload a deconvolution result matrix or other cell-composition dataset.
  2. Enter the sample ID column name.
  3. Select one or more infiltration Features.
  4. Set the Sample Number to control how many samples are displayed.
  5. Optionally provide custom Colors.
  6. Enter a plot Title if needed.
  7. Select Legend Position, Palette, and whether to enable Coord Flip.
  8. Click Run Analysis.
  9. View the stacked bar plot in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.5 Forest Plot

Produce a forest plot from batch survival analysis results. The module visualizes hazard ratios, confidence intervals, and P values for selected signatures or variables, and is suitable for summarizing survival-associated features.

Example input

          ID               P         HR      lower_0.95   upper_0.95
1         TMEscore_plus   0.0031    1.8425      1.2254        2.7710
2         StromalScore    0.0184    1.5362      1.0738        2.1980
3         ImmuneScore     0.0412    0.7124      0.5148        0.9858
4         ESTIMATEScore   0.0097    1.6841      1.1296        2.5110
5         CD8_T_cells     0.0275    0.6483      0.4412        0.9527

Parameters

  • Signature (signature) β€” the column containing signature or feature names
    • Example: ID
  • P-value (pvalue) β€” the column containing P values
    • Example: P
  • HR (HR) β€” the column containing hazard ratios
    • Example: HR
  • Lower CI (95%) (CI_low_0.95) β€” the column containing the lower bound of the 95% confidence interval
    • Example: lower_0.95
  • Upper CI (95%) (CI_up_0.95) β€” the column containing the upper bound of the 95% confidence interval
    • Example: upper_0.95
  • Signature Number (n) β€” the number of top signatures or variables to display
    • Example: 10
  • Text Size (text.size) β€” font size used in the forest plot
    • Example: 13
  • Colors (cols) β€” custom colors for forest plot elements
    • Enter at least 2 color names or hex codes separated by commas

Steps

  1. Upload a batch survival result table.
  2. Enter the column name for Signature.
  3. Enter the column names for P-value, HR, Lower CI (95%), and Upper CI (95%).
  4. Set the Signature Number to control how many results are displayed.
  5. Adjust Text Size if needed.
  6. Optionally provide custom Colors.
  7. Click Run Analysis.
  8. View the forest plot in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.6 Correlation Plot

Compute and visualize the correlation between two variables with optional subtype grouping and regression display. The module supports Pearson, Spearman, and Kendall correlation methods, and can generate a scatter plot with fitted line and correlation statistics.

Example input

              ID         cluster    CD_8_T_effector      DDR       APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO    TME1            4.7093       -4.3653    3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP    TME2           -1.6480        5.0614   -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ    TME2           -2.1915      -11.1568   -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR    TME3            0.0528        3.2845    1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT    TME1           -0.9226        7.1762   -1.6106        -1.0915           -1.1749

Parameters

  • Matrix (is.matrix) β€” whether the input is treated as a pure matrix

    • True
    • False
  • Variable 1 (var1) β€” the first numeric variable used in the correlation analysis

  • Variable 2 (var2) β€” the second numeric variable used in the correlation analysis

  • Subtype (subtype) β€” optional grouping variable used to color points by category

    • Example: subtype, response group, or treatment group
  • Scale (scale) β€” whether to scale the selected variables before analysis

    • True
    • False
  • Method (method) β€” correlation method

    • Pearson
    • Spearman
    • Kendall
  • Show Result (show_cor_result) β€” whether to display correlation statistics on the plot

    • True
    • False
  • Color Line(optional) (col_line) β€” color of the fitted regression line

    • Example: black or #E69F00
  • Subtype Colors (color_subtype) β€” custom colors for subtype groups

    • Enter color names or hex codes separated by commas
  • Title(optional) (title) β€” plot title

  • Title Size(optional) (title_size) β€” plot title size

    • Example: 1.5
  • Point Size (point_size) β€” size of scatter points

    • Example: 4
  • Point Transparency (alpha) β€” transparency of scatter points

    • Example: 0.5
  • Text Size (text_size) β€” size of plot text

    • Example: 10
  • Axis Label Angle (axis_angle) β€” rotation angle of axis labels

    • Example: 0
  • Hjust (hjust) β€” horizontal justification of rotated labels

    • Example: 0

Steps

  1. Upload a dataset containing numeric variables for correlation analysis.
  2. Choose whether the input should be treated as a Matrix.
  3. Select Variable 1 and Variable 2.
  4. Optionally select a Subtype column for grouped coloring.
  5. Choose whether to apply Scale.
  6. Select the correlation Method.
  7. Choose whether to show correlation statistics on the plot.
  8. Optionally set the regression line color, subtype colors, and plot title.
  9. Adjust point size, transparency, text size, and axis label settings if needed.
  10. Click Run Analysis.
  11. View the scatter plot in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.7 Correlation Matrix

Calculate and visualize a correlation matrix between two selected feature sets. The module supports Pearson, Spearman, and Kendall correlation methods, optional scaling, custom heatmap colors, significance annotation, and correlation-based filling.

Example input

              ID         cluster    CD_8_T_effector      DDR       APM    Immune_Checkpoint   CellCycle_Reg
1         TCGA-2F-A9KO    TME1            4.7093       -4.3653    3.1724         4.5259           -1.3468
2         TCGA-2F-A9KP    TME2           -1.6480        5.0614   -1.3928        -1.4447            3.2313
3         TCGA-2F-A9KQ    TME2           -2.1915      -11.1568   -1.8568        -1.7691            0.6771
4         TCGA-2F-A9KR    TME3            0.0528        3.2845    1.6877        -0.2206           -1.3867
5         TCGA-2F-A9KT    TME1           -0.9226        7.1762   -1.6106        -1.0915           -1.1749

Parameters

  • Matrix (is.matrix) β€” whether the input is treated as a pure matrix
    • True
    • False
  • Features Set1 (feas1) β€” the first group of variables used in the correlation matrix
    • One or more numeric variables
  • Features Set2 (feas2) β€” the second group of variables used in the correlation matrix
    • One or more numeric variables
  • Heatmap Colors (cols) β€” colors used for the correlation heatmap
    • Provide 2 or 3 colors separated by commas
    • 2 colors: low/high, with white used as the midpoint
    • 3 colors: low/mid/high
  • Scale (scale) β€” whether to scale variables before correlation analysis
    • True
    • False
  • Method (method) β€” correlation method
    • Pearson
    • Spearman
    • Kendall
  • Font Size Star (font.size.star) β€” font size of significance markers
    • Example: 8
  • Font Size (font.size) β€” overall font size of the matrix plot
    • Example: 15
  • Fill by cor (fill_by_cor) β€” whether to fill tiles directly by correlation values
    • True
    • False

Steps

  1. Upload a dataset containing numeric variables for correlation analysis.
  2. Choose whether the input should be treated as a Matrix.
  3. Select one or more variables for Features Set1.
  4. Select one or more variables for Features Set2.
  5. Optionally provide custom Heatmap Colors.
  6. Choose whether to apply Scale.
  7. Select the correlation Method.
  8. Adjust font size settings if needed.
  9. Choose whether to enable Fill by cor.
  10. Click Run Analysis.
  11. View the correlation matrix in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.8 Survival Plots

Generate Kaplan–Meier survival plots for one selected signature or numeric variable. The module automatically creates multiple survival grouping strategies, including best cutoff, two-group comparison, and three-group comparison, and combines them into one figure.

Example input

           ID           OS_time   OS_status   TMEscore_plus   StromalScore   ImmuneScore
1         TCGA-3M-AB46   58.83     0           -1.2456         -0.6766        -1.3235
2         TCGA-3M-AB47   12.40     1            0.8452          0.9613         0.0518
3         TCGA-B7-5818   11.87     0            0.1264         -0.2425         0.8001
4         TCGA-B7-A5TI   19.83     0            0.4028          0.4017        -0.4095
5         TCGA-BR-4187    4.70     1            1.7631          2.2367         1.2632

Parameters

  • Signature (signature) β€” the numeric variable used for survival stratification
    • Example: signature score, pathway score, or immune-related feature
  • Status (status) β€” the event indicator column used in survival analysis
    • Example: OS_status
  • Time (time) β€” the survival time column
    • Example: OS_time
  • ID (ID) β€” the sample identifier column
    • Default example: ID
  • Time Type (time_type) β€” the time unit of the survival column
    • Month
    • Day
  • Palette (palette) β€” preset color palette for survival groups
    • nrc
    • jama
    • aaas
    • jco
    • paired1
    • paired2
    • paired3
    • paired4
    • accent
    • set2
  • Colors (cols) β€” custom colors for survival groups
    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Palette

Steps

  1. Upload a dataset containing survival information and one or more numeric signatures.
  2. Select the Signature column.
  3. Select the survival Status column.
  4. Select the survival Time column.
  5. Enter the sample ID column name if needed.
  6. Choose the correct Time Type.
  7. Select a preset Palette or provide custom Colors.
  8. Click Run Analysis.
  9. View the combined Kaplan–Meier plots in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.9 Survival Group Plot

2.6.10 Surv_group

Generate Kaplan–Meier survival plots for a categorical grouping variable. The module compares survival outcomes across predefined groups and is suitable for subgroup-based survival analysis such as molecular subtype, treatment class, or clinical category.

Example input

          ID           Subtype   OS_time   OS_status
1         TCGA-3M-AB46   GS        58.83     0
2         TCGA-3M-AB47   CIN       12.40     1
3         TCGA-B7-5818   EBV       11.87     0
4         TCGA-B7-A5TI   MSI       19.83     0
5         TCGA-BR-4187   GS         4.70     1

Parameters

  • ID column (ID) β€” the sample identifier column
    • Default example: ID
  • Target Group (target_group) β€” the categorical grouping variable used for Kaplan–Meier comparison
    • Example: subtype, response group, treatment group, or stage
  • Status (status) β€” the event indicator column used in survival analysis
    • Example: OS_status
  • Time (time) β€” the survival time column
    • Example: OS_time
  • Time Type (time_type) β€” the time unit of the survival column
    • Months
    • Days
  • Palette (palette) β€” preset color palette for survival groups
    • nrc
    • aaas
    • jco
    • jama
    • paired1
    • paired2
    • paired3
    • paired4
    • accent
    • set2
  • Colors (cols) β€” custom colors for survival groups
    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Palette
  • Font Size (Table) (font.size.table) β€” font size of the risk table
    • Example: 5

Steps

  1. Upload a dataset containing one grouping column and survival information.
  2. Enter the sample ID column name if needed.
  3. Select the Target Group column.
  4. Select the survival Status column.
  5. Select the survival Time column.
  6. Choose the correct Time Type.
  7. Select a preset Palette or provide custom Colors.
  8. Adjust Font Size (Table) if needed.
  9. Click Run Analysis.
  10. View the Kaplan–Meier plot in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.11 Time ROC Curves

Generate time-dependent ROC curves with AUC values for one to three selected variables. The module is suitable for evaluating and comparing the prognostic performance of signatures, scores, or biomarkers at a specific survival time point.

Example input

          ID           OS_time   OS_status   TMEscore_plus   StromalScore   ImmuneScore
1         TCGA-3M-AB46   58.83     0           -1.2456         -0.6766        -1.3235
2         TCGA-3M-AB47   12.40     1            0.8452          0.9613         0.0518
3         TCGA-B7-5818   11.87     0            0.1264         -0.2425         0.8001
4         TCGA-B7-A5TI   19.83     0            0.4028          0.4017        -0.4095
5         TCGA-BR-4187    4.70     1            1.7631          2.2367         1.2632

Parameters

  • Variables (Max 3) (vars) β€” one to three numeric variables used to build time-dependent ROC curves
    • Example: signature scores, pathway scores, or biomarkers
  • Status (status) β€” the event indicator column used in survival analysis
    • Example: OS_status
  • Time (time) β€” the survival time column
    • Example: OS_time
  • Time Point (time_point) β€” the survival time point used for ROC evaluation
    • Example: 12
  • Time Type (time_type) β€” the time unit of the survival column
    • Month
    • Day
  • Palette (palette) β€” preset color palette for ROC curves
    • nrc
    • jama
    • aaas
    • jco
    • paired1
    • paired2
    • paired3
    • paired4
    • accent
    • set2
  • Colors (cols) β€” custom colors for ROC curves
    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Palette
  • Title (main) β€” optional plot title

Steps

  1. Upload a dataset containing survival information and one or more numeric variables.
  2. Select one to three Variables.
  3. Select the survival Status column.
  4. Select the survival Time column.
  5. Set the Time Point for ROC evaluation.
  6. Choose the correct Time Type.
  7. Select a preset Palette or provide custom Colors.
  8. Optionally enter a plot Title.
  9. Click Run Analysis.
  10. View the time-dependent ROC plot in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.12 Signature ROC Curves

Generate ROC curves for one or more selected variables against a binary outcome. The module supports multiple ROC curves in one plot, optional curve comparison, smoothing, custom colors, and transparency adjustment.

Example input

             ID          OS_status   cluster    CD_8_T_effector      DDR       APM    Immune_Checkpoint
1         TCGA-2F-A9KO      0           TME1            4.7093       -4.3653    3.1724         4.5259
2         TCGA-2F-A9KP      1           TME2           -1.6480        5.0614   -1.3928        -1.4447
3         TCGA-2F-A9KQ      1           TME2           -2.1915      -11.1568   -1.8568        -1.7691
4         TCGA-2F-A9KR      0           TME3            0.0528        3.2845    1.6877        -0.2206
5         TCGA-2F-A9KT      0           TME1           -0.9226        7.1762   -1.6106        -1.0915

Parameters

  • Status (response) β€” the binary outcome column used for ROC analysis
    • Example: OS_status
  • Variables (variables) β€” one or more numeric variables used to generate ROC curves
    • Example: signature scores, pathway scores, or biomarkers
  • Palette (palette) β€” preset color palette for ROC curves
    • nrc
    • jama
    • aaas
    • jco
    • paired1
    • paired2
    • paired3
    • paired4
    • accent
    • set2
  • Colors (cols) β€” custom colors for ROC curves
    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Palette
  • Transparency (alpha) β€” line transparency of ROC curves
    • Example: 1
  • Compare (compare) β€” whether to compare ROC curves statistically
    • True
    • False
  • Compare Method (compare_method) β€” statistical method used for ROC comparison
    • Bootstrap
    • Delong
    • Venkatraman
  • Smooth (smooth) β€” whether to smooth ROC curves
    • True
    • False
  • Title (main) β€” optional plot title

Steps

  1. Upload a dataset containing one binary outcome column and one or more numeric variables.
  2. Select the Status column.
  3. Select one or more Variables.
  4. Choose a preset Palette or provide custom Colors.
  5. Adjust Transparency if needed.
  6. Choose whether to enable Compare.
  7. If comparison is enabled, select the Compare Method.
  8. Choose whether to apply Smooth.
  9. Optionally enter a plot Title.
  10. Click Run Analysis.
  11. View the ROC plot in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.13 PCA Visualization

Perform Principal Component Analysis (PCA) on an expression matrix and optionally color samples by group information from a phenotype table. The module supports matrix scaling, log transformation, sample labels, group colors, ellipse display, and flexible axis selection.

Example input

Expression matrix

                  TCGA-3M-AB46  TCGA-3M-AB47  TCGA-B7-5818  TCGA-B7-A5TI
EPCAM                 5.238         4.992         5.641         5.310
CD8A                  8.421         7.952         6.884         8.103
GZMB                  7.116         6.208         5.774         6.945
COL1A1                9.114         8.773         9.562         9.028

Phenotype data

           ID            Subtype   Response
1         TCGA-3M-AB46    GS        Non-response
2         TCGA-3M-AB47    CIN       Response
3         TCGA-B7-5818    EBV       Response
4         TCGA-B7-A5TI    MSI       Non-response

Parameters

  • ID Column (Pdata) (id_pdata) β€” the sample ID column in the phenotype table
    • Example: ID
  • Group (Pdata) (group) β€” the grouping variable in the phenotype table used to color samples
    • Example: subtype, response group, or treatment group
  • Matrix (is.matrix) β€” whether the uploaded expression data is treated as a pure matrix
    • True
    • False
  • Scale (scale) β€” whether to scale variables before PCA
    • True
    • False
  • Log (is.log) β€” whether the matrix is already log-transformed or should be treated as log-scale input
    • True
    • False
  • Sample Display (geom.ind) β€” how samples are shown in the PCA plot
    • Point
    • Text
    • Both
  • Palette (palette) β€” preset color palette for sample groups
    • nrc
    • jama
    • aaas
    • jco
    • paired1
    • paired2
    • paired3
    • paired4
    • accent
    • set2
  • Colors (cols) β€” custom colors for sample groups
    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Palette
  • Repel Labels (repel) β€” whether to repel overlapping sample labels
    • True
    • False
  • Components Number (ncp) β€” number of principal components to calculate
    • Example: 5
  • X-axis (axes[1]) β€” principal component displayed on the x-axis
    • Example: 1
  • Y-axis (axes[2]) β€” principal component displayed on the y-axis
    • Example: 2
  • Add Ellipses (addEllipses) β€” whether to add group ellipses to the PCA plot
    • True
    • False

Steps

  1. Upload an expression matrix.
  2. Upload a phenotype table if group annotation is needed.
  3. Enter the sample ID Column (Pdata).
  4. Select the Group (Pdata) column.
  5. Choose whether the expression input is a Matrix.
  6. Set Scale and Log options as needed.
  7. Select the Sample Display mode.
  8. Choose a preset Palette or provide custom Colors.
  9. Adjust label repelling, component number, PCA axes, and ellipse display if needed.
  10. Click Run Analysis.
  11. View the PCA plot in the Plot tab.

Example output

Download

  • The plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.14 Signature GSEA

Perform Gene Set Enrichment Analysis (GSEA) based on differential expression results generated from an expression matrix and phenotype data. The module first runs differential expression analysis between two selected groups, then performs GSEA using a selected signature collection and displays the top enriched signatures.

Example input

Expression matrix

                  TCGA-BR-6455  TCGA-BR-7196  TCGA-BR-8371  TCGA-BR-8380
ENSG00000000003      8006          2114          767           1556
ENSG00000000005      1             0             5             5
ENSG00000000419      3831          2600          1729          1760
ENSG00000000457      1126          745           1040          1260

Phenotype data

          ID            Subtype
1         TCGA-3M-AB46    GS
2         TCGA-3M-AB47    CIN
3         TCGA-B7-5818    EBV
4         TCGA-B7-A5TI    MSI

Parameters

  • ID (Pdata) (pdata_id) β€” the sample ID column in the phenotype table
    • Example: ID
  • Group (group_id) β€” the grouping column used for differential expression
    • Example: tumor vs normal, responder vs non-responder
  • Contrast (Case vs Control) (contrast)
    • Case: the experimental group
    • Control: the reference group
  • Array (array) β€” whether the expression data is from microarray
    • True
    • False
  • Method (method) β€” differential expression method
    • DESeq2: suitable for raw count data
    • Limma: suitable for log-transformed expression data such as log2 TPM
  • Padj Cutoff (padj_cutoff) β€” adjusted P-value threshold for differential expression filtering
    • Example: 0.01
  • Logfc Cutoff (logfc_cutoff) β€” log fold-change threshold for differential expression filtering
    • Example: 0.5
  • Palette (palette_gsea) β€” preset color palette for the GSEA plot
    • 1
    • 2
    • 3
    • 4
  • Colors (cols_gsea) β€” custom colors for the GSEA plot
    • Enter color names or hex codes separated by commas
  • Signature (genesets) β€” the gene set collection used for enrichment analysis
    • TME
    • Metabolism
    • Tumor
    • Collection
    • Go_bp
    • Go_cc
    • Go_mf
    • KEGG
    • Hallmark
    • Reactome
  • Signature Numbers (show_gsea) β€” number of top enriched signatures displayed in the plot
    • Example: 8

Steps

  1. Upload the expression matrix.
  2. Upload the phenotype table.
  3. Enter the sample ID (Pdata) column name.
  4. Select the Group column.
  5. Select the Case and Control groups for comparison.
  6. Choose Array and Method according to the input data type.
  7. Set Padj Cutoff and Logfc Cutoff.
  8. Click the first Run Analysis button to perform differential expression and GSEA.
  9. Select the GSEA Palette, custom Colors, Signature collection, and Signature Numbers if needed.
  10. Click the second Run Analysis button to update the GSEA plot only.
  11. View the differential expression result in the Data tab.
  12. View the GSEA plot in the Plot tab.

Example output

Download

  • The result plot can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

2.6.15 Find Markers

Identify marker genes for each group from bulk transcriptome data using a Seurat-based workflow. The module takes an expression matrix and phenotype data, detects group-specific marker genes, displays a heatmap of top markers, and returns the complete marker table.

Example input

                  TCGA-3M-AB46  TCGA-3M-AB47  TCGA-B7-5818  TCGA-B7-A5TI
EPCAM                 5.238         4.992         5.641         5.310
CD8A                  8.421         7.952         6.884         8.103
GZMB                  7.116         6.208         5.774         6.945
COL1A1                9.114         8.773         9.562         9.028

Phenotype data

          ID            Subtype
1         TCGA-3M-AB46    GS
2         TCGA-3M-AB47    CIN
3         TCGA-B7-5818    EBV
4         TCGA-B7-A5TI    MSI

Parameters

  • Pdata ID (id_pdata) β€” the sample ID column in the phenotype table
    • Example: ID
  • Group Column (group) β€” the grouping variable used to identify marker genes
    • Example: subtype, response group, or treatment group
  • Top N Markers (top_n) β€” number of top marker genes displayed per group in the heatmap
    • Example: 20
  • Group Color (group_color_style) β€” preset color palette for group annotations
    • npg
    • aaas
    • lancet
    • set1
    • set2
    • paired
  • Heatmap Color (heatmap_body_color) β€” preset color palette for heatmap expression values
    • Red-White-Blue
    • Yellow-Black-Purple
    • Red-Black-Green
    • Spectra
  • Custom Group Colors (group.colors) β€” custom colors for group annotations
    • Enter color names or hex codes separated by commas
    • If provided, this setting overrides Group Color
  • Custom Heatmap Colors (heatmap.colors) β€” custom colors for the heatmap body
    • Enter at least 2 color names or hex codes separated by commas
    • If provided, this setting overrides Heatmap Color

Steps

  1. Upload the expression matrix.
  2. Upload the phenotype table.
  3. Enter the sample Pdata ID column name.
  4. Select the Group Column.
  5. Set the Top N Markers to display in the heatmap.
  6. Select a preset Group Color and Heatmap Color, or provide custom colors.
  7. Click Run Analysis.
  8. View the marker heatmap in the Plot tab.
  9. View the complete marker table in the Marker Table tab.

Example output

Download

  • The result plot and marker table can be exported from the Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.

Download

  • Results can be exported from the Download panel.

2.7 Mutation Module

2.7.1 Build Mutation Matrix

Convert MAF mutation data into a binary mutation matrix. The module summarizes mutation events at the gene level and returns a sample-by-gene matrix indicating whether each gene is mutated in each sample.

Example input

Mutation Annotation Format (MAF) table

                Hugo_Symbol   Tumor_Sample_Barcode   Variant_Classification
1                 TP53          TCGA-3M-AB46          Missense_Mutation
2                 ARID1A        TCGA-3M-AB46          Frame_Shift_Del
3                 PIK3CA        TCGA-3M-AB47          Missense_Mutation
4                 CDH1          TCGA-B7-5818          Nonsense_Mutation
5                 FAT4          TCGA-B7-A5TI          Frame_Shift_Ins

Parameters

  • TCGA (isTCGA) β€” whether the uploaded MAF file uses TCGA-style sample identifiers
    • True
    • False
  • Type to show and download (table_type) β€” mutation category displayed in the result table
    • All
    • SNP
    • INDEL
    • Frameshift

Steps

  1. Upload a MAF mutation file.
  2. Select whether the file uses TCGA sample IDs.
  3. Choose the mutation Type to show and download.
  4. Click Run Analysis.
  5. View the binary mutation matrix in the Data tab.

Example output

                  TCGA-3M-AB46  TCGA-3M-AB47  TCGA-B7-5818  TCGA-B7-A5TI
TP53                  1              0              0              0
ARID1A                1              0              0              0
PIK3CA                0              1              0              0
CDH1                  0              0              1              0
FAT4                  0              0              0              1

Download

  • Results can be exported from the Download panel.

2.7.2 Identify Mutations

Identify phenotype-associated mutations by combining a binary mutation matrix with a signature matrix. The module tests whether mutations in specific genes are associated with the selected signature, and generates both an oncoprint and a box plot for visualization.

Example input

Mutation matrix

                  TCGA-3M-AB46  TCGA-3M-AB47  TCGA-B7-5818  TCGA-B7-A5TI
TP53                  1              0              0              0
ARID1A                1              0              0              0
PIK3CA                0              1              0              0
CDH1                  0              0              1              0
FAT4                  0              0              0              1

Signature matrix

              ID        TMEscore_plus   StromalScore   ImmuneScore
1         TCGA-3M-AB46    -1.2456         -0.6766        -1.3235
2         TCGA-3M-AB47     0.8452          0.9613         0.0518
3         TCGA-B7-5818     0.1264         -0.2425         0.8001
4         TCGA-B7-A5TI     0.4028          0.4017        -0.4095

Parameters

  • ID Column (id_signature_matrix) β€” the sample ID column in the signature matrix
    • Example: ID
  • Signature (signature) β€” the numeric signature used to test mutation-associated differences
    • Example: TMEscore_plus
  • Min Mutation Frequency (min_mut_freq) β€” minimum mutation frequency threshold for genes included in the analysis
    • 0.01
    • 0.05
    • 0.1
  • Method (method) β€” statistical method used to identify phenotype-associated mutations
    • Multi(Cuzick and Wilcoxon)
    • Wilcoxon

Parameters for oncoprint

  • Group By (oncoprint_group_by) β€” grouping method used for oncoprint display
    • Mean
    • Quantile
  • Gene Counts (gene_counts) β€” number of top genes shown in the oncoprint
    • Example: 10

Parameters for box plot

  • Point Size (point_size) β€” size of points displayed on the box plot
    • Example: 4.5
  • Point Transparency (point_alpha) β€” transparency of points on the box plot
    • Example: 0.1
  • Show Jitter (jitter) β€” whether to display jittered sample points
    • True
    • False

Parameters for all results download

  • Output Folder Name (save_path) β€” folder name used when packaging all generated results
    • Example: mutation_results

Steps

  1. Upload the binary mutation matrix.
  2. Upload the signature matrix.
  3. Enter the sample ID Column name for the signature matrix.
  4. Select the target Signature.
  5. Choose the Min Mutation Frequency threshold.
  6. Select the statistical Method.
  7. Adjust the oncoprint parameters if needed.
  8. Adjust the box plot parameters if needed.
  9. Click Run Analysis.
  10. View the mutation oncoprint in the Oncoprint tab.
  11. View the mutation-associated signature box plot in the Box Plot tab.

Example output

Download

  • The oncoprint and box plot can be exported from their respective Download panels, and all generated results can also be packaged from the All Results Download panel.
  • An initial plot size is provided, which can be adjusted if needed.
  • If needed, you can adjust the plot width and height before downloading to obtain a more suitable layout.