obs_level_CT_rank

Function

crisgi_obj.obs_level_CT_rank(gene_sets, prefix='test', min_size=5)

Performs observation-level cell type (CT) ranking using gene set variation analysis (GSVA) on the provided gene sets. This method is designed as a member function of the crisgi_obj class. It computes enrichment scores for each observation (cell/sample), ranks them, and saves the results to a CSV file. If group information is available, it annotates the results accordingly.

Parameters

Name	Type	Description
gene_sets	object	Dictionary or compatible object containing gene sets for GSVA analysis.
prefix	str	Prefix for output directory and files. Default is `'test'`.
min_size	int	Minimum number of genes required in each gene set. Default is `5`.

Return type

pandas.DataFrame

Returns

A DataFrame with GSVA enrichment scores for each observation, optionally annotated with group information and sorted by enrichment score (ES). The results are also exported as a CSV file in the specified output directory.

Attributes Set

self.gp_es: Stores the GSVA result object for downstream analysis.

Example

# Assuming `crisgi_obj` is an instance with `edata`, `out_dir`, and optionally `groupby` attributes
gene_sets = {'Pathway1': ['GeneA', 'GeneB'], 'Pathway2': ['GeneC', 'GeneD']}
result_df = crisgi_obj.obs_level_CT_rank(gene_sets=gene_sets, prefix='experiment1', min_size=10)

# The results are saved to './<out_dir>/experiment1/prerank_gsva_interaction.csv'
# The DataFrame `result_df` contains the ranked enrichment scores for each observation.