🔧 Troubleshooting Guide¶
Common Issues and Solutions¶
This guide provides solutions to the most common problems encountered when using SPEX for spatial transcriptomics analysis.
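Before digging into a specific problem, it is often worth confirming exactly which SPEX version is installed. A minimal check using standard packaging metadata (assuming the package was installed from the `spex-tools` distribution, as in the examples below):

```python
from importlib.metadata import PackageNotFoundError, version

try:
    # Distribution name used by pip throughout this guide
    print(f"spex-tools version: {version('spex-tools')}")
except PackageNotFoundError:
    print("spex-tools is not installed in this environment")
```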
Installation Issues¶
Problem: "No module named 'spex'" Error¶
Symptoms:

```text
ImportError: No module named 'spex'
```

Solutions:

- Check that SPEX is installed:

  ```bash
  pip list | grep spex
  ```

- Reinstall SPEX:

  ```bash
  pip uninstall spex-tools
  pip install spex-tools
  ```

- Check which Python environment is active:

  ```bash
  which python
  pip --version
  ```

- Install in a virtual environment:

  ```bash
  python -m venv spex_env
  source spex_env/bin/activate  # On Windows: spex_env\Scripts\activate
  pip install spex-tools
  ```
Problem: CUDA/GPU Installation Issues¶
Symptoms:

```text
RuntimeError: CUDA out of memory
ImportError: No module named 'torch'
```

Solutions:

- Install PyTorch with CUDA support:

  ```bash
  pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
  ```

- Use the CPU-only version:

  ```bash
  pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
  ```

- Check GPU availability:

  ```python
  import torch

  print(f"CUDA available: {torch.cuda.is_available()}")
  print(f"GPU count: {torch.cuda.device_count()}")
  ```
Problem: OpenCV Installation Issues¶
Symptoms:

```text
ImportError: libGL.so.1: cannot open shared object file
```

Solutions:

- Install system dependencies (Ubuntu/Debian):

  ```bash
  sudo apt-get update
  sudo apt-get install libgl1-mesa-glx libglib2.0-0 libsm6 libxext6 libxrender-dev
  ```

- Install system dependencies (CentOS/RHEL):

  ```bash
  sudo yum install mesa-libGL glib2 libSM libXext libXrender
  ```

- Reinstall OpenCV with the headless build, which does not require libGL (a quick import check follows this list):

  ```bash
  pip uninstall opencv-python
  pip install opencv-python-headless
  ```
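After switching to the headless build, a quick import check confirms that OpenCV loads without libGL; this uses only OpenCV's standard version attribute:

```python
import cv2

# If this prints a version without raising ImportError, the headless build works
print(f"OpenCV version: {cv2.__version__}")
```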
Data Loading Issues¶
Problem: Memory Errors When Loading Large Images¶
Symptoms:

```text
MemoryError: Unable to allocate array
```

Solutions:

- Use chunked loading:

  ```python
  # Load the image in chunks instead of all at once
  Image, channel = sp.load_image('large_image.tiff', chunk_size=1000)
  ```

- Reduce image resolution:

  ```python
  # Alias the Pillow class so it doesn't shadow the Image array
  from PIL import Image as PILImage

  img = PILImage.open('large_image.tiff')
  img_resized = img.resize((img.width // 2, img.height // 2))
  img_resized.save('resized_image.tiff')
  ```

- Use memory-efficient data types:

  ```python
  import numpy as np

  Image = Image.astype(np.float32)  # Instead of float64
  ```

- Monitor memory usage:

  ```python
  import os
  import psutil

  def get_memory_usage():
      """Return the resident memory of this process in MB."""
      process = psutil.Process(os.getpid())
      return process.memory_info().rss / 1024 / 1024

  print(f"Memory usage: {get_memory_usage():.1f} MB")
  ```
Problem: Unsupported Image Format¶
Symptoms:

```text
ValueError: Unsupported image format
```

Solutions:

- Convert the image to a supported format:

  ```python
  from PIL import Image as PILImage

  img = PILImage.open('unsupported_format.bmp')
  img.save('converted_image.tiff')
  ```

- Check which formats Pillow can open:

  ```python
  from PIL import Image as PILImage

  print(list(PILImage.OPEN.keys()))
  ```

- Load TIFF files directly with tifffile:

  ```python
  import tifffile

  Image = tifffile.imread('image.tiff')
  ```
Problem: Incorrect Channel Information¶
Symptoms:

```text
IndexError: Channel index out of bounds
```

Solutions:

- Check image dimensions:

  ```python
  print(f"Image shape: {Image.shape}")
  print(f"Number of channels: {Image.shape[2] if len(Image.shape) == 3 else 1}")
  ```

- Verify channel indices:

  ```python
  # Use only channel indices that exist in the image
  valid_channels = list(range(Image.shape[2]))
  features = sp.feature_extraction(Image, labels, valid_channels)
  ```

- Handle single-channel images:

  ```python
  # Add a channel axis so downstream code can index channels
  if len(Image.shape) == 2:
      Image = Image[:, :, np.newaxis]
  ```
Image Preprocessing Issues¶
Problem: Background Subtraction Not Working¶
Symptoms:

- No visible change after background subtraction
- Images become too dark or too bright

Solutions:

- Adjust the window size:

  ```python
  # Increase window size for larger background variations
  Image_bg = sp.background_subtract(Image, [0], window_size=100)

  # Decrease it to preserve fine detail
  Image_bg = sp.background_subtract(Image, [0], window_size=20)
  ```

- Check image statistics:

  ```python
  print(f"Original range: {Image.min():.2f} - {Image.max():.2f}")
  print(f"Background subtracted range: {Image_bg.min():.2f} - {Image_bg.max():.2f}")
  ```

- Visualize the results side by side:

  ```python
  import matplotlib.pyplot as plt

  fig, axes = plt.subplots(1, 2, figsize=(10, 5))
  axes[0].imshow(Image[:, :, 0], cmap='gray')
  axes[0].set_title('Original')
  axes[1].imshow(Image_bg[:, :, 0], cmap='gray')
  axes[1].set_title('Background Subtracted')
  plt.show()
  ```
Problem: Denoising Too Aggressive¶
Symptoms:

- Loss of important features
- Over-smoothed images

Solutions:

- Reduce denoising strength:

  ```python
  # A smaller h parameter gives less aggressive denoising
  Image_denoised = sp.nlm_denoise(Image, [0], h=0.05)
  ```

- Try a different denoising method:

  ```python
  # A median filter works well for salt-and-pepper noise
  Image_denoised = sp.median_denoise(Image, [0])
  ```

- Apply denoising selectively:

  ```python
  # Denoise only the noisy channels
  Image_denoised = Image.copy()
  noisy_channels = [0, 2]  # Only denoise channels 0 and 2
  for ch in noisy_channels:
      Image_denoised[:, :, ch] = sp.nlm_denoise(Image[:, :, ch], [0])
  ```
Segmentation Issues¶
Problem: No Cells Detected¶
Symptoms:
print(f"Cells detected: {labels.max()}") # Output: 0
Solutions:
-
Check image quality:
# Verify image has sufficient contrast print(f"Image range: {Image.min():.2f} - {Image.max():.2f}") print(f"Image mean: {Image.mean():.2f}") print(f"Image std: {Image.std():.2f}")
-
Improve preprocessing:
# Apply better preprocessing Image_bg = sp.background_subtract(Image, [0], window_size=50) Image_denoised = sp.nlm_denoise(Image_bg, [0])
-
Adjust segmentation parameters:
# Try smaller diameter labels = sp.cellpose_cellseg(Image, [0], diameter=15) # Try different flow threshold labels = sp.cellpose_cellseg(Image, [0], flow_threshold=0.4)
-
Use alternative segmentation method:
# Try watershed segmentation labels = sp.watershed_classic(Image, [0])
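If the cell count is still zero or suspiciously low, an overlay makes it obvious whether anything was segmented at all; a sketch using skimage's `label2rgb`, shown here on channel 0 to match the examples above:

```python
import matplotlib.pyplot as plt
from skimage.color import label2rgb

# Overlay the label image on the segmentation channel
overlay = label2rgb(labels, image=Image[:, :, 0], bg_label=0)
plt.imshow(overlay)
plt.title(f"{labels.max()} cells detected")
plt.axis('off')
plt.show()
```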
Problem: Too Many False Positive Cells¶
Symptoms:

- Many small objects detected
- Noise identified as cells

Solutions:

- Remove small objects:

  ```python
  # Increase the minimum size threshold
  labels_clean = sp.remove_small_objects(labels, min_size=100)
  ```

- Remove large objects:

  ```python
  # Remove objects that are too large to be cells
  labels_clean = sp.remove_large_objects(labels, max_size=500)
  ```

- Adjust segmentation parameters:

  ```python
  # Increase the diameter to avoid detecting small noise
  labels = sp.cellpose_cellseg(Image, [0], diameter=40)
  ```

- Improve image quality:

  ```python
  # Apply stronger denoising
  Image_denoised = sp.nlm_denoise(Image, [0], h=0.1)
  Image_denoised = sp.median_denoise(Image_denoised, [0])
  ```
Problem: Missing Cells in Segmentation¶
Symptoms:

- Expected cells not detected
- Incomplete segmentation

Solutions:

- Use cell rescue:

  ```python
  # Recover cells missed by the initial segmentation
  labels_rescued = sp.rescue_cells(Image, labels, [0])
  ```

- Adjust the diameter parameter:

  ```python
  # Try a smaller diameter for smaller cells
  labels = sp.cellpose_cellseg(Image, [0], diameter=20)
  ```

- Improve contrast:

  ```python
  # Enhance contrast before segmentation
  from skimage import exposure

  Image_enhanced = exposure.equalize_hist(Image[:, :, 0])
  Image_enhanced = np.stack([Image_enhanced] * Image.shape[2], axis=2)
  ```

- Try a different segmentation method:

  ```python
  # StarDist handles different cell shapes
  labels = sp.stardist_cellseg(Image, [0])
  ```
Problem: Segmentation Boundaries Not Accurate¶
Symptoms:

- Cell boundaries don't follow actual cell edges
- Over-segmentation or under-segmentation

Solutions:

- Validate boundaries against image gradients (accurate boundaries should coincide with strong gradients):

  ```python
  def validate_boundaries(image, labels):
      from scipy import ndimage
      from skimage.segmentation import find_boundaries

      # Gradient magnitude of the segmentation channel
      grad_x = ndimage.sobel(image[:, :, 0], axis=0)
      grad_y = ndimage.sobel(image[:, :, 0], axis=1)
      gradient_magnitude = np.sqrt(grad_x**2 + grad_y**2)

      # Pixels on the boundaries between labelled cells
      boundaries = find_boundaries(labels, mode='outer')
      boundary_gradients = gradient_magnitude[boundaries]

      # Near 1: boundaries sit on strong edges; near 0: they don't
      quality_score = np.mean(boundary_gradients) / np.max(gradient_magnitude)
      return quality_score

  quality = validate_boundaries(Image, labels)
  print(f"Boundary quality: {quality:.3f}")
  ```

- Adjust the flow threshold:

  ```python
  # A lower threshold gives more precise boundaries
  labels = sp.cellpose_cellseg(Image, [0], flow_threshold=0.2)
  ```

- Apply morphological post-processing:

  ```python
  # Fill holes and relabel (note: this merges touching cells)
  from scipy import ndimage

  labels_processed = ndimage.binary_fill_holes(labels > 0)
  labels_processed = ndimage.label(labels_processed)[0]
  ```
Feature Extraction Issues¶
Problem: Feature Extraction Fails¶
Symptoms:

```text
ValueError: No valid cells found for feature extraction
```

Solutions:

- Check the segmentation output:

  ```python
  print(f"Number of cells: {labels.max()}")
  print(f"Unique labels: {np.unique(labels)}")
  ```

- Make sure cell labels start from 1:

  ```python
  # Ensure labels start from 1
  if labels.min() == 0:
      labels = labels + 1
  ```

- Check channel indices:

  ```python
  # Ensure channel indices are valid for this image
  valid_channels = list(range(min(len(channel), Image.shape[2])))
  features = sp.feature_extraction(Image, labels, valid_channels)
  ```
Problem: Missing Values in Features¶
Symptoms:

```python
print(f"Missing values: {features.isnull().sum().sum()}")  # Non-zero output
```

Solutions:

- Handle missing values:

  ```python
  # Remove cells with missing values
  features_clean = features.dropna()

  # Or fill missing values with the column mean
  features_filled = features.fillna(features.mean())

  # Or interpolate missing values
  features_interpolated = features.interpolate(method='linear')
  ```

- Check for zero-variance features:

  ```python
  # Remove features with zero variance
  feature_variance = features.var()
  good_features = feature_variance[feature_variance > 0].index
  features_filtered = features[good_features]
  ```

- Investigate which features are affected:

  ```python
  missing_features = features.columns[features.isnull().any()].tolist()
  print(f"Features with missing values: {missing_features}")
  ```
Problem: Too Many/Low-Quality Features¶
Symptoms:

- Too many features extracted
- Features with low information content

Solutions:

- Select informative features:

  ```python
  # Keep features with meaningful variance
  feature_variance = features.var()
  informative_features = feature_variance[feature_variance > 0.01].index
  features_selected = features[informative_features]
  ```

- Remove highly correlated features:

  ```python
  corr_matrix = features.corr()
  high_corr_pairs = []
  for i in range(len(corr_matrix.columns)):
      for j in range(i + 1, len(corr_matrix.columns)):
          if abs(corr_matrix.iloc[i, j]) > 0.95:
              high_corr_pairs.append((corr_matrix.columns[i], corr_matrix.columns[j]))

  # Drop one feature from each highly correlated pair (deduplicated)
  features_to_remove = {pair[1] for pair in high_corr_pairs}
  features_uncorr = features.drop(columns=list(features_to_remove))
  ```

- Use feature selection methods:

  ```python
  from sklearn.feature_selection import SelectKBest, f_classif

  # Keep the top 50 features most associated with the cluster labels
  selector = SelectKBest(score_func=f_classif, k=50)
  features_selected = selector.fit_transform(features, clusters)
  ```
Clustering Issues¶
Problem: Only One Cluster Produced¶
Symptoms:
print(f"Number of clusters: {len(np.unique(clusters))}") # Output: 1
Solutions:
-
Check feature quality:
# Check feature variance feature_variance = features.var() print(f"Features with zero variance: {(feature_variance == 0).sum()}") # Remove low-variance features good_features = feature_variance[feature_variance > 0.01].index features_filtered = features[good_features]
-
Increase resolution:
# Try higher resolution clusters = sp.cluster(adata, method='leiden', resolution=1.0)
-
Try different clustering method:
# Use Phenograph clusters = sp.phenograph_cluster(features, k=30)
-
Normalize features:
from sklearn.preprocessing import StandardScaler scaler = StandardScaler() features_normalized = scaler.fit_transform(features)
Problem: Too Many Clusters¶
Symptoms:

```python
print(f"Number of clusters: {len(np.unique(clusters))}")  # Unreasonably many
```

Solutions:

- Decrease the resolution:

  ```python
  # Lower resolution produces fewer clusters
  clusters = sp.cluster(adata, method='leiden', resolution=0.1)
  ```

- Reduce feature dimensionality:

  ```python
  from sklearn.decomposition import PCA

  pca = PCA(n_components=20)
  features_pca = pca.fit_transform(features)
  ```

- Use a different clustering method:

  ```python
  # Louvain with a lower resolution
  clusters = sp.cluster(adata, method='louvain', resolution=0.3)
  ```
Problem: Clusters Don't Make Biological Sense¶
Symptoms:

- Clusters don't correspond to known cell types
- Poor cluster separation

Solutions (see the marker-profile sketch after this list):

- Validate clustering quality:

  ```python
  from sklearn.metrics import silhouette_score

  silhouette_avg = silhouette_score(features, clusters)
  print(f"Silhouette score: {silhouette_avg:.3f}")
  # Good: > 0.3, Acceptable: > 0.1, Poor: < 0.1
  ```

- Check feature selection:

  ```python
  # Restrict to domain-specific features
  biological_features = ['area', 'perimeter', 'mean_intensity', 'eccentricity']
  features_bio = features[biological_features]
  ```

- Try different preprocessing:

  ```python
  # Log-transform features
  features_log = np.log1p(features)

  # Z-score normalization
  from sklearn.preprocessing import StandardScaler

  scaler = StandardScaler()
  features_scaled = scaler.fit_transform(features)
  ```
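If the clusters still look arbitrary, comparing average marker levels per cluster usually shows whether they track biology. A minimal sketch, assuming `features` is a pandas DataFrame row-aligned with `clusters`; the column names are placeholders for your own markers:

```python
import pandas as pd

# Mean of every feature within each cluster
cluster_labels = pd.Series(clusters, index=features.index)
profile = features.groupby(cluster_labels).mean()

# Inspect a few interpretable columns (hypothetical names; substitute your own)
print(profile[['area', 'mean_intensity']].round(2))
```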
Spatial Analysis Issues¶
Problem: CLQ Calculation Fails¶
Symptoms:

```text
ValueError: Invalid cell types specified
```

Solutions:

- Check cell type labels:

  ```python
  # The requested cell types must exist in the cluster labels
  unique_clusters = np.unique(clusters)
  print(f"Available cell types: {unique_clusters}")

  valid_cell_types = [0, 1, 2]  # Must exist in clusters
  clq_matrix = sp.CLQ_vec_numba(labels, clusters, cell_types=valid_cell_types)
  ```

- Check the radius parameter:

  ```python
  # Use a radius appropriate for your tissue and pixel size
  clq_matrix = sp.CLQ_vec_numba(labels, clusters, cell_types=[0, 1, 2], radius=50)
  ```

- Ensure there are enough cells of each type:

  ```python
  for cell_type in [0, 1, 2]:
      count = np.sum(clusters == cell_type)
      print(f"Cell type {cell_type}: {count} cells")
  ```
Problem: Spatial Analysis Results Don't Make Sense¶
Symptoms:

- Unexpected spatial patterns
- Counterintuitive results

Solutions:

- Validate spatial coordinates (note that skimage centroids are (row, column), i.e. (y, x)):

  ```python
  from skimage import measure

  # Get cell centroids; centroid-0 is the row (y), centroid-1 the column (x)
  props = measure.regionprops_table(labels, properties=['centroid'])
  centroids = np.column_stack([props['centroid-0'], props['centroid-1']])

  print(f"Y (row) range: {centroids[:, 0].min():.1f} - {centroids[:, 0].max():.1f}")
  print(f"X (column) range: {centroids[:, 1].min():.1f} - {centroids[:, 1].max():.1f}")
  ```

- Visualize the spatial distribution:

  ```python
  import matplotlib.pyplot as plt

  plt.figure(figsize=(10, 8))
  for cell_type in np.unique(clusters):
      mask = clusters == cell_type
      # Plot column as x and row as y to match image orientation
      plt.scatter(centroids[mask, 1], centroids[mask, 0],
                  label=f'Type {cell_type}', alpha=0.7)
  plt.gca().invert_yaxis()
  plt.legend()
  plt.title('Spatial Distribution of Cell Types')
  plt.show()
  ```

- Check for edge effects:

  ```python
  # Keep only cells whose centroids are far enough from the image border
  edge_distance = 50
  in_bounds = ((centroids[:, 0] > edge_distance) &
               (centroids[:, 0] < Image.shape[0] - edge_distance) &
               (centroids[:, 1] > edge_distance) &
               (centroids[:, 1] < Image.shape[1] - edge_distance))

  # Zero out edge cells in the label image
  labels_center = labels.copy()
  cell_ids = np.unique(labels)[1:]  # skip background (0)
  for cell_id, keep in zip(cell_ids, in_bounds):
      if not keep:
          labels_center[labels_center == cell_id] = 0
  ```
Performance Issues¶
Problem: Analysis Takes Too Long¶
Symptoms:

- Processing time is excessive
- Memory usage is high

Solutions (see the timing sketch after this list for finding the slow step):

- Use parallel processing:

  ```python
  # Set the number of parallel jobs
  clusters = sp.cluster(adata, method='leiden', n_jobs=4)
  ```

- Reduce the image size:

  ```python
  from skimage.transform import resize

  Image_small = resize(Image, (Image.shape[0] // 2, Image.shape[1] // 2))
  ```

- Use chunked processing:

  ```python
  # Process large images in chunks
  labels = sp.cellpose_cellseg(Image, [0], chunk_size=512)
  ```

- Optimize memory usage:

  ```python
  # Use memory-efficient data types
  Image = Image.astype(np.float32)

  # Free memory after processing
  import gc
  gc.collect()
  ```
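Before reaching for any of these, it helps to measure which stage actually dominates the runtime. A minimal timing sketch using only the standard library; the pipeline calls mirror the examples above and may need adapting to your workflow:

```python
import time

def timed(label, fn, *args, **kwargs):
    """Run fn, report how long it took, and return its result."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    print(f"{label}: {time.perf_counter() - start:.1f} s")
    return result

Image_bg = timed("background subtraction", sp.background_subtract, Image, [0])
labels = timed("segmentation", sp.cellpose_cellseg, Image_bg, [0])
features = timed("feature extraction", sp.feature_extraction, Image_bg, labels, [0])
```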
Problem: CUDA Out of Memory¶
Symptoms:
RuntimeError: CUDA out of memory
Solutions:
-
Reduce batch size:
labels = sp.cellpose_cellseg(Image, [0], batch_size=1)
-
Use CPU instead of GPU:
labels = sp.cellpose_cellseg(Image, [0], use_gpu=False)
-
Process smaller chunks:
labels = sp.cellpose_cellseg(Image, [0], chunk_size=256)
-
Clear GPU memory:
import torch if torch.cuda.is_available(): torch.cuda.empty_cache()
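To choose a batch or chunk size that actually fits, check how much GPU memory is in use before and after a run; a small sketch using PyTorch's standard memory counters:

```python
import torch

if torch.cuda.is_available():
    # Memory held by live tensors vs. reserved by the caching allocator
    allocated = torch.cuda.memory_allocated() / 1024**2
    reserved = torch.cuda.memory_reserved() / 1024**2
    total = torch.cuda.get_device_properties(0).total_memory / 1024**2
    print(f"Allocated: {allocated:.0f} MB, reserved: {reserved:.0f} MB, total: {total:.0f} MB")
```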
Data Export Issues¶
Problem: Results Can't Be Saved¶
Symptoms:

```text
PermissionError: [Errno 13] Permission denied
```

Solutions:

- Check file permissions:

  ```python
  import os

  print(f"Current directory: {os.getcwd()}")
  print(f"Write permission: {os.access('.', os.W_OK)}")
  ```

- Use absolute paths and create the output directory first:

  ```python
  import os

  output_dir = '/path/to/output/directory'
  os.makedirs(output_dir, exist_ok=True)

  features.to_csv(os.path.join(output_dir, 'features.csv'))
  ```

- Try a different file format:

  ```python
  features.to_csv('features.csv')
  features.to_excel('features.xlsx')              # requires openpyxl
  features.to_hdf('features.h5', key='features')  # requires pytables
  ```
Problem: Large Files Can't Be Saved¶
Symptoms:

```text
MemoryError: Unable to save large file
```

Solutions:

- Save in chunks:

  ```python
  # Write a large DataFrame as several smaller CSV files
  chunk_size = 1000
  for i in range(0, len(features), chunk_size):
      chunk = features.iloc[i:i + chunk_size]
      chunk.to_csv(f'features_chunk_{i // chunk_size}.csv')
  ```

- Use efficient formats:

  ```python
  # HDF5 with compression
  features.to_hdf('features.h5', key='features', mode='w', complevel=9)

  # Parquet (requires pyarrow or fastparquet)
  features.to_parquet('features.parquet')
  ```

- Compress the output:

  ```python
  features.to_csv('features.csv.gz', compression='gzip')
  ```
Getting Help¶
When to Contact Support¶
Contact the SPEX development team if you encounter:
- Bugs: Unexpected behavior or crashes
- Performance issues: Unreasonably slow processing
- Missing features: Functionality you need but isn't available
- Documentation issues: Unclear or incorrect documentation
How to Report Issues¶
When reporting issues, include:

- A clear description of the problem
- A minimal reproducible example with code
- System information (the script after this list collects most of it):
  - Operating system
  - Python version
  - SPEX version
  - Hardware specifications
- Error messages and stack traces
- Expected vs. actual behavior
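A short script can collect most of this information in one go; a minimal sketch using the standard library (the `spex-tools` distribution name follows the installation examples above):

```python
import platform
import sys
from importlib.metadata import PackageNotFoundError, version

print(f"OS: {platform.platform()}")
print(f"Python: {sys.version.split()[0]}")
for pkg in ("spex-tools", "torch", "numpy"):
    try:
        print(f"{pkg}: {version(pkg)}")
    except PackageNotFoundError:
        print(f"{pkg}: not installed")
```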
Example Issue Report¶
Title: Segmentation fails with memory error on large images

Description:
When processing images larger than 4000x4000 pixels, segmentation fails with a memory error.

Steps to reproduce:

1. Load a large image: `Image, channel = sp.load_image('large_image.tiff')`
2. Run segmentation: `labels = sp.cellpose_cellseg(Image, [0])`
3. The error occurs during processing

System information:

- OS: Ubuntu 20.04
- Python: 3.9.7
- SPEX: 0.3.1055
- RAM: 16GB
- GPU: NVIDIA RTX 3080

Error message:

```text
RuntimeError: CUDA out of memory. Tried to allocate 2.00 GiB
```

Expected behavior:
Segmentation should complete successfully or provide a clear error message about memory requirements.
Still having issues? Check the FAQ section or contact the SPEX development team for additional support.