This is a PyTorch implementation of "WaveFormer: Wavelet Embedding Transformer for Biomedical Signals"
WaveFormer is a transformer architecture that integrates wavelet decomposition at two critical stages:
- Embedding construction: a multi-channel Discrete Wavelet Transform (DWT) extracts frequency features, producing tokens that carry both time-domain and frequency-domain information.
- Positional encoding: Dynamic Wavelet Positional Encoding (DyWPE) adapts position embeddings to each signal's temporal structure through mono-channel DWT analysis.
For input x ∈ ℝ^(B×L×d_x):
Wavelet-Enhanced Patch Embedding
- DWT applied to each channel:
  cA, cD = DWT(x)   # Approximation (low-freq) + Detail (high-freq)
- Wavelet-derived frequency features:
  W_input = cA + α·cD
- Fusion:
  E_patches = [Conv1d(x^T) ; Conv1d(W_input)]^T   # Concatenate
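A minimal, self-contained sketch of these embedding steps is shown below. It assumes a single-level Haar DWT via pytorch_wavelets; the class name, `alpha` handling, and conv hyperparameters are illustrative assumptions, not the repo's exact API (see src/models/embeddings.py for the real code):

```python
import torch
import torch.nn as nn
from pytorch_wavelets import DWT1DForward


class WaveletPatchEmbeddingSketch(nn.Module):
    """Illustrative sketch only; not the repository's implementation."""

    def __init__(self, in_channels, embedding_dim, patch_size, alpha=0.5):
        super().__init__()
        self.alpha = alpha  # weight on detail coefficients (assumed fixed scalar here)
        self.dwt = DWT1DForward(J=1, wave='haar')  # Haar halves the length exactly for even L
        # Each branch supplies half of the embedding channels, so the concat is embedding_dim wide.
        self.time_proj = nn.Conv1d(in_channels, embedding_dim // 2,
                                   kernel_size=patch_size, stride=patch_size)
        self.freq_proj = nn.Conv1d(in_channels, embedding_dim // 2,
                                   kernel_size=patch_size // 2, stride=patch_size // 2)

    def forward(self, x):                    # x: (B, L, d_x)
        x = x.transpose(1, 2)                # -> (B, d_x, L) for Conv1d / DWT
        cA, cDs = self.dwt(x)                # approximation + list of detail bands
        w_input = cA + self.alpha * cDs[0]   # W_input = cA + alpha * cD
        e_time = self.time_proj(x)           # time-domain patch tokens
        e_freq = self.freq_proj(w_input)     # frequency-domain patch tokens
        return torch.cat([e_time, e_freq], dim=1).transpose(1, 2)  # (B, num_patches, embedding_dim)
```

For example, with in_channels=3, embedding_dim=64, patch_size=16 and input of shape (8, 256, 3), both branches yield 16 patches, so the output has shape (8, 16, 64).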
Dynamic Wavelet Positional Encoding (DyWPE)
- Channel projection:
  x_mono = x · w_channel
- Multi-level DWT:
  (cA_J, [cD_J, ..., cD_1]) = DWT(x_mono)
- Gated modulation:
  modulated_coeffs = gate(scale_embeddings, coeffs)
- IDWT synthesis:
  P_DyWPE = IDWT(modulated_coeffs)
Token content and position information are then combined:
  E_final = E_patches + P_DyWPE   # Element-wise addition
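The DyWPE steps can likewise be sketched as below. The gate form, parameter names, and band handling are our assumptions for illustration; the real implementation lives in src/models/dywpe.py:

```python
import torch
import torch.nn as nn
from pytorch_wavelets import DWT1DForward, DWT1DInverse


class DyWPESketch(nn.Module):
    """Illustrative sketch only; not the repository's implementation."""

    def __init__(self, in_channels, levels=3, wave='haar'):
        super().__init__()
        self.channel_proj = nn.Linear(in_channels, 1, bias=False)  # x_mono = x . w_channel
        self.dwt = DWT1DForward(J=levels, wave=wave)               # multi-level DWT
        self.idwt = DWT1DInverse(wave=wave)                        # IDWT synthesis
        # One learnable scale embedding per band: approximation + `levels` detail bands (assumed).
        self.scale_emb = nn.Parameter(torch.zeros(levels + 1))

    def forward(self, x):                              # x: (B, L, d_x)
        x_mono = self.channel_proj(x).transpose(1, 2)  # -> (B, 1, L)
        cA, cDs = self.dwt(x_mono)                     # approximation + per-level detail bands
        # Gated modulation: rescale each band by a sigmoid-gated learned factor (assumed gate form).
        cA = cA * torch.sigmoid(self.scale_emb[0])
        cDs = [cD * torch.sigmoid(self.scale_emb[j + 1]) for j, cD in enumerate(cDs)]
        return self.idwt((cA, cDs)).transpose(1, 2)    # P_DyWPE: (B, L', 1), with L' close to L
```

Note that before the addition E_final = E_patches + P_DyWPE, the positional signal must be brought to the same token resolution and width as E_patches (e.g., patched and projected); that alignment step is omitted from this sketch.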
git clone https://github.com/imics-lab/waveformer.git
cd waveformer
pip install -r requirements.txt

Dependencies (requirements.txt):

torch>=1.9.0
numpy>=1.20.0
pandas>=1.3.0
matplotlib>=3.4.0
scikit-learn>=1.0.0
pytorch_wavelets>=1.3.0
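To verify that the wavelet backend installed correctly, a quick check such as the following (optional, not part of the repo) can be run:

```python
import torch
from pytorch_wavelets import DWT1DForward

dwt = DWT1DForward(J=1, wave='db4')
cA, cDs = dwt(torch.randn(1, 1, 128))  # 1D DWT over a dummy signal
print(cA.shape, cDs[0].shape)          # both roughly half the input length
```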
waveformer/
├── src/
│   ├── models/
│   │   ├── waveformer.py      # Main WaveFormer model
│   │   ├── embeddings.py      # Wavelet-enhanced patch embedding
│   │   ├── dywpe.py           # Dynamic Wavelet Positional Encoding
│   │   └── transformer.py     # Transformer encoder with RPE
│   └── utils/
│       ├── metrics.py         # Evaluation metrics
│       └── visualization.py   # Plotting utilities
├── scripts/
│   ├── run_example.py
│   └── run_ablation_study.py
└── README.md
import torch

from models.waveformer import WaveFormer
# Initialize WaveFormer
model = WaveFormer(
    input_timesteps=SEQ_LENGTH,         # Sequence length
    in_channels=INPUT_CHANNELS,         # Number of input channels
    patch_size=PATCH_SIZE,              # Patch size for embedding
    embedding_dim=EMBED_DIM,            # Embedding dimension
    num_transformer_layers=NUM_LAYERS,  # Number of transformer layers (4, 8, etc.)
    num_heads=N_HEADS,                  # Number of attention heads
    dim_feedforward=DIM_FF,             # Feedforward dimension
    dropout=DROPOUT,                    # Dropout rate (0.1, 0.2, etc.)
    num_classes=NUM_CLASSES,            # Number of output classes
    use_wavelet_embedding=True,         # Enable wavelet-enhanced embedding
    use_dywpe=True,                     # Enable DyWPE
    use_rpe=True                        # Enable bucketing RPE
)
# Forward pass
x = torch.randn(BATCH_SIZE, SEQ_LENGTH, INPUT_CHANNELS) # (batch, sequence, features)
output = model(x)

Our comprehensive evaluation across 8 diverse time series datasets demonstrates WaveFormer's superior performance compared to state-of-the-art deep learning models.
Left: Distribution of z-score normalized classification accuracy across 8 datasets. WaveFormer shows the highest median performance and most consistent results.
Right: Performance advantage versus sequence length. WaveFormer's accuracy improvement over the best baseline correlates positively with sequence length, with largest gains on long sequences.
For detailed experimental results and ablation studies, please refer to our paper.
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.
Please make sure to update tests as appropriate.
This project is licensed under the MIT License - see the LICENSE file for details.
If you find WaveFormer useful for your research, please consider citing this repository using the following information:
@article{irani2026waveformer,
  title={WaveFormer: Wavelet Embedding Transformer for Biomedical Signals},
  author={Habib Irani and Bikram De and Vangelis Metsis},
  journal={arXiv preprint arXiv:2602.12189},
  year={2026}
}