Development
Prerequisites
| Requirement |
Version |
Notes |
| Python |
>=3.12 |
Required by pyproject.toml |
| uv |
Latest |
Package manager and build tool |
| CUDA GPU |
Optional |
Required for Triton kernels and GPU tests |
Setup
git clone https://github.com/Alberto-Codes/turboquant-vllm.git
cd turboquant-vllm
uv sync --extra vllm
Testing
# All tests (CPU)
uv run pytest tests/ -v
# Unit tests only
uv run pytest -m unit
# GPU tests
uv run pytest -m gpu
# Specific test file
uv run pytest tests/test_per_layer_cosine.py -v
Linting
uv run ruff check .
uv run ruff format .
uv run ty check
uv run docvet check --all
Building
uv build
# dist/turboquant_vllm-*.whl
# dist/turboquant_vllm-*.tar.gz
Documentation
uv sync --extra docs
uv run mkdocs serve # local preview at http://127.0.0.1:8000
uv run mkdocs build # build to site/