OmniCaptioner: One Captioner to Rule Them All
## 💻 Finetuning Code
Coming Soon
## 🚀 Inference Code
- Python >= 3.10.0 (Recommend to use [Anaconda](https://www.anaconda.com/download/#linux) or [Miniconda](https://docs.conda.io/en/latest/miniconda.html))
- [PyTorch >= 2.0.1+cu12.1](https://pytorch.org/)
```bash
git clone https://github.com/NVlabs/Sana.git
cd Sana
./environment_setup.sh sana
```
- Prepare the prompts in asset/samples/samples.txt
```
python scripts/inference.py \
--config=configs/sana_config/1024ms/Sana_1600M_img1024.yaml \
--model_path=hf://U4R/Sana_trainwithOmnicap/sana_omnicaptioner.pth
```
## Citation
If you find the provided code or models useful for your research, consider citing them as:
```
```