OmniCaptioner: One Captioner to Rule Them All

HomePage   |    Github   |    Paper  

## 💻 Finetuning Code Coming Soon ## 🚀 Inference Code - Python >= 3.10.0 (Recommend to use [Anaconda](https://www.anaconda.com/download/#linux) or [Miniconda](https://docs.conda.io/en/latest/miniconda.html)) - [PyTorch >= 2.0.1+cu12.1](https://pytorch.org/) ```bash git clone https://github.com/NVlabs/Sana.git cd Sana ./environment_setup.sh sana ``` - Prepare the prompts in asset/samples/samples.txt ``` python scripts/inference.py \ --config=configs/sana_config/1024ms/Sana_1600M_img1024.yaml \ --model_path=hf://U4R/Sana_trainwithOmnicap/sana_omnicaptioner.pth ``` ## Citation If you find the provided code or models useful for your research, consider citing them as: ``` ```