projects(scratch)
- ripple [text-image semantic search and tagging with CLIP]
- shira [neural audio search/retrieval based on CLAP]
- mini_pixart [minimal pytorch implementation of Pixart text2image model, pytorch]
- aaliyah [mobilenet/convnext vision impls., reusable for image classification]
- lila [vision transformer/masked autoencoder in pytorch]
- shiryoku [vision model for image captioning, CNN+RNN]
- mini_clip [minimal implementation of the CLIP paper]
- soundnet [music/audio classification, pytorch]
- paint_diffuse [a mini-diffusion model for unconditional image(painting) generation]
- ..and many others on my github.