diff --git a/src/ai/cv.md b/src/ai/cv.md index 5d70489..dd8d784 100644 --- a/src/ai/cv.md +++ b/src/ai/cv.md @@ -1,3 +1,19 @@ # CV(Computer Vision) -- CS231n: Stanford 的 CV 入门课程 \[[Main Page](http://cs231n.stanford.edu/)\] \[[bilibili](https://www.bilibili.com/video/BV1nJ411z7fe)\] \[[Assignments](http://cs231n.stanford.edu/schedule.html)\] \ No newline at end of file +- CS231n: Stanford 的 CV 入门课程 \[[Main Page](http://cs231n.stanford.edu/)\] \[[bilibili](https://www.bilibili.com/video/BV1nJ411z7fe)\] \[[Assignments](http://cs231n.stanford.edu/schedule.html)\] +- [Awesome-Vision-Attentions](https://github.com/MenghaoGuo/Awesome-Vision-Attentions): Summary of related papers on visual attention. Related code will be released based on Jittor gradually. +- [Transformer-in-Computer-Vision](https://github.com/Yangzhangcst/Transformer-in-Computer-Vision): A paper list of some recent Transformer-based CV works. +- [rese1f/Awesome-VQVAE](https://github.com/rese1f/Awesome-VQVAE): A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application + + +## object detection +- [open-mmlab/mmdetection](https://github.com/open-mmlab/mmdetection): OpenMMLab Detection Toolbox and Benchmark +- [facebookresearch/detectron2](https://github.com/facebookresearch/detectron2): Detectron2 is a platform for object detection, segmentation and other visual recognition tasks. +- [facebookresearch/detr]: End-to-End Object Detection with Transformers + +## segmentation +- [facebookresearch/segment-anything](https://github.com/facebookresearch/segment-anything) + +## Vision-Language Model +- [VLM_survey](https://github.com/jingyi0000/VLM_survey): Vision-Language Models for Vision Tasks: A Survey +- [LLM-in-Vision](https://github.com/DirtyHarryLYL/LLM-in-Vision): Recent LLM-based CV and related works. Welcome to comment/contribute!