DY's DS

Tag: CLIP

4 items with this tag.

Oct 29, 2025
언어 매개를 통한 독립적 멀티모달 임베딩의 제로샷 정렬 프레임워크
Oct 29, 2025
A Touch, Vision, and Language Dataset for Multimodal Alignment
Oct 29, 2025
DeepSeek-OCR Contexts Optical Compression
- paper
- text
- image
- OCR
- RAG
- chunking
- CLIP
- vision
Oct 29, 2025
RECONSTRUCTION ALIGNMENT IMPROVES UNIFIED MULTIMODAL MODELS

Created with Quartz v4.5.2 © 2026

GitHub
Discord Community