DY's DS

Tag: CLIP

4 items with this tag.

  • Oct 29, 2025

    A Zero-Shot Alignment Framework for Independent Multimodal Embeddings via Language Mediation

    • Research_Proposal
    • tactile
    • image
    • multimodal
    • timeseries
    • text
    • llm
    • CLIP
  • Oct 29, 2025

    A Touch, Vision, and Language Dataset for Multimodal Alignment

    • paper
    • llm
    • multimodal
    • tactile
    • vision
    • CLIP
    • LoRA
    • Encoder
    • Decoder
  • Oct 29, 2025

    DeepSeek-OCR: Contexts Optical Compression

    • paper
    • text
    • image
    • OCR
    • RAG
    • chunking
    • CLIP
    • vision
  • Oct 29, 2025

    Reconstruction Alignment Improves Unified Multimodal Models

    • paper
    • multimodal
    • image
    • llm
    • CLIP
    • text
    • embedding
