no code implementations • 5 Feb 2024 • Dunam Kim, Seokju Lee
Recent studies on generalizing CLIP for monocular depth estimation reveal that CLIP pre-trained on web-crawled data is inefficient for deriving proper similarities between image patches and depth-related prompts.
no code implementations • 27 Feb 2023 • Dunam Kim, Jeeeun Kim
We propose a new technique for computational language representation called elementwise embedding, in which a material (semantic unit) is abstracted into a horizontal concatenation of lower-dimensional element (character) embeddings.
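As a rough illustration of the idea of concatenating character-level embeddings, here is a minimal sketch. It is not the authors' implementation: the dimensions, the fixed maximum length, the zero padding, the handling of unknown characters, and the randomly initialized table (`ELEMENT_DIM`, `MAX_ELEMENTS`, `element_table`, `embed_material`) are all assumptions made for this example.

```python
import numpy as np

ELEMENT_DIM = 4    # assumed size of each element (character) embedding
MAX_ELEMENTS = 6   # assumed maximum number of elements per material
VOCAB = "abcdefghijklmnopqrstuvwxyz"

rng = np.random.default_rng(0)
# Hypothetical embedding table; row 0 is reserved for padding and kept at zero.
element_table = rng.normal(size=(len(VOCAB) + 1, ELEMENT_DIM))
element_table[0] = 0.0
char_to_index = {ch: i + 1 for i, ch in enumerate(VOCAB)}


def embed_material(material: str) -> np.ndarray:
    """Horizontally concatenate the element embeddings of one material."""
    chars = material.lower()[:MAX_ELEMENTS]
    # Characters outside VOCAB fall back to the padding row (an assumption).
    indices = [char_to_index.get(ch, 0) for ch in chars]
    # Pad short materials so every output has the same length.
    indices += [0] * (MAX_ELEMENTS - len(indices))
    return np.concatenate([element_table[i] for i in indices])


vec = embed_material("depth")
print(vec.shape)  # (24,)
```

Under these assumptions, every material maps to a vector of length `MAX_ELEMENTS * ELEMENT_DIM`, with each character occupying its own fixed slice of that vector.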