Youngjoon Jang's picture

Youngjoon Jang

yjoonjang

·

https://yjoonjang.github.io/

AI & ML interests

Information Retrieval (IR), Retrieval-Augmented Generation (RAG)

Recent Activity

liked a dataset 9 days ago

hotchpotch/NanoMIRACL

upvoted a collection 22 days ago

jina-embeddings-v5-text

upvoted a collection 28 days ago

View all activity

Organizations

upvoted a collection 22 days ago

jina-embeddings-v5-text

Our 5th-gen embeddings: two lightweight multilingual models with SOTA performance in retrieval, matching, clustering, and classification. • 29 items • Updated 13 days ago • 35

upvoted a collection 28 days ago

pplx-embed

Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 14 days ago • 87

upvoted a collection 2 months ago

KoViDoRe Benchmark (BEIR) v2

Korean Vision Document Retrieval Benchmark • 4 items • Updated 10 days ago • 5

upvoted an article 3 months ago

Article

Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries

Dec 22, 2025

•

9

upvoted an article 5 months ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

+4

Oct 1, 2025

•

138

upvoted a collection 7 months ago

PIXIE-Preview

Information retrieval models • 5 items • Updated 10 days ago • 5

upvoted a paper 7 months ago

SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens

Paper • 2508.05305 • Published Aug 7, 2025 • 47

upvoted a paper 10 months ago

Crosslingual Reasoning through Test-Time Scaling

Paper • 2505.05408 • Published May 8, 2025 • 8

upvoted a collection 11 months ago

HyperCLOVA X SEED

HyperCLOVA X SEED is NAVER's lightweight open-source lineup with a strong focus on Korean language performance • 6 items • Updated Dec 24, 2025 • 41

upvoted a paper 11 months ago

MIEB: Massive Image Embedding Benchmark

Paper • 2504.10471 • Published Apr 14, 2025 • 21

upvoted a collection 12 months ago

EXAONE-Deep

EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 10 items • Updated Jul 7, 2025 • 96

upvoted a collection over 1 year ago

Magpie-Llama3.1 Datasets

Dataset built with Meta Llama 3.1 70B. • 6 items • Updated Jan 13, 2025 • 4