Computer Vision Meetup: Using Machine Vision to Create Sustainable Practices in Fisheries

Computer Vision Meetup: CLIP: Insights into Zero-Shot Image Classification with Mutual Knowledge

Crafting Qubits: Harnessing Quantum Mechanics for Computation

ซินเดอเรลล่ากลายเป็นภรรยาของลุงสุดหล่อหลังจากคืนโรแมนติกนั้น ไม่รู้ว่าเธอได้พบกับมหาเศรษฐี

#โด่งดัง!ญี่ปุ่นซูฮก บอลอาเซียนเร้าใจ!! โค๊ชสิงคโปร์พูดแบบนี้ถึงไทย!! มาเลย์ขอบคุณไทยที่ให้ชีวิต..?

#WOWxดราม่าคอมเม้นแฟนบอลอาเซียน ตะลึง!! แห่ชื่นชมสปิริตทีมชาติไทย หลังเกมส์พลิกชนะสิงคโปร์ 4-2

Computer Vision Meetup: No "Zero-Shot" Without Exponential Data

Voxel51

มุมมอง 60

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 9 ก.พ. 2025
Web-crawled pretraining datasets underlie the impressive “zero-shot” evaluation performance of multimodal models. However, it is unclear how meaningful the notion of “zero-shot” generalization is for such multimodal models, as it is not known to what extent their pretraining datasets encompass the downstream concepts targeted for during “zero-shot” evaluation. In this work, we ask: How is the performance of multimodal models on downstream concepts influenced by the frequency of these concepts in their pretraining datasets?
Through thorough experiments, we consistently find that, far from exhibiting “zero-shot” generalization, multimodal models require exponentially more data to achieve linear improvements in downstream “zero-shot” performance, following a sample inefficient log-linear scaling trend. Furthermore, upon benchmarking models on long-tailed data sampled based on our analysis, we demonstrate that multimodal models across the board perform poorly. Taken together, our study reveals an exponential need for training data which implies that the key to “zero-shot” generalization capabilities under large-scale training paradigms remains to be found.
Read the paper: arxiv.org/abs/...
#computervision #ai #artificialintelligence #machinevision #machinelearning #datascience #NeurIPS #NeurIPS2024

ความคิดเห็น •

ต่อไป

เล่นอัตโนมัติ

Computer Vision Meetup: Using Machine Vision to Create Sustainable Practices in Fisheries

Computer Vision Meetup: Using Machine Vision to Create Sustainable Practices in Fisheries

Computer Vision Meetup: CLIP: Insights into Zero-Shot Image Classification with Mutual Knowledge

Computer Vision Meetup: CLIP: Insights into Zero-Shot Image Classification with Mutual Knowledge

Crafting Qubits: Harnessing Quantum Mechanics for Computation

Crafting Qubits: Harnessing Quantum Mechanics for Computation

ซินเดอเรลล่ากลายเป็นภรรยาของลุงสุดหล่อหลังจากคืนโรแมนติกนั้น ไม่รู้ว่าเธอได้พบกับมหาเศรษฐี

ซินเดอเรลล่ากลายเป็นภรรยาของลุงสุดหล่อหลังจากคืนโรแมนติกนั้น ไม่รู้ว่าเธอได้พบกับมหาเศรษฐี

#โด่งดัง!ญี่ปุ่นซูฮก บอลอาเซียนเร้าใจ!! โค๊ชสิงคโปร์พูดแบบนี้ถึงไทย!! มาเลย์ขอบคุณไทยที่ให้ชีวิต..?

#โด่งดัง!ญี่ปุ่นซูฮก บอลอาเซียนเร้าใจ!! โค๊ชสิงคโปร์พูดแบบนี้ถึงไทย!! มาเลย์ขอบคุณไทยที่ให้ชีวิต..?

#WOWxดราม่าคอมเม้นแฟนบอลอาเซียน ตะลึง!! แห่ชื่นชมสปิริตทีมชาติไทย หลังเกมส์พลิกชนะสิงคโปร์ 4-2

#WOWxดราม่าคอมเม้นแฟนบอลอาเซียน ตะลึง!! แห่ชื่นชมสปิริตทีมชาติไทย หลังเกมส์พลิกชนะสิงคโปร์ 4-2

Computer Vision Meetup: Intrinsic Self-Supervision for Data Quality Audits

Computer Vision Meetup: Intrinsic Self-Supervision for Data Quality Audits

SimpleAF: an augmented execution context for single-cell data preprocessing with alevin-fry

SimpleAF: an augmented execution context for single-cell data preprocessing with alevin-fry

Attention in transformers, step-by-step | DL6

Attention in transformers, step-by-step | DL6

Lecture 3 | Loss Functions and Optimization

Lecture 3 | Loss Functions and Optimization

Stanford Webinar - Large Language Models Get the Hype, but Compound Systems Are the Future of AI

Stanford Webinar - Large Language Models Get the Hype, but Compound Systems Are the Future of AI

Accelerating scientific discovery with AI

Accelerating scientific discovery with AI

Visual AI for Geospatial: Evaluating Earth Observation Foundation Models

Visual AI for Geospatial: Evaluating Earth Observation Foundation Models

Think Fast, Talk Smart: Communication Techniques

Think Fast, Talk Smart: Communication Techniques

Body Language Expert: Stop Using This, It’s Making People Dislike You, So Are These Subtle Mistakes!

Body Language Expert: Stop Using This, It’s Making People Dislike You, So Are These Subtle Mistakes!

#นายกแพทองธาร ลงพื้นที่มอบถุงยังชีพ บริเวณ ซ.พัฒนาการคูขวาง ๑๐ (ถ.ท่าโพธิ์) จ.นครศรีธรรมราช

#นายกแพทองธาร ลงพื้นที่มอบถุงยังชีพ บริเวณ ซ.พัฒนาการคูขวาง ๑๐ (ถ.ท่าโพธิ์) จ.นครศรีธรรมราช

🔴LIVE สด! PGC 2024 ศึกชิงแชมป์โลกพับจี Circuit 3 วันที่ 2

🔴LIVE สด! PGC 2024 ศึกชิงแชมป์โลกพับจี Circuit 3 วันที่ 2

ใครคือฆาตกรตัวจริง ?! EP.11 (ver. คืนคริสมาสต์ สุดสยอง !!!

ใครคือฆาตกรตัวจริง ?! EP.11 (ver. คืนคริสมาสต์ สุดสยอง !!!

Highlight | อัจฉริยะสาวไส้...เบื้องลึกเหตุยิง "สจ.โต้งปราจีนบุรี" | เปิดโต๊ะข่าว | 17 ธ.ค.67

Highlight | อัจฉริยะสาวไส้...เบื้องลึกเหตุยิง "สจ.โต้งปราจีนบุรี" | เปิดโต๊ะข่าว | 17 ธ.ค.67

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

ต้าห์อู๋-ออฟโรด ขอฝึกวิชาเซียน จับหมูป่ามือเปล่า | เฮ็ดอย่างเซียนหรั่ง FULL EP.21 | One Playground

นี่ไม่ใช่ลูกผม ผม63ปีแล้ว ผมแก่เกินจะมีลูก #สาระแทบไม่มี

นี่ไม่ใช่ลูกผม ผม63ปีแล้ว ผมแก่เกินจะมีลูก #สาระแทบไม่มี

🔴LIVE โหนกระแส ศึกชิงมรดก 500 ล้าน ทายาทฟ้องเด็กรับใช้ปลอมลายเซ็น

🔴LIVE โหนกระแส ศึกชิงมรดก 500 ล้าน ทายาทฟ้องเด็กรับใช้ปลอมลายเซ็น

คริสต์มาสมรณะ | Who Are You EP.7 ( Edwin )

คริสต์มาสมรณะ | Who Are You EP.7 ( Edwin )