There is an update on Phi-3 Vision's Hugging Face page. Now you don't need to comment out lines in the code files to run the model without flash attention; you just need to load the model in eager mode. (huggingface.co/microsoft/Phi-3-vision-128k-instruct#sample-inference-code)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="cuda", trust_remote_code=True, torch_dtype="auto", _attn_implementation='eager') # use _attn_implementation='eager' to disable flash attention
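For anyone who wants a fuller picture, here is a minimal sketch of how that eager-mode load fits into an inference run. It assumes `transformers` with `trust_remote_code` support and a CUDA GPU; the `<|user|>`/`<|image_1|>` prompt format follows the model card's sample, and the model/processor calls at the bottom are left commented since they download several GB of weights.

```python
# Sketch only: eager-mode loading for Phi-3 Vision, per the updated
# sample code on the model's Hugging Face page (flash attention disabled).
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Phi-3-vision-128k-instruct"

def build_prompt(user_text: str) -> str:
    # Phi-3 Vision chat format: image placeholder, then the question,
    # then the assistant turn the model should complete.
    return f"<|user|>\n<|image_1|>\n{user_text}<|end|>\n<|assistant|>\n"

if __name__ == "__main__":
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        device_map="cuda",
        trust_remote_code=True,
        torch_dtype="auto",
        _attn_implementation="eager",  # disable flash attention
    )
    processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
    # With a PIL image loaded as `image`, generation would look like:
    # inputs = processor(build_prompt("Extract the text from this page."),
    #                    [image], return_tensors="pt").to("cuda")
    # out = model.generate(**inputs, max_new_tokens=500,
    #                      eos_token_id=processor.tokenizer.eos_token_id)
```

On CPU-only machines you would swap `device_map="cuda"` for `device_map="cpu"`, but note the comments below about memory.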
I just tried this model on my CPU. It appears the model loads successfully, but it has been stuck running without producing any output so far. My system has 8 GB of RAM. Could that limitation be the reason it isn't working?
Quite enriching video. I will be trying it and letting you know my experience.
How do we get the bounding boxes of the OCR text using Phi-3?
Hey man, thank you!
I mean, cool, but if you really can't run it locally, you likely have bigger issues. The Phi-3 model is small enough that it can run almost anywhere.
Awesome video, but this model is unreliable. It extracts text on some pages; other times it just stops midway or returns a blank output. I thought it was surely due to the low GPU power of the T4, so I tried it directly on Azure, and it produced the same outcome.
Try changing the prompt and testing it out. If it still doesn't work, you might need to fine-tune this model on domain-specific documents.