Summarization with LangChain using LLM - Stuff - Map_reduce - Refine

  • Published Jan 21, 2025

Comments • 12

  • @zerofive3699 • 9 months ago +2

    Really useful info, ma'am. Keep up the good work.

  • @vijaygandhi7313 • 8 months ago +2

    In the abstractive summarization use case, a lot of focus is usually given to the LLMs being used and their performance. Limitations of LLMs, including context length, and ways to overcome them are often overlooked. It's important to make sure that our application is scalable when dealing with large document sizes. Thank you for this great and insightful video.

    • @AboniaSojasingarayar • 8 months ago +1

      Thank you, Vijay Gandhi, for your insightful comment! You've raised an excellent point about the importance of considering the limitations of LLMs in the context of abstractive summarization, especially regarding their context length and scalability issues when dealing with large documents.
      Indeed, one of the significant challenges in using LLMs for abstractive summarization is their inherent limitation in processing long texts due to the maximum token limit imposed by these models. This constraint can be particularly problematic when summarizing lengthy documents or articles, where the full context might not fit within the model's capacity. A common workaround is to chunk the document and combine the chunk summaries, as sketched below.
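
      A minimal sketch of that chunk-and-map_reduce workaround, assuming a local Ollama server with the mixtral model pulled (module paths vary across LangChain versions, and the input file name is a placeholder):

      ```python
      # Split a long document into chunks that fit the context window,
      # then summarize with a map_reduce chain.
      from langchain_community.llms import Ollama
      from langchain.chains.summarize import load_summarize_chain
      from langchain.text_splitter import RecursiveCharacterTextSplitter
      from langchain_core.documents import Document

      llm = Ollama(model="mixtral")  # assumes `ollama pull mixtral` was run

      long_text = open("article.txt").read()  # hypothetical input file

      # Split so each chunk fits comfortably within the model's context limit.
      splitter = RecursiveCharacterTextSplitter(chunk_size=2000, chunk_overlap=200)
      docs = [Document(page_content=c) for c in splitter.split_text(long_text)]

      # map: summarize each chunk independently; reduce: merge the partial
      # summaries into one final summary.
      chain = load_summarize_chain(llm, chain_type="map_reduce")
      print(chain.invoke({"input_documents": docs})["output_text"])
      ```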

  • @alvaroaraujo7945 • 4 months ago +1

    Hey, Abonia. Thanks for the amazing content. I just had one issue, though: on executing the 'map_reduce_outputs' function, I got ConnectionRefusedError: [Errno 61].
    Hope someone knows what it is.

    • @AboniaSojasingarayar • 4 months ago

      @@alvaroaraujo7945 Hello, thanks for your kind words.
      It may be related to your Ollama server. Are you sure Ollama is running? A quick way to check is sketched below.
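
      One way to verify, assuming the `requests` package is available (Ollama listens on port 11434 by default, and [Errno 61] is the macOS "connection refused" errno):

      ```python
      # Quick reachability check for the local Ollama server.
      import requests

      try:
          resp = requests.get("http://localhost:11434", timeout=5)
          print(resp.text)  # the server replies "Ollama is running" when up
      except requests.exceptions.ConnectionError:
          print("Nothing on port 11434; start the server with `ollama serve`.")
      ```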

  • @evellynnicolemachadorosa2666 • 8 months ago +1

    Hello! Thanks for the video. I am from Brazil. What would you recommend for large documents, averaging 150 pages? I tried map_reduce, but the inference time was 40 minutes. Are there any tips for these very long documents?

    • @AboniaSojasingarayar • 8 months ago +1

      Thanks for your kind words, and glad this helped.
      One option is a strategy that combines semantic chunking with K-means clustering to address the model's contextual limitations. By clustering the chunk embeddings and extracting key passages, we reduce the overhead of processing large volumes of text. This approach not only lowers costs significantly by minimizing the number of tokens processed but also mitigates the recency and primacy effects inherent in LLMs, ensuring a balanced consideration of all text segments. A rough sketch of the idea is below.
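
      A rough sketch of that chunk-embed-cluster approach (the chunk sizes, k, and the embedding model here are illustrative assumptions, not the video's code):

      ```python
      # Embed chunks, cluster them with K-means, and keep only the chunk
      # nearest each centroid as that cluster's representative passage.
      import numpy as np
      from sklearn.cluster import KMeans
      from langchain_community.embeddings import OllamaEmbeddings
      from langchain.text_splitter import RecursiveCharacterTextSplitter

      text = open("large_document.txt").read()  # hypothetical 150-page input

      chunks = RecursiveCharacterTextSplitter(
          chunk_size=2000, chunk_overlap=200
      ).split_text(text)

      # Embed every chunk so semantically similar chunks cluster together.
      embeddings = OllamaEmbeddings(model="nomic-embed-text")
      vectors = np.array(embeddings.embed_documents(chunks))

      k = 10  # number of key passages to keep
      kmeans = KMeans(n_clusters=k, random_state=42).fit(vectors)

      # The k representatives stand in for the whole document.
      representatives = [
          chunks[int(np.argmin(np.linalg.norm(vectors - center, axis=1)))]
          for center in kmeans.cluster_centers_
      ]
      ```

      The k representative chunks can then be summarized in a single pass (for example with a stuff chain), which is far faster than mapping over every chunk of a 150-page document.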

    • @VirtualMachine-d8x • 3 months ago +1

      @@AboniaSojasingarayar The video was great and very useful. Can you make a small video on this clustering method using embeddings?

    • @AboniaSojasingarayar • 3 months ago +1

      @@VirtualMachine-d8x Sure, will do. Happy to hear from you again. Thanks for the feedback.

  • @Coff03 • 8 months ago +1

    Did you use an OpenAI API key here?

    • @AboniaSojasingarayar • 8 months ago

      Here we use the open-source Mixtral model via Ollama, but yes, we can use OpenAI models as well; a sketch of the swap is below.
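
      Both back ends expose the same LangChain LLM interface, so swapping them leaves the summarization chains unchanged. A minimal sketch (model names are illustrative; the OpenAI path needs the OPENAI_API_KEY environment variable set):

      ```python
      from langchain_community.llms import Ollama
      from langchain_openai import ChatOpenAI

      llm = Ollama(model="mixtral")            # local, open-source, no API key
      # llm = ChatOpenAI(model="gpt-4o-mini")  # hosted OpenAI alternative
      ```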