Unify
Joined 2 Jan 2021
The Best LLM on Every Prompt ✨
The Janus Interface: How Fine-Tuning in Large Language Models Amplifies the Privacy Risks
In this session, we welcome Xiaoyi Chen from Indiana University who co-authored the paper "The Janus Interface: How Fine-Tuning in Large Language Models Amplifies the Privacy Risks".
About the paper:
--------------------------
The research introduces a novel attack, Janus, which exploits the fine-tuning interface to recover forgotten PIIs from the pre-training data in LLMs. See you there!🤖🧠
🔬The Janus Interface: How Fine-Tuning in Large Language Models Amplifies the Privacy Risks: arxiv.org/pdf/2310.15469
📝 Xiaoyi Chen, Siyuan Tang, Rui Zhu, Shijun Yan, Lei Jin, Zihao Wang, Liya Su, Zhikun Zhang, XiaoFeng Wang, Haixu Tang
Read also:
----------------
📰 The Deep Dive. Follow the latest AI research and industry trends: unifyai.substack.com/
📖 Blogs. Dive into the AI deployment stack: unify.ai/blog
Follow us:
----------------
Website: unify.ai
Github: github.com/unifyai/
Discord: discord.gg/sXyFF8tDtm
Twitter: letsunifyai
Reddit: www.reddit.com/r/unifyai/
#ai #machinelearning #deeplearning
Views: 155
Videos
Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting Explained
576 views · 5 months ago
In this session, we welcome Zilong Wang from the University of California, who co-authored the paper "Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting". About the paper: The research introduces SPECULATIVE RAG, a framework that leverages a larger generalist LM to efficiently verify multiple RAG drafts produced in parallel by a smaller, distilled specialist LM. See...
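The drafting-and-verification idea described above can be sketched roughly as follows. This is a toy illustration, not the paper's implementation: the `specialist_draft` and `generalist_score` functions are hypothetical stand-ins for the two models.

```python
# Minimal sketch of the Speculative RAG idea: a small "specialist" model
# drafts answers from different retrieved-document subsets in parallel,
# and a larger "generalist" model only verifies (scores) the drafts.
from concurrent.futures import ThreadPoolExecutor

def specialist_draft(query, doc_subset):
    # Stand-in for the small distilled LM producing a draft answer.
    return f"draft for '{query}' from {len(doc_subset)} docs"

def generalist_score(query, draft):
    # Stand-in for the large LM scoring a draft; here a toy heuristic.
    return len(draft)

def speculative_rag(query, retrieved_docs, n_subsets=3):
    # Partition retrieved documents into subsets to get diverse drafts.
    subsets = [retrieved_docs[i::n_subsets] for i in range(n_subsets)]
    with ThreadPoolExecutor() as pool:
        drafts = list(pool.map(lambda s: specialist_draft(query, s), subsets))
    # The generalist LM never generates; it only picks the best draft.
    return max(drafts, key=lambda d: generalist_score(query, d))

answer = speculative_rag("what is RAG?", ["d1", "d2", "d3", "d4", "d5"])
print(answer)
```

The point of the structure is that the expensive model does one cheap scoring pass per draft instead of a long generation.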
Knowledge Circuits in Pretrained Transformers Explained
981 views · 5 months ago
In this session, we welcome Yunzhi Yao from Zhejiang University, China, who co-authored the paper "Knowledge Circuits in Pretrained Transformers". About the paper: This paper dives into the computation graph of language models to uncover the knowledge circuits that are instrumental in articulating specific knowledge 🤖🧠 🔬Knowledge Circuits in Pretrained Transformers: arxiv.org/pdf/2405.17969v1 📝...
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models
207 views · 5 months ago
In this session, we welcome Devichand Budagam from IIT Kharagpur, India, who co-authored the paper "Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models". About the paper: This paper introduces the Hierarchical Prompting Taxonomy (HPT), a universal evaluation metric that can be used to evaluate both datasets' complexity and LLMs' capabilities🧠 🔬 Hierarchic...
Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences Explained
293 views · 5 months ago
In this session, we welcome Shreya Shankar from UC Berkeley, who co-authored the paper "Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences". About the paper: This paper introduces EvalGen, an interface that provides automated assistance in generating evaluation criteria and implementing assertions🧠👩💻 🔬 Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences:...
Efficient Multi-Prompt Evaluation Explained
155 views · 6 months ago
In this session, we welcome Felipe Polo from the University of Michigan, who co-authored the paper "Efficient multi-prompt evaluation of LLMs". About the paper: This paper introduces PromptEval, a new method for estimating performance across a large set of prompts, borrowing strength across prompts and examples to produce accurate estimates under practical evaluation budgets. 🔬 Efficient multi...
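The "borrowing strength across prompts" idea can be illustrated with a simple shrinkage estimator: each prompt's accuracy estimate is pulled toward the pooled mean, so prompts with few observations still get reasonable estimates. This is only a toy illustration of the principle, not PromptEval's actual method.

```python
# Toy illustration of borrowing strength across prompts: instead of
# estimating each prompt's accuracy only from its own few observations,
# shrink every per-prompt estimate toward the pooled mean.
def shrunk_estimates(observations, strength=5.0):
    # observations: dict mapping prompt id -> list of 0/1 correctness scores
    all_scores = [s for scores in observations.values() for s in scores]
    pooled = sum(all_scores) / len(all_scores)
    estimates = {}
    for prompt, scores in observations.items():
        n = len(scores)
        raw = sum(scores) / n
        # Weighted average of the prompt's own mean and the pooled mean;
        # prompts with few observations lean more on the pooled estimate.
        estimates[prompt] = (n * raw + strength * pooled) / (n + strength)
    return estimates

obs = {"p1": [1, 1, 0, 1], "p2": [0], "p3": [1, 0]}
print(shrunk_estimates(obs))
```

With a small evaluation budget, "p2" has a single observation, so its estimate stays close to the pooled mean rather than collapsing to 0.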
Influential Data Retrieval Explained
127 views · 6 months ago
In this session, we welcome Huawei Lin from The Rochester Institute of Technology, who co-authored the paper "Token-wise Influential Training Data Retrieval for Large Language Models". About the paper: The authors propose RapidIn, a scalable framework adapting to LLMs for estimating the influence of each training data. The proposed framework consists of two stages: caching and retrieval. 🔬 Tok...
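The two-stage caching-and-retrieval workflow described above can be sketched as follows. The vectors and the "compression" here are toy stand-ins, not RapidIn's actual gradient compression scheme.

```python
# Sketch of a two-stage influence workflow: stage 1 caches a (compressed)
# gradient vector per training example; stage 2 estimates each example's
# influence on a test example as a gradient dot product.
def cache_gradients(train_grads, keep_dims=2):
    # "Compress" by keeping only the first few dimensions (a stand-in
    # for RapidIn's real random-projection-style compression).
    return {name: g[:keep_dims] for name, g in train_grads.items()}

def retrieve_influential(cache, test_grad, top_k=2):
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))
    scores = {name: dot(g, test_grad) for name, g in cache.items()}
    return sorted(scores, key=scores.get, reverse=True)[:top_k]

train_grads = {"ex1": [0.9, 0.1, 0.3], "ex2": [-0.5, 0.2, 0.8], "ex3": [0.4, 0.7, 0.1]}
cache = cache_gradients(train_grads)
print(retrieve_influential(cache, test_grad=[1.0, 0.0]))  # → ['ex1', 'ex3']
```

Caching once and reusing the compressed gradients is what makes the retrieval stage cheap for repeated queries.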
Unify x Qdrant - Complex Agentic RAG Systems ✨
352 views · 6 months ago
Join us as we collaborate with Qdrant to investigate the limitations of traditional naive RAG systems and discover the benefits of advanced agentic RAG systems. In this video, we are joined by expert guest Atita Arora, a solutions architect with 17 years of experience in information retrieval. We will discuss the challenges of traditional RAG systems and explore how the latest...
Gen AI London - LLM Agents For the Enterprise
648 views · 6 months ago
Join us for an informative panel discussion on Generative AI from the recent GenAI London Meetup. Hosted by Ben Ellerby from Aleios, this session brings together industry experts Meryem Arik (Titan ML) and Danilo Poccia (AWS) to explore current trends in enterprise AI. The panel covers key topics, including: • RAG vs Model Context • Trends in Agentic Workflows • Enterprise use cases for Generati...
Unify Project Demo - LLM Resume Analyser
375 views · 6 months ago
We're excited to introduce a project created by Oscar, Sanjay, Jeyabalan, and Maissa in our Contributor Program: an LLM Resume Analyser. The app extracts and analyzes key skills from resumes, provides matching scores for job descriptions, and suggests resume improvements. Try it out for yourself: ai-llm-resume-analyser.streamlit.app/ Check out the Repo: github.com/OscarArroyoVega/LLM_Resume_Ana...
Unify Project Demo - LlamaIndex RAG Playground
141 views · 6 months ago
We are excited to introduce a project that one of our contributors, Abhijeet, has been working on. Abhijeet has developed the LlamaIndex RAG Playground, an application that allows users to upload PDF documents and ask questions about them. In this project, Abhijeet used LlamaIndex to create a RAG (Retrieval-Augmented Generation) application. The application's interface is built with Streamlit, ...
Unify x Baseten - Boost Deployment ✨
128 views · 7 months ago
Supercharge Your LLM Deployment with Unify & Baseten
Watch the full recording of our collaborative webinar with Baseten, featuring expert guest Phil Kiely. This session reveals how to seamlessly integrate Baseten model endpoints into the Unify Platform, unlocking powerful capabilities for your AI projects. In this video, you'll learn: • How to connect Baseten Model Endpoints into the Unify Plat...
YOCO Explained
243 views · 7 months ago
In this session, we welcome Yutao Sun from Tsinghua University, who co-authored the paper "You Only Cache Once: Decoder-Decoder Architectures for Language Models". About the paper: YOCO is a decoder-decoder architecture for LLMs which only caches key-value pairs once to improve inference memory, prefill latency, and throughput across context lengths and model sizes. 🔬 You Only Cache Once: Deco...
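The memory argument behind YOCO can be illustrated with a back-of-the-envelope comparison: a conventional decoder keeps a KV cache per layer, while a decoder-decoder caches key-value pairs once and lets the cross-attention layers reuse them. This is a rough accounting sketch of the headline idea, not the architecture itself.

```python
# Rough comparison of KV cache size: a standard decoder stores one KV
# cache per layer, while a YOCO-style decoder-decoder stores the cache
# once and shares it across all cross-attention layers.
def kv_cache_entries(n_layers, seq_len, architecture):
    if architecture == "standard":
        return n_layers * seq_len  # one KV cache per layer
    elif architecture == "yoco":
        return seq_len             # KV cached once, shared by all layers
    raise ValueError(architecture)

standard = kv_cache_entries(n_layers=32, seq_len=4096, architecture="standard")
yoco = kv_cache_entries(n_layers=32, seq_len=4096, architecture="yoco")
print(standard // yoco)  # the reduction factor scales with the layer count
```

In practice the self-decoder also keeps a small sliding-window cache, so the real saving is somewhat below this idealized factor.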
Unify Project Demo - LLM Debate
90 views · 7 months ago
We are thrilled to introduce a new project that two of our contributors, Ogban and Sanjay, have been working on: the LLM Debate. In this project, they developed a web application that allows two Language Models (LLMs) to engage in a real-time conversation on a topic chosen by the user. Users can select any two models, input a query, and view the dialogue between the LLMs as it unfolds. This app...
Unify Project Demo - Semantic Router
104 views · 7 months ago
We are excited to present a project two contributors have been working on in our Contributor Program. Indiradharshini and Jeyabalan have been working on a Semantic Router. In this project, the team built a web app that dynamically routes user queries to the most appropriate model based on semantic similarity. User queries are sent to predefined routes of maths or coding. Users also have the opt...
MLCon3 2024 - The Best LLM for Every Prompt with Unify
117 views · 7 months ago
The Best LLM on Every Prompt | J12 Day
235 views · 7 months ago
By the way, is the project serving the client?
Thanks, your project is helpful for me
Please leave the Q&A to the end.
Could you share the Python code please?
Thank you for making this open to everyone, I learned a lot. Thanks!
Daniel's comments are just spot on
Amazing explanation, great work.
How does it work? There is no explanation.
Can you share a copy of these slides please?
Great vid, thanks guys
VS Code and Visual Studio are effectively different products... So your title is a bit off.
Great stuff! Thanks a lot. It could be interesting to combine this idea with RAG Fusion.
Thank you for sharing the great presentation!!!!
Totally agree with “engaging in the right process”
Audio is terrible
Great Project
nice
From when is that talk? When will you guys be adding Groq?
It's on the roadmap, thanks for the heads up! Will try to get it added this week.
How do you handle requests with longer context lengths? Do you still use Llama 3 when the request is 100k tokens long?
cool idea.
Let's GO!
I thought you were going to set up Visual Studio, but it was Visual Studio Code instead. I suggest you amend the title of the video.
For those who think "rooter" is the wrong pronunciation, how do you pronounce "roulette" or "routine"? Just wondering...
We also have the ɢᴏᴏsᴇ vowel for _route,_ as well as for _router_ when the router is a person. When it's a device, it gets the ᴍᴏᴜᴛʜ vowel.
Happy for the launch!
"rooter"
british accent coming through strongly here lol
What benefit does this have over just sending the prompt to the LLM of your choice while you're writing the code, instead of having to configure it with these sliders? I feel like I wouldn't know which models are actually being used under the hood, so I couldn't tell whether the optimum one is being used. In code, if I was writing a function to read in a PDF, break it down into parts, and then maybe provide some example code based on those parts, I would just push the PDF to Mixtral or something to read it in and summarize it, since that is not technically difficult, and then pass that summary to something like GPT-4 to generate code based on those summaries, without having to worry about outside configuration or what's actually being used. The benefit I see is that it would be fuzzy logic trying to handle this in one generic call rather than these specific calls, simplifying the development process. But I don't see how it would handle a situation like this, where one model would be very cheap and one would be the most expensive/performant. How would the fuzzy logic handle that? If you say you want to minimize costs, would it just suddenly start using Mixtral/Llama for coding instead of GPT-4/Claude/Gemini?
The router gives full telemetry on what model and provider are being used behind the scenes, and it's also possible to make the final LLM decision on your local system, after just querying the neural scoring function as an API. We will release case studies explaining this soon!
This is forward-looking and impressive!
Interesting
Really interesting product. Congratulations!
substantial approach!
Great approach! Glad to have been a contributor to this amazing organisation
Great!
Wow… Looks really interesting!! Will definitely try it out :)
interesting approach! thanks for sharing your updates!
It's not a 2D convolution, it's a 1D convolution. I'm trying to figure out how that works.
This does a pretty good job :) e2eml.school/convolution_one_d.html
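For readers with the same question, a 1D convolution can be written in a few lines of plain Python. This is a generic illustration (cross-correlation with no padding, as most ML libraries implement "convolution"), not the model's actual code.

```python
# A 1D convolution (really cross-correlation, as in most ML libraries):
# slide the kernel along the signal and take dot products, no padding.
def conv1d(signal, kernel):
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k))
        for i in range(len(signal) - k + 1)
    ]

# A [-1, 1] kernel acts as a discrete difference (edge) detector.
print(conv1d([1, 2, 4, 4, 1], [-1, 1]))  # → [1, 2, 0, -3]
```

The output length is `len(signal) - len(kernel) + 1`, which is why unpadded convolutions shrink the sequence slightly.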
'Promo sm' 🔥
I wonder how much this would transfer to a diffuser. Run a step on the minidiffuser student, denoise that output by one step on the teacher model, then train the student on the teacher's output. So instead of directly copying the teacher's distribution, the student is navigating from its current distribution toward the teacher's.
Yeah I think that could work. Still a few weeks left before the NeurIPS submission deadline 😅
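The idea from the comment above could be sketched, very roughly, as a training loop: the student takes a denoising step, the teacher refines the student's output by one step, and the student is trained toward that refinement. All models here are toy stand-ins; this is not a real diffusion implementation.

```python
# Sketch of the proposed distillation loop: student denoises, teacher
# refines the student's output one step, student is nudged toward the
# teacher's refinement instead of copying the teacher's raw distribution.
class ToyDenoiser:
    def __init__(self, bias):
        self.bias = bias
    def __call__(self, x, t):
        return x * 0.5 + self.bias  # pretend "denoising" toward a mean

def train_step(student, teacher, noisy_sample, t, lr=0.1):
    student_out = student(noisy_sample, t)  # student's one-step denoise
    target = teacher(student_out, t - 1)    # teacher refines it one step
    # Toy "gradient step": nudge the student's bias toward the target.
    error = target - student_out
    student.bias += lr * error
    return abs(error)

student, teacher = ToyDenoiser(bias=0.0), ToyDenoiser(bias=1.0)
errors = [train_step(student, teacher, noisy_sample=2.0, t=10) for _ in range(20)]
print(errors[0] > errors[-1])  # the student drifts toward the teacher
```

The key property the sketch preserves is that the student navigates from its current output toward the teacher's, rather than matching the teacher directly.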
🚀
great talk!
Thanks a lot for hosting! Slides are here: docs.google.com/presentation/d/1aHwDQaGfy2Dg6r68uRH7kFOuTKka3tzZzEIZIf35VNM/edit?usp=sharing&resourcekey=0-q6APQHTzHtB6jqYa-Snk2g There was a small mistake on the Caching slide where the prompts were displayed for the opposite caching strategies, but it's now fixed.
Amazing explanation, professor. A really good overview of accelerator technologies.
Are you able to debug on Docker running on WSL (without Docker Desktop)?
Yes, you can debug on Docker running on WSL without Docker Desktop using Visual Studio Code or other IDEs that support WSL and Docker integration.
@@unifyai PyCharm is supposed to support WSL and Docker integration, but for some reason it is failing on debugging in such a setup
niiiice
Is there any way to run the existing FX pass on the FX graph before modifying it for a custom ASIC?
You don't add any surcharge to the providers' price, OK... but I would almost rather you did take a couple of fractions of a cent for each request, because if not, it makes me wonder how you're actually making money.
After looking at their website, it doesn't look like they are trying to at all. They have a contributor program. It's a volunteer program with like 3 or 4 tiers. I almost applied because it sounds nice to work on a team again, but nope, not working for free or even for less than $20/hr. I hate it, but we live in CAPITALISM; continuing to be an alive person takes money. :( If I had all the money I needed to continue to be an alive person, I would apply to join them in an instant.
We plan on incorporating cost-saving dynamic routing between providers on a prompt-by-prompt basis, and then charging some % of the costs we're able to save for each user. Either way, we're determined that the user doesn't see any price inflation beyond the pay-per-compute rates when using our Model Hub. Happy to answer any other questions! :)
Thanks for the message @EternalKernel. Our contributor program is aimed at people who are starting out in the field, and we provide ongoing mentorship, weekly meetings and guidance for such people, as they work alongside our team and we help them start their OS + ML journey. It's not the same as an internship, and the target audience is different. Seems like our program isn't the best fit for you, which is totally fine. We wish you the best of luck landing your next internship! 💪
Would be nice to be able to compare different models with each other in the same graph
You read our minds! Here th-cam.com/video/o8yD_QBhmsw/w-d-xo.html I mention our plans to incorporate exactly this feature. It should hopefully be live in the next week or two!
cool stuff