Keynote: Deploying LLM Workloads on Kubernetes by WasmEdge and Kuasar - Tianyang Zhang & Vivian Hu

  • Published Sep 18, 2024
  • Don't miss out! Join us at our next Flagship Conference: KubeCon + CloudNativeCon North America in Salt Lake City from November 12 - 15, 2024. Connect with our current graduated, incubating, and sandbox projects as the community gathers to further the education and advancement of cloud native computing. Learn more at kubecon.io
    Keynote: Deploying LLM Workloads on Kubernetes by WasmEdge and Kuasar - Tianyang Zhang, Huawei Cloud & Vivian Hu, Second State
    LLMs are powerful artificial intelligence models capable of comprehending and generating natural language. However, conventional methods for running LLMs pose significant challenges, including complex package installations, GPU device compatibility concerns, inflexible scaling, limited resource monitoring and statistics, and security vulnerabilities on native platforms. WasmEdge introduces a solution that enables the development of swift, agile, resource-efficient, and secure LLM applications. Kuasar enables running applications on Kubernetes with faster container startup and reduced management overhead. This session will demonstrate running Llama3-8B on a Kubernetes cluster using WasmEdge and Kuasar as container runtimes. Attendees will explore how Kubernetes enhances efficiency, scalability, and stability in LLM deployment and operations.
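    The pattern the abstract describes — scheduling a Wasm-based LLM workload through an alternative container runtime — is typically wired up in Kubernetes via a RuntimeClass. The sketch below is a hypothetical illustration, not the presenters' exact manifests: the handler name (`kuasar-wasm`), the image reference, and the GPU resource line are all assumptions that depend on how containerd and the Kuasar/WasmEdge sandboxer are configured on the node.

```yaml
# Hypothetical sketch of the deployment pattern described in the abstract.
# The handler name and image are placeholders; they must match the node's
# containerd runtime configuration for the Kuasar Wasm sandboxer.
apiVersion: node.k8s.io/v1
kind: RuntimeClass
metadata:
  name: kuasar-wasm
handler: wasm            # assumed containerd runtime handler backed by Kuasar
---
apiVersion: v1
kind: Pod
metadata:
  name: llama3-8b
spec:
  runtimeClassName: kuasar-wasm      # route this Pod to the Wasm runtime
  containers:
    - name: llm
      image: example.registry/llama-api-server:wasm   # placeholder Wasm image
      resources:
        limits:
          nvidia.com/gpu: 1          # assumed GPU request, if the node exposes one
```

With such a RuntimeClass in place, only Pods that opt in via `runtimeClassName` are handed to the Wasm runtime, so conventional containers on the same cluster are unaffected.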
