Cohere For AI - Community Talks: Gwanghyun (Bradley) Kim

Cohere

Views: 72

Published on Nov 12, 2024
BeyondScene: Higher-Resolution Human-Scene Generation With Pretrained Diffusion
Speaker Bio: Gwanghyun (Bradley) Kim is a Ph.D. candidate in Electrical and Computer Engineering (ECE) at Seoul National University (SNU), under the supervision of Prof. Se Young Chun. He completed his M.S. degree at KAIST, where he was advised by Prof. Jong Chul Ye. This year, he is interning at NVIDIA Research, working with Umar Iqbal, Xueting Li, and Ye Yuan. Last year, he interned at Google Research with Alonso Martinez, Krishna Somandepalli, and Yu-Chuan Su. His research focuses on artificial intelligence (AI), particularly in computer vision (CV) and its intersection with Generative AI. His work emphasizes multimodal, high-dimensional, and human-centric generative AI. Gwanghyun has received the Qualcomm Innovation Fellowship and the Yulchon AI Star Scholarship.
Title: BeyondScene: Higher-Resolution Human-Scene Generation With Pretrained Diffusion
Abstract: We propose BeyondScene, a novel framework that overcomes prior limitations, generating exquisite higher-resolution (over 8K) human-centric scenes with exceptional text-image correspondence and naturalness using existing pretrained diffusion models. BeyondScene employs a staged and hierarchical approach. It first generates a detailed base image, focusing on the elements crucial to instance creation for multiple humans and on detailed descriptions beyond the token limit of the diffusion model. It then seamlessly converts the base image to a higher-resolution output that exceeds the training image size and incorporates text-aware and instance-aware details, via our novel instance-aware hierarchical enlargement process, which consists of our proposed high-frequency injected forward diffusion and adaptive joint diffusion. BeyondScene surpasses existing methods in terms of correspondence with detailed text descriptions and naturalness, paving the way for advanced applications in higher-resolution human-centric scene creation beyond the capacity of pretrained diffusion models without costly retraining. Project page: janeyeon.githu...
