Comments

@TechTechPotato a day ago ⁺³
Written version: morethanmoore.substack.com/p/the-future-of-big-iron-telum-ii-and
@esra_erimez a day ago ⁺¹⁶
I'm shocked that I understood a lot of this conversation. I think it is a testament to Ian's interview capabilities.
@capability-snob a day ago ⁺²³
IBM are totally hiring! Let's hope they can find a replacement for the manager who laid everyone off, assuming they could be replaced with AI.
@lbgstzockt8493 a day ago ⁺⁵
I wonder what working for them would be like, my intuition tells me it's a slow, big company similar to a government institution, but maybe I am wrong.
@cryptocsguy9282 a day ago
@lbgstzockt8493 I bet you're right, judging by their utter failure to keep up with newer tech companies like Microsoft, Apple, IBM etc. and their exit from consumer hardware 20 years ago, but idk since I don't work for them
@henrikoldcorn 23 hours ago
@cryptocsguy9282 IBM has more than 250k employees, they seem to be doing just fine. I don’t know _what_ they’re doing, but clearly someone values it.
@TechTechPotato 23 hours ago
It's mostly consulting
@tristan7216 a day ago ⁺³
Mainframes are a whole other world. 320 MB of cache?! I took a tour of Fishkill when I was in high school in the 80s. They were making these liquid-cooled multilayer ceramic modules with chips in them instead of circuit boards. A whole other world.
@LethalBB a day ago ⁺⁴
15 mins in and this is grrrreat! More of this.
@Lossmars 3 hours ago
Note that there is a sound echo issue with your microphone and an overall background noise each time that someone speaks. Otherwise thank you very much for this super interesting interview.
@lahma69 a day ago ⁺¹
As always, a very interesting interview with a very interesting person! Thanks for the excellent content Ian.
@ConsistentlyAwkward a day ago ⁺¹
Thank you for doing this conversation, cuz how IBM is doing AI is really fascinating to me
@EyesOfByes 11 hours ago
IBM Fellow = respect
@fracturedlife1393 a day ago ⁺²
You always crack me up, must be the way you Telum
@freddellmeister a day ago ⁺⁴
Interesting to hear about the AI accelerator development and the argument for a shared accelerator as opposed to a distributed accelerator ("Power"). You might also want to probe more into the fact that there was no Power announcement at Hot Chips this year, as Power and z are usually in cadence when it comes to the Samsung manufacturing process and should have launched simultaneously. Whatever is portrayed as Power "next gen" might be a simple rebadge of the existing generation from a CPU and systems perspective.
@Razzbow a day ago ⁺³
Anything about PPC10?
@billlodhia5640 a day ago ⁺⁴
What about it? P10 has been around for a couple of years now. P11 is upcoming and OP10 is still in the weeds due to the blobs
@freddellmeister 17 hours ago
@billlodhia5640 please share how P11 differs from P10. Any changes except for the name?
@wolpumba4099 23 hours ago
*IBM's Telum II Processor and Spyre AI Accelerator: A Conversation with Dr. Christian Jacobi*
* *0:00 Introduction:* Discussion about IBM's role in enterprise computing with its Z architecture, focusing on the new Telum II processor and Spyre AI accelerator.
* *2:10 Dr. Jacobi's Background:* Dr. Jacobi, IBM Fellow and CTO of Systems Development, discusses his 22-year career at IBM, starting with the Cell processor and including leadership roles in z14 and Telum development.
* *3:00 IBM Fellow:* The title signifies the highest technical level at IBM, with responsibilities for advising on broad technical direction.
* *7:30 Z Architecture Ethos:* Focus on high availability, security, and scalability, emphasizing a design-for-purpose approach tailored to mission-critical workloads.
* *8:40 High Availability:* Defined as "eight nines" of availability (99.999999%), which works out to roughly one hour of downtime every 11,400 years.
* *9:10 Monolithic Design:* Shift from multi-chip modules to a monolithic design for Telum, driven by efficiency gains and the ability to integrate more features like AI acceleration and post-quantum security.
* *11:00 Virtual Cache Hierarchy:* Telum utilizes a virtual cache hierarchy, leveraging underutilized L2 cache as virtual L3 and L4, improving effective cache capacity.
* *12:10 Core Design for Reliability:* Emphasis on error detection and recovery mechanisms built into the core design, including redundant cache line support and architectural state checkpoints.
* *14:50 Virtual Cache Performance:* Virtual cache design eliminates the need for replicating cache lines multiple times, leading to efficiency gains. Telum II increases total L2 SRAM from 256MB to 360MB.
* *16:20 Integrated AI:* The integrated AI accelerator is designed to address customer needs for infusing AI into transaction processing at millisecond latency.
* *18:10 AI Utilization:* The centralized AI accelerator offers more compute capacity compared to a distributed approach, efficiently serving the needs of individual cores as required.
* *19:50 Customer & Research Collaboration:* AI integration was driven by customer demand and collaboration with data scientists and application developers, along with insights from IBM Research.
* *21:10 AI Model Types:* Telum supports both smaller, low-latency models for real-time inference and large language models (LLMs), often used in ensemble methods for improved accuracy.
* *23:50 Built-in DPU:* The integrated DPU handles IO, cryptography, and connects to the Spyre AI accelerator, enhancing performance and enabling expansion of AI capabilities. It has direct access to memory and has its own L2 cache.
* *27:40 Spyre AI Accelerator:* A second-generation AI chip optimized for LLMs, supporting use cases like code assist and general admin assistance within a secure environment. Up to eight can be clustered.
* *30:00 Expanded AI Performance:* The Spyre accelerator, particularly in clustered configurations, enables larger-scale AI workloads within the Z ecosystem.
* *31:40 Samsung Foundry Partnership:* IBM utilizes Samsung's 5nm high-performance process for both Telum II and Spyre, highlighting a positive relationship and successful results.
* *32:40 AI in Chip Design:* IBM is exploring the use of AI in chip design for tasks like simulation screening and knowledge management.
* *35:10 LinuxONE Response:* Positive market response to LinuxONE, which leverages the Z architecture for Linux-based workloads and is the fastest-growing area of the Z business.
* *37:00 Key Takeaways:* Telum II and Spyre represent IBM's continued innovation in the enterprise chip space, delivering high-performance, secure, and scalable solutions. IBM is actively hiring.
I used gemini-1.5-pro-002 on rocketrecap dot com to summarize the transcript.
Cost (if I didn't use the free tier): $0.03
Input tokens: 23443
Output tokens: 859
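For anyone who wants to sanity-check the availability figure in the summary, here is a rough sketch of the arithmetic. It assumes "N nines" means an unavailability of 10^-N and an average year of 365.25 days; the function name `years_per_hour_of_downtime` is made up for illustration and is not from the interview.

```python
# Assumption: an average year of 365.25 days (8766 hours).
HOURS_PER_YEAR = 365.25 * 24


def years_per_hour_of_downtime(nines: int) -> float:
    """Years it takes to accumulate one hour of downtime at 'nines' nines.

    Assumption: N nines of availability means a fraction 10**-N of time is down.
    """
    unavailability = 10.0 ** -nines
    downtime_hours_per_year = unavailability * HOURS_PER_YEAR
    return 1.0 / downtime_hours_per_year


if __name__ == "__main__":
    for nines in (6, 8):
        availability_pct = 100.0 * (1.0 - 10.0 ** -nines)
        years = years_per_hour_of_downtime(nines)
        print(f"{nines} nines = {availability_pct:.6f}% -> one hour down every {years:,.0f} years")
```

Under these assumptions, eight nines lands close to the 11,400-year figure quoted at 8:40, while six nines would be only a bit over a century.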
@henrikoldcorn 23 hours ago
18:00 part of the chip, part of the core, part of the chip, part of the core…
@seanoneill9130 a day ago ⁺²
I'm more of a chap than a fellow.
@talkingonthespectrum a day ago ⁺¹
Oooo let's see what's going on
@esra_erimez a day ago
Indeed!
@viggokallman1649 a day ago ⁺¹
First.