We tour the world's fastest super computer at Oak Ridge National Laboratory!
ฝัง
- เผยแพร่เมื่อ 6 พ.ค. 2024
- Everything Art of Network Engineering: linktr.ee/artofneteng
In this video we get a tour of the world's fastest super computers, Frontier and Summit, at @OakRidgeNationalLab! Both of these High Performance Computing (HPC) environments have played significant rolls in various areas of research.
Our tour guide, Daniel Pelfrey, Principal HPC Network Engineer, takes us through the challenges of Networking in an HPC environment, and some of them might surprise you!
A huge thank you to our friends at the Knoxville Technology Council for connecting us with the Oakridge National Laboratory.
Also, thank you to Kate and Daniel for the tour of ORNL, the super computers there, and for making this video possible!
Chapters
-------------------
0:00 Intro
00:58 What's the high level mission of ORNL?
02:07 What makes a High Performance Computing Environment different from Enterprise Networks?
04:09 HPC Network Design
05:09 Introduction to the Frontier Super Computer
06:40 Inside the Frontier Data Center!
09:54 We get to peek inside a Frontier cabinet!
15:12 The Summit Super Computer
17:03 HPC Environment Operations
19:24 The teams that keep these HPC environments going
20:20 The Why
21:43 Wrap up
22:36 Outro - วิทยาศาสตร์และเทคโนโลยี
"And is 2 Exaflops" big smile 😊 Nice to see someone that is passionate about his work!
Biggest difference between HPC networks and corporate networks is lack of security in favor of performance at all costs. The compute nodes directly access remote memory over the network RoCE
ORNL is my dream job. I'd honestly sweep floors just to be in the building.
It isn't all rainbows and unicorns
Sorry for nitpicking but he got one thing wrong. The reason you don't use electrical network cables for longer distances is not primarily because of interference from the power cables but has all to do with attenuation.
At these speeds it is very hard to get the signal more than a few meters, it will be heavily attenuated and very hard to distinguish a 1 from a 0. The solution to the problem is to use fibre optics instead.
Great video. Thanks for taking us along with you all!
Take note of the power cables for each rack, similar to the amount a large house might use, per rack. Removing the heat from those racks is a big part of the design. Air flows from the floor and out the top in active exhausts. A little hard to believe, but compactness is a top priority.
incredibly good interview you did there.
Thank you!
Awesome tour/interview. Dan seems like a real genuine dude. 👍
After working with one we heard the gruntiest one is in Japan now rather than Oakridge.
I would love to work there. Tired of making 10 gb as fast as possible. Mind you, I got into a terraflop
The power supply behind them is unbelievable. Enough for a town.
Simply Awesome!
This is amazingly quiet for a system of that size
Water cooled! The other half of the data center not shown in the video was all storage and that side was LOUD!
Great Video, thank you. Interesting would have been the type of Failures they see - Overheating, Bad Solder, Caps fail, Fans/Plumbing fails etc
We did learn that they have full service staff provided by OEMs of the supercomputer. They were there performing maintenance that day. Our POC didn't have specifics on hardware failures of the HPC environment, I'll see if he has anything on the networking components.
L3.cache errs for 1
No matter how big or small:
The network IS the computer…
For the past few decades, outside of embedded applications (and even in many situations there), computers have to be connected to a network to have any practical value; every piece of software, and most if not all its data, is sent over a network at some time in its lifecycle.
Never underestimate the bandwidth of a FedEx truck.
Where old system goes, eBay?
In supercomputing it's either Network or Notwork :DD
thats really funny lil buddy 😊
So SkyNet is a Tennesseean.
But can it run crysis?
no
You spelled Doom wrong.
how many chrome tabs can it handle?
I’m surprised they can even talk in there. I’ve been in some major data centers and communication can be difficult.
They were water cooled so no fans on that side of the DC. The other side was storage which still had traditional cooling and was very loud!
@@artofneteng Ah, that makes sense.
Panduit.
No way do you get access to the world's fastest computer...
Hypersonic missile systems are classified :P
Open research, class are in other DCs
it should mine BTC :P
OMG he asks so many stupid and repeated questions about the network cables....
Some clarifying questions never hurt, and this channel is Network Engineering focused.