The Petabyte Pi Project

แชร์
ฝัง
  • เผยแพร่เมื่อ 22 พ.ย. 2024

ความคิดเห็น • 2.4K

  • @izzieb
    @izzieb 2 ปีที่แล้ว +6373

    More bottlenecks than a Coca Cola factory.

  • @bartz0rt928
    @bartz0rt928 2 ปีที่แล้ว +2441

    60 HDDs in RAID 0 is the definition of "all gas no brakes"

    • @staceixan
      @staceixan 2 ปีที่แล้ว +94

      Glorious, glorious RAID 0 🤩

    • @aim-at-me
      @aim-at-me 2 ปีที่แล้ว +84

      glass cannon lol

    • @monad_tcp
      @monad_tcp 2 ปีที่แล้ว +36

      Who needs raid, why aren't you using LVM and just mounting it as direct-IO.

    • @ampex189
      @ampex189 2 ปีที่แล้ว +12

      And no steering

    • @himmelsrand7527
      @himmelsrand7527 2 ปีที่แล้ว +27

      @@monad_tcp Who needs LVM. Just use ZFS.

  • @RaidOwl
    @RaidOwl 2 ปีที่แล้ว +1888

    A petabyte and a raspberry pi, the crossover we didn’t know we needed. This is sick.

    • @plica06
      @plica06 2 ปีที่แล้ว +17

      Except it's so slow it's like writing to one of those fake Chinese 2TB USB keys! Yes I fell for that scam on eBay.

    • @adrianteri
      @adrianteri 2 ปีที่แล้ว +4

      @@plica06 Why would you want a 2TB Thumb drive/Flash drive instead of an external HDD? They wear more quickly due to storage technology. Heck even utilities like #Ventoy now support HDDs which are more reliable and you can carry more OSes...

    • @AmruthReddi
      @AmruthReddi 2 ปีที่แล้ว +8

      literally Pi-tabyte lol

    • @dieSpinnt
      @dieSpinnt 2 ปีที่แล้ว

      @@adrianteri Man, it is a joke, a statement and also a confession by plica06.
      A joke like Jeff's project. Which doesn't matter, because it is not about if it is useful or practical, but: Showing that it CAN BE DONE!
      BTW Great job Jeff and respect to plica06!:)
      P.S.: I get it that you are joking, too: "They wear more quickly ..." NO! They don't wear, because they aren't real! Hehehehe

    • @dieSpinnt
      @dieSpinnt 2 ปีที่แล้ว +1

      @@adrianteri On the other side, when I take your comment serious ... I can't take you serious:) A portable media, 2022 in the 2TB range ..... and you say spinning media? That must be some back-to-the-future joke I do not understand.
      Anyway, misunderstanding plica06's grotesque comparison and taking it as a spin(a straw-man!) to compare USB-Sticks[2] against enterprise level hard disks (as in the video, or consumer ones doesn't matter) is like comparing apples to hard drives. Enterprise level HDDs have to be compared to enterprise level SSDs. Latter one beat the first in every point, including price[1], reliability, data safety and energy efficiency in that sector.
      [1] The purchase price is simply irrelevant compared to the maintenance costs, since the former is already planned in the product and can simply be written off after the service life of the medium.
      [2] actually 2TB(etc.) NVME "USB-Sticks"(miniature-Adapters) exist. Try to beat their MTBF (vs. spinning media that IS transported around) ... ;)

  • @myrkat
    @myrkat 2 ปีที่แล้ว +117

    Pushing the limits of "hackey" tech is what most hardware and software engineers should be shooting for. Very well done. Kudos to 45 Drives!

  • @BeyondDuctTapeFixItRight
    @BeyondDuctTapeFixItRight 2 ปีที่แล้ว +32

    This is a bonkers crazy setup. It's so mad I just had to watch, 100%. Kudos to you for getting it to work at all. Your persistence is inspiring.

  • @Axel_Andersen
    @Axel_Andersen 2 ปีที่แล้ว +101

    I had this colleague who told this story:
    I had boss who did not understand anything about computer or electronics. When ever I was trouble shooting something he would come and watch over my shoulder and comment "Have you checked the power?" This was very annoying as he did not understand anything. What made it doubly annoying that his advice was spot on so many times.
    Check the power!!

    • @shobsdamemedawg
      @shobsdamemedawg 3 หลายเดือนก่อน +4

      damn.. i feel that

    • @henryyylight
      @henryyylight 2 หลายเดือนก่อน

      Did you check the power?

    • @shobsdamemedawg
      @shobsdamemedawg 2 หลายเดือนก่อน

      no

    • @miriko1297
      @miriko1297 2 หลายเดือนก่อน

      why

    • @shobsdamemedawg
      @shobsdamemedawg 2 หลายเดือนก่อน

      @@miriko1297 idk, just lazy i guess

  • @SaxaphoneMan42
    @SaxaphoneMan42 2 ปีที่แล้ว +1229

    the PetaPi seems like a winner, I can only imagine the folks at 45 drives watching this with a mix of awe and horror.

    • @hannes-
      @hannes- 2 ปีที่แล้ว

      PetaPite

    • @glenncaughey5044
      @glenncaughey5044 2 ปีที่แล้ว +28

      Just like watching two ships colliding. 😎🚢

    • @BradCozine
      @BradCozine 2 ปีที่แล้ว +58

      It IS a petabyte file server... but probably SHOULDN'T go with the name "PetaFile".

    • @michaelterrell
      @michaelterrell 2 ปีที่แล้ว

      @@BradCozine PETAFile, = a database of People Eating Tasty Animals!

    • @JasperJanssen
      @JasperJanssen 2 ปีที่แล้ว

      @@BradCozine petaPile?

  • @SeanHodgins
    @SeanHodgins 2 ปีที่แล้ว +430

    Do I need a petabyte of storage? No. Would I mount one in my rack? Yes.

    • @kevinbissinger
      @kevinbissinger 2 ปีที่แล้ว +37

      30 years ago: Do I need a Gigabyte of storage? No.

    • @monad_tcp
      @monad_tcp 2 ปีที่แล้ว +6

      I absolutely need it to store my GPT-3 training data. Turns out having 16 GPUs wasn't enough, but I run out of space even before running out of compute time. Its probably more on the scale of 1PB + 8192 GPUs.

    • @monad_tcp
      @monad_tcp 2 ปีที่แล้ว +1

      10 years from now, Do I need a Petabyte, yes.

    • @sparkyispog
      @sparkyispog 2 ปีที่แล้ว +7

      do i really need 128 gb of storage on my mac?
      "your mac is almost out of storage"
      wait- whY IS MY SYSTEMS FOLDER 70GB
      whY does microsoft word take up 2gb?

    • @geraldcreager4432
      @geraldcreager4432 2 ปีที่แล้ว +1

      @@kevinbissinger I still remember buying my first gigabyte hard drive. Kept a grad student in school.

  • @t7732155980
    @t7732155980 2 ปีที่แล้ว +6

    Even though the idea is crazy, Jeff knows his craft. The tips at time 09:00 on how to select hard drives for the task are priceless!

  • @BrenskiIP
    @BrenskiIP 2 ปีที่แล้ว +4

    As someone who is about to endeavor on a new NAS project, this was a fun watch! Thanks!

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +2

      just keep upgrading the drives, I'm sure there'll be 1 PB 3.5" drives in like 20 years 🤪

  • @twiceineverymoment
    @twiceineverymoment 2 ปีที่แล้ว +387

    "I was going to unbox this on camera but Fedex already did"
    As someone who's had multiple expensive items destroyed by Fedex and actively avoids doing business with companies that ship with them, I felt that...

    • @philiam0420
      @philiam0420 2 ปีที่แล้ว +19

      The reason for FedEx packages being more damaged than USPS packages actually isn't FedEx's fault. They accept heavier packages than USPS, so that means if a shipper doesn't pack their package correctly, when that 150lb box of farm tools slams into your 6lb box of plastic in the sorting machine, your package gets crunched. Meanwhile USPS only goes up to 50(?) lbs, so your box just doesn't get slammed as hard.
      TL:DR your packages get damaged in FedEx because the shipper didn't pack them right. Source: I ship around 5 packages a day with all 3 major US carriers and rarely have an issue since I actually package my shit

    • @twiceineverymoment
      @twiceineverymoment 2 ปีที่แล้ว +28

      @@philiam0420 That doesn't mean Fedex isn't partially at fault. Especially because on the rare occasion that I do get a damaged shipment from UPS, they leave a note with a number to call offering to pay for a replacement if the item is broken. Fedex just delivers my $300 PC case looking like a truck backed over it and acts like it's no problem. And that's without even mentioning the countless packages that never showed up at all, or that got delivered to the wrong address and sent me on a trip across town trying to find it.

    • @topquark22
      @topquark22 2 ปีที่แล้ว +6

      Yep. I had a laptop battery crunched by FedEx. It's a wonder that they can stay in business this way.

    • @atlantic_love
      @atlantic_love ปีที่แล้ว

      You just didn't package correctly, how is that FedEx's fault?

    • @adwilson0286
      @adwilson0286 ปีที่แล้ว +4

      In 2021 I had 5 items fulfilled through FedEx. 1 never arrived, 2 arrived damaged, 1 arrived late, 1 was early and undamaged.
      "20% delivery reliability!" 😵‍💫

  • @MarcoGPUtuber
    @MarcoGPUtuber 2 ปีที่แล้ว +184

    I think i heard Seagate in the US having a heart attack watching drives being juggled.
    I live in Taiwan.

  • @happilicious
    @happilicious 2 ปีที่แล้ว +381

    How about one controller to one pi, then stripe 4 pis together, that should increase the throughput and scale down the issue to a more managable chunk, and still a pi project.

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +190

      This is probably the most reasonable way to do it, and use Ceph (or another network-based filesystem)... and indeed I will be testing that out soon. Still bottlenecked but probably more reliable and would not run into as many errors on the PCIe bus!

    • @qazwsx000xswzaq
      @qazwsx000xswzaq 2 ปีที่แล้ว +16

      I think you will need more than 4 Raspberry Pis unless the pool is for backup or archiving purposes only.
      Even a low-to-mid end Intel or AMD system would make more sense.. though then the project will lose its “Piliness”. 😂

    • @paulz1780
      @paulz1780 2 ปีที่แล้ว +6

      @@qazwsx000xswzaq Wirh these Network Speeds you would need 1 Pi per Disk to max out the disks🤣

    • @qazwsx000xswzaq
      @qazwsx000xswzaq 2 ปีที่แล้ว +8

      ​@@paulz1780 We can then expose each drive as an iSCSI target and aggregate them at a server over ethernet. And viola we have a not-so-poor man's SAN haha. It reminds me of those shiny new network addressable NVMoF SSDs btw.

    • @jamesfmilne
      @jamesfmilne 2 ปีที่แล้ว +1

      Gluster might be a good option too.

  • @ChrisContin
    @ChrisContin 2 ปีที่แล้ว +2

    Fascinating! A lot of work! The issue it seems is having to tether to the hard-disk through a band shared with other drives, rather than universal. There is a technique where using data-points written to each hard-drive mathematical computations can rewrite and measure the data for inference machiningly- without even using the shared io-bus anymore! You’d see a slowdown in computation monologue but all speed ahead on as many drives as you want!

  • @jerryw1608
    @jerryw1608 ปีที่แล้ว +40

    Well, pairing €20.000 - €24.000 worth of drives with a 100 dollar raspberry seemed like a solid plan to start with. Having it run a 1pb raid0 config while hosting that storage seems like a task made for this little arm processor 😂 Nice to see you try out such extreme things with the raspberry😀

  • @45Drives
    @45Drives 2 ปีที่แล้ว +366

    Woah.

    • @iloveseals5208
      @iloveseals5208 2 ปีที่แล้ว +6

      woah
      you get seal of approval

    • @venus007e6
      @venus007e6 2 ปีที่แล้ว +2

      Woah.

    • @Todija
      @Todija 2 ปีที่แล้ว +2

      Woah.

    • @starling000
      @starling000 2 ปีที่แล้ว +2

      Woah.

    • @hk4524
      @hk4524 2 ปีที่แล้ว +1

      woahhhhhhhhhh

  • @AlbaxArcade
    @AlbaxArcade 2 ปีที่แล้ว +289

    I always like to think that every time Jeff publishes a new video, the raspberry pi design team feels a disturbance in the force.

    • @jstan5802
      @jstan5802 2 ปีที่แล้ว +15

      As they should, it's been 3 years, where's the raspberry pi 5?

    • @UnderEu
      @UnderEu 2 ปีที่แล้ว +2

      I find your lack of faith disturbing 😅

    • @Thinktank-rn6dm
      @Thinktank-rn6dm ปีที่แล้ว

      @@jstan5802 you're gonna be happy

    • @stickmanland
      @stickmanland 11 หลายเดือนก่อน

      @@jstan5802 Here.

  • @tyrdchaos
    @tyrdchaos 2 ปีที่แล้ว +115

    I hope LTT sees this. Great stuff as always, Jeff!

    • @LyokoisGreat2
      @LyokoisGreat2 2 ปีที่แล้ว +6

      I wondered how long it would take for LTT to be mentioned in the vid

  • @zb9458
    @zb9458 2 ปีที่แล้ว +3

    Jeff you never cease to deliver, you're an absolute legend, great video!!

  • @abdusaidabduraufov5615
    @abdusaidabduraufov5615 2 ปีที่แล้ว +2

    My God! 20TB 60HDD I never dreamed of such a volume

  • @whitey4986
    @whitey4986 2 ปีที่แล้ว +37

    Hey Jeff, I’ve been using your ansible roles for almost 10 years. Love seeing you around HN and the other traps, great to see your TH-cam channel is doing so well. Very cool projects!

  • @CLU2O10
    @CLU2O10 2 ปีที่แล้ว +292

    Linus would be proud

    • @hardwarefromthegarbage3446
      @hardwarefromthegarbage3446 2 ปีที่แล้ว +42

      Torvalds too. Nice example of the Linux versatility

    • @shivsankermondal
      @shivsankermondal 2 ปีที่แล้ว +16

      electroboom too.

    • @TheBacktimer
      @TheBacktimer 2 ปีที่แล้ว +9

      I was waiting for the water bottle :D

    • @rvmiv_
      @rvmiv_ 2 ปีที่แล้ว +6

      It's a good example of why the ltt pedibyte project is so expensive

    • @subhimesto7123
      @subhimesto7123 6 หลายเดือนก่อน

      ​@@rvmiv_
      First of all and most importantly, the speed, those were gen 4 nvme ssd drives with extreme speeds, second of all this is just a storage rig but what ltt built is a server, I mean have you seen how they run NASA simulation?
      And lastly there drives are extremely reliable

  • @KlausWulfenbach
    @KlausWulfenbach 2 ปีที่แล้ว +264

    1942: "We need to figure out a solution to digitally store dozens of bytes at a time. Vaccuum tubes, maybe? This is going to cost us millions, but it will be worth it to finally have accurate artillery range tables!"
    2022: "I'm going to hook up this petabyte of data storage to this cheap single board computer!"

    • @vaisakh_km
      @vaisakh_km 2 ปีที่แล้ว +41

      1969: "we need to figure out how to use our cutting edge 1.5 million doller, 4kb ram 32kb harddrive to bring people moon"
      2025: lauching a rocker with a single board computer...

    • @Nordlicht05
      @Nordlicht05 2 ปีที่แล้ว +6

      Wait until the average person doesn't click on a video below 16k 120fps. There will be always a way to fill it 😅 but I remember it's gotten better over time.

    • @NovemberOrWhatever
      @NovemberOrWhatever 2 ปีที่แล้ว +11

      @@vaisakh_km Pi's and Arduino's are already used for avionics on model rockets, and cubesats can have total build costs of like $50,000. It's amazing how far the industry has come and is going

  • @Bronathan
    @Bronathan ปีที่แล้ว +2

    This showed up in my recommendations. I have no clue what happened here and I know nothing about data managment or IT in general. But I watched it to the end not realizing that this vid is 22 minutes long 😅good content 9/10

  • @EDATEC
    @EDATEC ปีที่แล้ว +1

    It's impressive that this works at all.

  • @jmr
    @jmr 2 ปีที่แล้ว +114

    You tried all the things I wanted to see. You know your audience!

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +21

      The last thing I wanted to try (even had it in the final edit but cut it for time) was 4x hardware RAID cards... but I only have one on hand. I was thinking of setting up 4 hardware RAID 6 arrays, then uniting them on the Pi as a RAID 0 array and seeing if that performed better since individual drives would all go through one HW raid card, and that would also give redundancy.
      (And who said hardware RAID is dead? You still need it if your computer performs like one from 2010!).

    • @Winnetou17
      @Winnetou17 2 ปีที่แล้ว

      @@JeffGeerling Yeah, exactly, who said hardware RAID is dead ? Clearly unrelated, did you ever thought of doing a collaboration with Wendell ? Him and Red Shirt Jeff would surely push themselves to insanity :D Seriously speaking you two seem to do similar exploratory courses, though, of course, he's much less Pi-centric.

  • @WildBikerBill
    @WildBikerBill 2 ปีที่แล้ว +5

    When I first saw the title I figured this would be one of your massive multiprocessor/multi-Pi projects combined with a massive amount of storage. Something closer to 20 TB per Raspberry Pi. At 60 Drives and 60 Raspberry Pi's, that would still be way beyond any normal homebrew project.

  • @MichaelDude12345
    @MichaelDude12345 2 ปีที่แล้ว +60

    I saw you posted some on the homelab subreddit last week, THIS is what you were hiding from us?? What a fun idea. Can't wait to see your next project!

  • @InterprisesTV
    @InterprisesTV 2 ปีที่แล้ว +3

    Jeff, I'm catching up with thx, since I've admired your ability for a while. I am an old geek and you do magic. Reminds me of my S-100 days with CromixOS.

    • @spoils8179
      @spoils8179 ปีที่แล้ว

      1 month ago and no recognition? Especially for a $50 USD donation? Sadge.

    • @InterprisesTV
      @InterprisesTV ปีที่แล้ว

      @@spoils8179 Thanks for the sentinent, Aiden, but no problem. Hope he's doing well, and you as well for that matter. 👍

  • @communitycollegegenius9684
    @communitycollegegenius9684 2 ปีที่แล้ว +1

    I just completed a 24 X 18T build that's almost 1/2 PB. I bought the drives a few at a time all Segate recertified it seems to be the sweet spot in price for me. I had to upgrade and rebuild things several times. Your video was just like my experience switching OS, FS, etc. I had endless drives/arrays just dropping out, mostly on startup. I tried Fedora, Centos, Open Suse, Suse JeOs, and Ubuntu Server all let me down for one reason or another. I tried them with various shares, raid arrays and file systems; plus not all would run my app. I ultimately got it working with Ubuntu workstation and the Ubuntu share - no samba. I'm not happy with ZFS. I had to kill the swap and add more RAM (which meant a new motherboard) to keep ZFS cache from crashing. I solved the dropouts problem by putting the drives really close to the host board and using short/expensive data cables that are all the same length. I'm using a very old Athlon FX-8300 8 core and 64 gig of Ram. I found a great last generation Adaptec 52445 raid card new old stock. I had to install 2 power supplies and rewire one to all molex to get enough amperage on the 5 volt rail. New 1200 watt supplies have plenty of 12v but almost no 5v power. I also upgraded to 2.5 gbps network card. The write speed is NOT stellar. With Raid0 it goes real fast at first 450mbps filling all that cache, but slows down to about 150mbps. With a single JBOD I only get 130mbps, two drives at the same time still go 130 each, and I can transfer to 4 drives at the same time before it bogs down the network and write speed drops to 70 each drive or 280 total. I already got 4 sas expanders and plan to continue adding drives (and power supplies) up to 2.5 PB. My box is an old IBM 2401 tape drive converted to rack space. I yell at the You Tube screen, not my computers. That's not true I also yell at my computer at work (it's windows).

  • @scbtripwire
    @scbtripwire 2 ปีที่แล้ว +27

    Oh my goodness, that juggling of those drives.😳

    • @wayland7150
      @wayland7150 2 ปีที่แล้ว +30

      Every IT engineer has a stack of broken hard drives just for juggling with.

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +10

      Haha true.

    • @SyphistPrime
      @SyphistPrime 2 ปีที่แล้ว +7

      @@wayland7150 yep, pretty much. My old boss when I worked in computer repair used laptop hard drives to level out his microscope. Funny thing is the drives weren't even dead, they were just something like 160GB 5400RPM drives that were more useful for that task than storing data.

  • @maxd7228
    @maxd7228 2 ปีที่แล้ว +4

    Jeff, you have carved yourself a niche channel in an overcrowded tech community, taking the Pi to new heights in every video. Keep it up. Loving every video. I'm left astounded on the Pi's capabilities and untapped potential.

  • @haidenshober6732
    @haidenshober6732 2 ปีที่แล้ว +23

    Many Pi’s running something like Minio might allow for some interesting single box hardware redundancy. Also might be able to get over the 1Gb limitation since you’d have many pi’s each with their own 1Gb connections.

  • @thenoisyelectron
    @thenoisyelectron 2 ปีที่แล้ว +4

    There's something about seeing you go from putting the last drive in the rack to immediately plugging in the dinky micro SD card that makes me giggle 😃

  • @adrianaa3059
    @adrianaa3059 ปีที่แล้ว +2

    I think Arthur C Clarke hypothesized that this would be enough to store a few people's minds into it

  • @crashmatrix
    @crashmatrix 2 ปีที่แล้ว +10

    Well Jeff, one thing's for sure, if you and others aren't pushing the pi to bleed on the edge, no progress will ever be made in this direction. I'm not sure what kind of useful stuff this direction will yield, but it surely will yield something. Keep on pushing ya madlad!

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +6

      My hope is the next Pi at least has the PCIe bus bugs sorted so any card will 'just work'. After that, any more bandwidth they could squeeze out would be appreciated.
      The CM4 is actually great for many 1 Gbps network use cases-but with a little more bandwidth, it could be great for 2.5 Gbps (or heck, more than that if we're dreaming!).

    • @levygaming3133
      @levygaming3133 2 ปีที่แล้ว

      @@JeffGeerling with a USB3.something port, it could get almost-5Gbe, so not that much of a stretch for faster than 2.5gbe

  • @devnol
    @devnol 2 ปีที่แล้ว +7

    One thing I saw you doing when wiring up the NAS was that you connected the molex adapters to two sata power connectors on the same line. If the psu has another line, try connecting to that, as each line can draw a limited amount of current and there might be an issue there. Then again, the issue might be anywhere else but that's a thing you can easily try

  • @TheOleHermit
    @TheOleHermit 2 ปีที่แล้ว +39

    Glad to see Red Shirt Jeff back. I was wondering what happened to him.
    Geez, your projects are soooo extreme and cutting edge. Your troubleshooting processes are very informative and helpful to your viewers, who otherwise wouldn't have a clue where to look. We never see that on "How to" setup videos, where everything just automagically works.
    BTW, I recently used Styrofoam to separate 2 prototype boards while testing them out. It got hotter than expected and the foam sagged, allowing the power rail of one board to touch the other board, which fried as soon as I powered up the following morning.
    Don't try this at home, folks. Cardboard insulation good. Styrofoam bad.
    Thanks for sharing Jeff. I'm fully confident that you'll get your bandwidth up on the 60 HDD RAID. Looks to me like you're already nearly there.
    Instead of installing it inside a server rack, perhaps a locked cage would be a better idea to protect it from Red Shirt Jeff. 😎

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +5

      Heh, cardboard insulation 'better', but I now have it on a 3D printed box that's a little more secure too.

    • @TheOleHermit
      @TheOleHermit 2 ปีที่แล้ว +2

      @@JeffGeerling Good point. 👍
      Now, my 2nd attempt is safely mounted inside a custom acrylic case, too. One less thing to worry about, amongst the multitude of other potential mistakes. Live & learn, eh?
      Just gotta keep putting one foot in front of the other until the final goal is achieved, right?
      The hardware for the Teensy Laser Synth waveform module w/ILDA DAC is complete. Only need to flush out the code to full functionality, before moving on to the custom MIDI controller.
      BR 😎

  • @AFA_TRES
    @AFA_TRES 2 ปีที่แล้ว +2

    In a few years I can imagine an image comparing this to a micro sd card and a caption saying “this used to be a petabyte in 2022”

  • @mrcade2591
    @mrcade2591 3 หลายเดือนก่อน +1

    finally a way to store warzone updates, updates will take a shit ton of time cuz its hdds though

  • @mahtin
    @mahtin 2 ปีที่แล้ว +11

    Even before doing a raid or zfs test; I’d have run a single drive test (either benchmark or simple linear read/write). Loop that 60 times and see success. Then repeat with two drives in parallel and loop 30 times. Redo with three in parallel 20 times, etc etc. When you start seeing failures you really will see the cause-effect point. There is technically no reason why this won’t work with 60 drives - if you ignore performance. Any bugs that are exposed that can be fixed will simply improve the base users world. This is an awesome test that pushes the RPi and kernel to the limit. Making this work at that limit helps all of us just running one drive. Plus, the errors you saw, as bad as they were, should somehow restart the drive (without a reboot).

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +6

      When those errors occurred, the HBA reset itself and the drives always came back-at least 15/16 of them!
      That was one concern from the Broadcom engineer I spoke with, and the reason he really wanted me to run the latest firmware. Unfortunately due to time constraints I couldn't flash all the cards in a separate PC then bring them back to the Pi and re-test. But I plan on trying that out.

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว

      Extra testing has been done-tl;dr the breaking point is 3 cards (or more than 30 direct attached drives). But forcing PCIe Gen 1 speed also fixes the issue. More to come in my next video!

  • @caseyhefner1966
    @caseyhefner1966 2 ปีที่แล้ว +25

    Peta-Pi go BRRRRR
    Im genuinely surprised this worked, very impressive. As for what to do with it, a video on downloading/hosting a local copy of Wikipedia would be pretty cool.

  • @SpiritmanProductions
    @SpiritmanProductions 2 ปีที่แล้ว +100

    Maybe the CABD order is like the firing sequence of a 4-cylinder petrol engine lol 🤔

    • @falxonPSN
      @falxonPSN 2 ปีที่แล้ว +6

      The amount of power this thing draws could probably be measured more easily in horsepower, so you're not wrong there!

    • @JasperJanssen
      @JasperJanssen 2 ปีที่แล้ว +4

      @@falxonPSN the PSU isn’t that big - a horsepower is roughly 750W, so it’s going to be around the 1-2 mark.

    • @falxonPSN
      @falxonPSN 2 ปีที่แล้ว +3

      @@JasperJanssen fair enough. I can't argue with good pedantry! 🤪

    • @fohkukohgeki
      @fohkukohgeki 2 ปีที่แล้ว +2

      @@JasperJanssen Could run a petabyte server off a lawnmower engine...

    • @falxonPSN
      @falxonPSN 2 ปีที่แล้ว

      @@fohkukohgeki this is the project we need to see!

  • @red03golf
    @red03golf ปีที่แล้ว

    Jeff - I mourn you missed seizing the opportunity to officially name this The Pi-tabyte Project - a portmanteau teed-up for a long drive, but you duffed it, lol. - great vid, keep 'em coming, I'm having a grand time doing some of your projects. Cheers.

  • @stuffinfinland
    @stuffinfinland 2 ปีที่แล้ว

    Yet another absurd carage setup. Just love it!

  • @b00573d
    @b00573d 2 ปีที่แล้ว +37

    Be careful with those sata to molex adapters...they are prone to fires!

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +17

      Red Shirt Jeff liked this comment.

  • @MarcoGPUtuber
    @MarcoGPUtuber 2 ปีที่แล้ว +13

    Jeff Geerling: My storage setup registers on the Richter Scale

  • @MrSnuffyX
    @MrSnuffyX 2 ปีที่แล้ว +4

    It almost hurts to watch the board being removed. I've been wishing for a storeage like this for decades.
    Still a very interesting project.

  • @markonfilms
    @markonfilms 2 ปีที่แล้ว +1

    This is an awesome project. Makes me wanna build a large storage server.

  • @rollerboogie
    @rollerboogie 7 หลายเดือนก่อน

    As an HDD engineer usually drives are built to compensate for vibration due to certain fan RPMs. Especially if we have a big customer we'll optimize things for the frequencies of vibration in their trays.

  • @jaxxarmstrong
    @jaxxarmstrong 2 ปีที่แล้ว +24

    60x USB-connected HDDs... I'm telling you, a missed opportunity :D

    • @marcogenovesi8570
      @marcogenovesi8570 2 ปีที่แล้ว +5

      Yeah use the maximum possible number of USB hub daisy chaining

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +22

      Heh, the USB controller would probably just set itself on fire!

  • @nezu_cc
    @nezu_cc 2 ปีที่แล้ว +27

    My NAS is a pi4 with a desoldered USB chip to expose the PCIe bus. It's connected to a PCIe switch(to improve transmission and prevent crashes) and then to a cheap ASMedia SATA controller. The kernel is also patched to force PCIe gen 1 to prevent crashes(this is probably the same thing that happened to you in the video btw). PCIe gen1 is still faster than gigabit so it doesn't matter. Over samba or NFS I can get the full gigabit speed even on large data transfers so no bottlenecks there. Would I recommend this setup? no. Does it work? hell yeah (longest uptime was like 90 days or something and then a power outage killed it, I need to get(or more likely make) a UPS, I know). If anyone wants photos lmk

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +6

      Nice. Coreforge also suggested forcing Gen 1 speeds elsewhere in the comments, so I may need to test that out.

    • @NoIPHU
      @NoIPHU 2 ปีที่แล้ว

      @@JeffGeerling That was my first idea as well. Those switches are used to run gen1 or maybe max gen2, when used for GPU mining. Above that it will throw errors.

  • @Neilhuny
    @Neilhuny 2 ปีที่แล้ว +15

    Absolutely fascinating and completely, totally barmy! What an immense amount of work you obviously put in to this - research, sweet talking 45Drives, research, RAID solutions, research, talking to 45Drives techs, research, etc.
    I am *extremely* impressed, both by you AND by 45Drives for their courage!
    And my abiding thought? "Chassis" is pronounced "shassey", not "chassey"! Pfft!
    So I looked up chassis in Cambridge Dictionary to back my obviously accurate opinion and ... well, blow me, North Americans really do say "chassey"!
    Every day is a learning day! Pedantry isn't good

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +2

      lol we North Americans are weirdos. Or maybe you are... I guess it's a matter of perspective, neighbour!

    • @Neilhuny
      @Neilhuny 2 ปีที่แล้ว

      @@JeffGeerling I have to picture you in dungarees, a battered straw hat, and with a long stalk of grass hanging from you mouth when you call me neighbour! Hock-diggardy, or something like that

  • @badwolf1984
    @badwolf1984 2 ปีที่แล้ว

    Ripps out high end setup for a PI! You Monster! Though a Fun Project to play with a pi, maybe in the future with a PI 14 hope you enjoy your new Perabyte Server

  • @bryanenglish7841
    @bryanenglish7841 2 ปีที่แล้ว

    You are an absolute madman and I'm rooting for you the whole way

  • @JPToto
    @JPToto 2 ปีที่แล้ว +18

    This is amazing. Good job charming 45drives into sending the case! They must be Red Shirt fans 🤣

  • @braydennturner
    @braydennturner 2 ปีที่แล้ว +5

    Amazing content as always, keep up the great work!

  • @hoagy_ytfc
    @hoagy_ytfc 2 ปีที่แล้ว +5

    Jeff doesn't have the word "why" in his vocabulary :)
    (Just kidding, things like this are fun, which is "why" enough for me).

  • @Marco-gx7ok
    @Marco-gx7ok 2 ปีที่แล้ว

    I am afraid someone addicted to this work is called a petaphile person! 😱 But great, that there are people doing these kind of projects!

  • @Captn_Grumpy
    @Captn_Grumpy 2 ปีที่แล้ว

    OMG the original hardware is so beautiful it brought a tear to my eye to see it removed. Given the resources I would have 2 or 3, one for local redundancy and one for off site but given net speeds the off site would mostly be pointless. And no I don't need one, I just want one so bad it feels like I need it :)

  • @zambonidriver42
    @zambonidriver42 2 ปีที่แล้ว +8

    I have smaller versions of those EXOS drives.
    >200 drives, over the past 5 years, I’ve had 4 failures. 3 covered under RMA. 1 had just expired.

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +5

      For every anecdote (usually it's "all my Seagate drives exploded in giant fireballs!"), there's an opposite anecdote. In aggregate, if drives like these were truly failing at the rates some people think, Seagate would not be in business :)

    • @llortaton2834
      @llortaton2834 2 ปีที่แล้ว +5

      @@JeffGeerling The reason all of this information was popularized in the first place is because of backblaze's reports (back in the day) but if we look at their stats now, year over year, seagate is constant.

    • @marcogenovesi8570
      @marcogenovesi8570 2 ปีที่แล้ว +4

      EXOS and businness drives in general are fine, it's the consumer lines that are more "hit and miss", but even then it's easy to find a pattern unless you buy hundreds of them. I.e. a brand doesn't just "consistenly fail 4x more than another"

  • @MegaManNeo
    @MegaManNeo 2 ปีที่แล้ว +9

    **CM4:** Hey Jeff! What are we going to do today?
    **Jeff:** You will handle 60 enterprise grade hard drives.
    **CM4:** Oof

  • @falcychead8198
    @falcychead8198 2 ปีที่แล้ว +4

    As a St. Vincent fan, I'm calling it "Pietabyte" whether anybody else does or not.

  • @adamfoxton6341
    @adamfoxton6341 ปีที่แล้ว

    "Somehow I convinced 45 Drives to send me the server AND all these hard drives"
    The impressive bit to me is that you were able to get hold of a CM4!

  • @marcogenovesi8570
    @marcogenovesi8570 2 ปีที่แล้ว +6

    45drives marketing team is on a roll

    • @45Drives
      @45Drives 2 ปีที่แล้ว +4

      hi

    • @wayland7150
      @wayland7150 2 ปีที่แล้ว

      Were they taking ab big chance? What if Jeff had proved you could do it all better with a PI? Hahaha.

    • @YeOldeTraveller
      @YeOldeTraveller 2 ปีที่แล้ว

      @@wayland7150 Not much of a chance. They are well aware of the performance of their product, and the limitations of a Pi.

  • @Space_Reptile
    @Space_Reptile 2 ปีที่แล้ว +15

    Next step: 3.14PB on a PI

  • @davidmcken
    @davidmcken 2 ปีที่แล้ว +16

    I'd almost be interested in seeing each raid card assigned to 1 PI and then them clustered together.

  • @johnnytarponds9292
    @johnnytarponds9292 2 ปีที่แล้ว +1

    Hey! 45 Drives is here in my home town!

  • @marcodebortoli
    @marcodebortoli ปีที่แล้ว

    It was only about time that the new storage measurement of reference was going to be the petapite... I'll show myself out :D

  • @Finkelfunk
    @Finkelfunk 2 ปีที่แล้ว +6

    If lsblk starts to run out of letters and shows drives as "sdaa, sdab, sdac" etc. you know you have a data hoarding problem.

  • @jbrown-acuity
    @jbrown-acuity 2 ปีที่แล้ว +5

    Would love to see this with a RISC V processor

  • @zambonidriver42
    @zambonidriver42 2 ปีที่แล้ว +6

    How many times did you recompile the kernel?

  • @miolini
    @miolini 11 หลายเดือนก่อน

    Integrating a Pi compute module with Ethernet into each hard drive can make scaling easier. It lets each drive connect to a network independently, simplifying data handling in big storage setups.

  • @tankgrrl
    @tankgrrl 2 ปีที่แล้ว

    You gotta admire a guy who takes $50k worth of server and... plops a Raspberry Pi in it. Brave and crazy. :)

  • @Jimmy_Jones
    @Jimmy_Jones 2 ปีที่แล้ว +7

    "I grabbed a small piece of cardboard to insulate the boards from each other. At least temporarily."
    Yeah. Sure.
    I think red shirt Jeff was trying the break back into the room.

    • @wayland7150
      @wayland7150 2 ปีที่แล้ว +1

      You can see red shirt Jeff is actually in the room with Jeff at one point in the video.

  • @popcorny007
    @popcorny007 2 ปีที่แล้ว +5

    They must really trust you to borrow $35k+ worth of disks

    • @JeffGeerling
      @JeffGeerling  2 ปีที่แล้ว +5

      s/'borrow'/'juggle' :D

  • @loganiushere
    @loganiushere 2 ปีที่แล้ว +10

    it’s a shame you didn’t call it “PetaPi”

  • @Echobar
    @Echobar 2 ปีที่แล้ว

    Jeff another wonderful video. Keep up the great work.

  • @insanehd2940
    @insanehd2940 ปีที่แล้ว +1

    I would love to see a reboot with the Pi5

  • @jeffreyumeh8580
    @jeffreyumeh8580 ปีที่แล้ว

    Hu, I know that modern cases have that 1 stud so you can place your motherboard in on the stud which makes it easier to align the rest of the screws, but even then every motherboard I have installed has had 8 - 9 screw holes, sometimes there is a heatsink or something in the way of having all 9 screw holes, but 8 is the minium I have seen, this is only for full ATX though not ITX / mATX.

  • @km6mmo843
    @km6mmo843 2 ปีที่แล้ว

    Yay! Tiny computers doing huge things!!! 🔥🔥🔥

  • @ronguin7062
    @ronguin7062 2 ปีที่แล้ว +1

    This guy is the only human capable of winning in a debate with Data from star trek.

  • @funnygrunt_o7
    @funnygrunt_o7 ปีที่แล้ว +1

    dude that hard drive juggle made me freak out a little

  • @cervyvin
    @cervyvin ปีที่แล้ว

    Very interesting watching "The PP Project"! :D

  • @mamayl8592
    @mamayl8592 9 หลายเดือนก่อน

    When you're testing tech and it registers on a seismometer, you've accomplished something.

  • @thomasgoirand488
    @thomasgoirand488 ปีที่แล้ว

    Hi there! I work for Infomaniak, and I am managing storage networks. We offer backup services and connected drive (we call it kDrive, it's a bit like google drive, just we protect your data and they are hosted in Switzerland).
    Adding 96 Exos 20TB HDD in my swift storage cluster is what I do every day. I manage "moderately large" Swift clusters. On them, we add 6 2U servers at a time, each of them holding 16 HDD (so 6*16 = 96 HDDs). In total, I calculated that I am managing more than 5000 spinning drives in our OpenStack swift clusters, which amounts probably around 100PB (so not just 1 PB like you're doing...). Just this week, I added 12 HDD storage nodes, and 6 proxies (with 2x SSD each, plus the system drive), and 9x NVMe storage nodes (10 NVMe each, to be used in a Ceph cluster).
    Your idea of a Raspbery-Pi is fun, but instead of one RPi for all HDDs, I would setup one RPi *PER* HDD, and then it all makes sense, and you may have decent speed.
    BTW the casing you bought seems of very bad quality compared to what we get from HPe or Lenovo.
    I've been running this service for nearly 5 years now, and we never lost a single bit of data... :)

  • @ninguern7693
    @ninguern7693 2 ปีที่แล้ว

    1:52 I remember when people were saying that about the terabyte. Technology is crazy.

  • @yourdogsnews
    @yourdogsnews 2 ปีที่แล้ว

    Phillips heads, you savage. Robertson all the way baby

  • @Unidentified7002
    @Unidentified7002 2 ปีที่แล้ว

    I'M SO INTERESTED IN THIS PETABYTE THING

  • @_caith
    @_caith 9 หลายเดือนก่อน

    I built my first datahoarder set up using nothing but 8tb SSDs XD a whole bunch of them. solves the issues of "spinning" in an easy way

  • @C21H30O2
    @C21H30O2 8 หลายเดือนก่อน

    I understood about 3% of this video. But it was still interesting. Great performance.

  • @volvaary2724
    @volvaary2724 ปีที่แล้ว

    Thanks for this idea to store my stuff.

  • @ksp2viking
    @ksp2viking 10 หลายเดือนก่อน

    Omg... 11:42
    The hilarity of having all that storage and still booting from a micro SD card :D

  • @weimenli7342
    @weimenli7342 ปีที่แล้ว +1

    When the price of hard drives keeps falling, and the price of Raspberry Pis keeps riding.

  • @pantegministries
    @pantegministries ปีที่แล้ว

    I have a Pi running RAID1 with 2 External SSDs and on times I have to reboot as it seems to forget that the disk exists. That's Pi 4 and Samba and a Window share. So to get as far as you did its amazing.

  • @ryanskewer1534
    @ryanskewer1534 ปีที่แล้ว

    PPP is a great acronym, so I stand by Petabyte Pi Project

  • @Jamie-0liv3
    @Jamie-0liv3 ปีที่แล้ว

    The linus video is literally in the other tab on opera. I was bouta watch it next.

  • @percivul1786
    @percivul1786 หลายเดือนก่อน

    The problem you're facing is that USB 3 connection to the board. The way the Pi interprets all of that data is time sensitive. Basically, when the USB 3 connection to the board and then through the kernel saturates, it only gives the data in the buffer so much time to be read/written and when that time expires, it basically dumps the packets and then goes back to look for more data. The BIG problem here is the IRQ signals are ALSO going through that port to the PCIe Expansion board. This is why your drives will just disappear after awhile.
    I ran into the same problem setting up a Pi driver 70TB NAS at home and in the end, I had to basically setup a CRON job to monitor for things like dropped IRQ's and whatnot and if there were any hardware failures, it would force a reboot.
    Bottom line is that yes, you can connect allot of hardware through expansion boards and use USB 3 connections to bridge it all, but that is going to be VERY flakey at best.

  • @97oweb
    @97oweb 2 ปีที่แล้ว

    The real impressive thing ist that he was able to get a pi compute module 4
    I am trying to find one now for almost a year but the only place I even found one listed was on Amazon in the us, it is sold out since then, here in europe I did not even find a seller who listed it

  • @xWatexx
    @xWatexx ปีที่แล้ว

    Dude can fit 0.01% of the internet on this damn thing. Insane.