Dad! Where did my gaming rig go!!! Now listen up there Junior, this is for science. Just don't tell your Mum. And you can have the car keys for Saturday.
What I see is 1/3 working time caused by using a model built on close to 6 times the data points. That's nice! And... Enabled by distributed mode. Am I correct? Is there a way that the quant factor affects the computation of those general factors (time/tokens vs size) that would make this more 1:1? Already it's nice you can get the big models in.
Congrats! You unlocked a masssive achievement running this on your own hardware!!! All hail the AI and the kilowatts we feed them
Skynet is possible in your basement 🦾
Dad! Where did my gaming rig go!!! Now listen up there Junior, this is for science. Just don't tell your Mum. And you can have the car keys for Saturday.
Better than stealing their GPU's out of their rigs right? 😂
I liked every comment on this video!
Thanks!
What I see is 1/3 working time caused by using a model built on close to 6 times the data points. That's nice! And... Enabled by distributed mode.
Am I correct? Is there a way that the quant factor affects the computation of those general factors (time/tokens vs size) that would make this more 1:1?
Already it's nice you can get the big models in.
Really impressive, congrats ! Do you know the impact of a limited PCIe bus (1x , 4x GEN3) for those GPU cards ?
Would you be able to make a tutorial on getting lovalAI working in kubernetes?
Sure, I think that's overdue at this point!
wild. does this work for llava images too?
Damn
What's the network bandwidth? I wonder what could be done if you connected to a bunch of buddies with gigabit symmetrical fiber connections.
As much as you can pump for distributing the model - during inference it's really only about 10-20 MB/s per node
around 4 Times faster than cpu only... But around 100x more expensive...
It's just money and power! like always.... 😁
First
Congrats!
It would be more intelligible if your results mention (Higher is better or Lower is better) beside the chart headings.
Thanks for the feedback!