
This failed when I put in an Australian postal code.

Shell programming is high-density inter-language glue. You simply have more implementations to call out to, and so less to write yourself.

I can trivially combine a tool written in Rust with one written in JS/Java/C/whatever without writing bindings.
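As a sketch of what that glue looks like in practice (the Python one-liner here stands in for any Rust/JS/whatever binary on your PATH):

```shell
# Pipes are the only "binding": a Python one-liner feeds awk (written
# in C) which feeds sort (also C) -- no FFI, no wrapper code, just
# byte streams over stdin/stdout.
python3 -c 'print("\n".join(["banana", "apple", "cherry"]))' \
  | awk '{ print toupper($0) }' \
  | sort
```

Swap in ripgrep (Rust) or a Node script and nothing about the composition changes.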


First thing a public SpaceX would want to do is sell off all the non-SpaceX crap.


A public SpaceX will still be run by Musk. A public SpaceX would have to sell assets like X at a huge loss given its debt load, which would also take a propaganda machine out of Musk’s hands.

They’re stuck with those assets.


This sort of thing will be great for the SpaceX IPO :/


Especially if contracts with SpaceX start being torn up because the various ongoing investigations and prosecutions of xAI are now ongoing investigations and prosecutions of SpaceX. And then new lawsuits over the conflict of interest created by the merger.


Running llama.cpp rather than vLLM, it's happy enough to run the FP8 variant with 200k+ context using about 90 GB of VRAM.


Yeah, but what did you get for tok/sec there? Memory bandwidth is the limitation with these devices. With 4-bit I didn't get over 35-39 tok/sec, and averaged more like 30 when doing actual tool use with opencode. I can't imagine FP8 being faster.
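A rough back-of-envelope for why bandwidth dominates: if decode is purely memory-bound, each generated token has to stream the active weights through memory once. The bandwidth figure and parameter counts below are assumptions I'm plugging in for illustration, not measurements:

```python
# Toy estimate: tok/s upper bound if decode only streams the weights
# (ignores KV cache reads and all compute/overhead).
def decode_tok_s(bandwidth_gb_s: float, active_params_b: float,
                 bytes_per_param: float) -> float:
    bytes_per_token = active_params_b * 1e9 * bytes_per_param
    return bandwidth_gb_s * 1e9 / bytes_per_token

# Assumed: ~273 GB/s unified memory; a ~5.1B-active MoE at 8 bits
# vs. a 70B dense model at ~5 bits per weight.
print(round(decode_tok_s(273, 5.1, 1.0), 1))   # MoE: only active experts stream
print(round(decode_tok_s(273, 70, 0.625), 1))  # dense: every weight, every token
```

Real numbers come in well below these bounds, but the ratio between MoE and dense tracks what people actually see.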


You can run large-ish MoE models at good speeds; gpt-oss-120b, for example, is snappy enough even with a big context.

But large and dense at the same time is a bit slow.

Running a local LLM will cost a load of money for something much slower than the API providers, though.


Makes sense regarding the MoE performance. I am not sure the cost argument holds up for high-volume workloads, though. If you are running batch jobs 24/7, the hardware pays for itself in a few months compared to API opex. It really just comes down to utilization.
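The utilization point can be made concrete. Every number below (hardware price, power draw, electricity rate, API price) is a made-up assumption, just to show the shape of the math:

```python
# Months until hardware cost equals API spend for the same token volume.
def break_even_months(hw_cost: float, power_w: float, kwh_price: float,
                      tok_s: float, api_price_per_mtok: float,
                      utilization: float = 1.0) -> float:
    tokens_per_month = tok_s * utilization * 3600 * 24 * 30
    api_cost_per_month = tokens_per_month / 1e6 * api_price_per_mtok
    power_cost_per_month = power_w / 1000 * 24 * 30 * kwh_price
    return hw_cost / (api_cost_per_month - power_cost_per_month)

# e.g. a $4000 box at 300 W, $0.30/kWh, sustaining 30 tok/s against
# a premium model priced at $10 per million output tokens:
print(round(break_even_months(4000, 300, 0.30, 30, 10.0), 1))
```

Note the result is very sensitive to the API price: against a cheap $1/Mtok model the same box takes decades to pay off, which is exactly the utilization argument cutting the other way.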


Do you have specific t/s numbers for those dense models? I'm curious just how severe the memory bandwidth bottleneck gets in practice.

I'm not sure I agree on the cost aspect though. For high-volume production workloads the API bills scale linearly and can get painful fast. If you can amortize the hardware over a year and keep the data local for privacy, the math often works out in favor of self-hosting.


For Qwen2.5-72B-Instruct-Q5_K_M at 32k context, I fed it a 26k-token file (a truncated fiction novel) and asked it to summarize; it processed the input at 224 tok/s and generated output at 3 tok/s. Not really good enough for interactive use without frustration: not just from watching it reply, but also the long wait for it to actually read the book.

On the same hardware with gpt-oss-120b at 128k context, I fed it a longer version of the input (a whole novel, 97k tokens), and it processed the input at 1650 tok/s and generated output at 27 tok/s. Just fast enough, IMO.
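Plugging those measured rates into an end-to-end latency estimate shows why one feels usable and the other doesn't. This is just the arithmetic from the two runs above, with an assumed 500-token summary as output:

```python
# End-to-end wait = prompt processing time + generation time.
def total_seconds(prompt_tok: int, prefill_tok_s: float,
                  out_tok: int, decode_tok_s: float) -> float:
    return prompt_tok / prefill_tok_s + out_tok / decode_tok_s

dense = total_seconds(26_000, 224, 500, 3)    # Qwen2.5-72B run
moe = total_seconds(97_000, 1650, 500, 27)    # gpt-oss-120b run
print(round(dense / 60), round(moe / 60))     # minutes of wall-clock wait
```

So roughly five minutes versus about one, even though the MoE run read nearly four times as much text.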


We may as well have the LLMs use the hardest, most provably correct language possible.


What is the article text, for those stopped by the paywall?



It really feels like more than 1 in 20 driving around the 101/280.


Probably because Santa Clara County has more EV sales than its neighbors, according to this map:

https://www.energy.ca.gov/data-reports/energy-almanac/zero-e...


And newer cars get driven more than old cars on average, so if 1 in 20 cars is an EV, they'll account for more than 1/20th of the miles.
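A quick illustration of that weighting; the annual-mileage figures are made up:

```python
# Share of total miles driven by EVs if they are 5% of the fleet but
# are driven more per year than the average older car.
ev_share, ev_miles, other_miles = 0.05, 14_000, 10_000
miles_share = (ev_share * ev_miles
               / (ev_share * ev_miles + (1 - ev_share) * other_miles))
print(round(miles_share, 3))  # ~6.9% of miles from 5% of cars
```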


Where are the RVA23 boards that have been hinted at for so long?

