More

mannyv · 2026-03-24T23:03:38 1774393418

Indexing all your porn and skipping all the filler.

iso1631 · 2026-03-25T12:12:27 1774440747

isn't the "fill her" the point of porn?

mannyv · 2026-03-23T15:46:02 1774280762

The software has real software engineers working on it instead of researchers.

Remember when people were arguing about whether to use mmap? What a ridiculous argument.

At some point someone will figure out how to tile the weights and the memory requirements will drop again.

snovv_crash · 2026-03-23T15:57:12 1774281432

The real improvement will be when the software engineers get into the training loop. Then we can have MoE that use cache-friendly expert utilisation and maybe even learned prefetching for what the next experts will be.

zozbot234 · 2026-03-23T16:29:44 1774283384

> maybe even learned prefetching for what the next experts will be

Experts are predicted by layer and the individual layer reads are quite small, so this is not really feasible. There's just not enough information to guide a prefetch.

yorwba · 2026-03-23T17:26:36 1774286796

It's feasible to put the expert routing logic in a previous layer. People have done it: https://arxiv.org/abs/2507.20984

snovv_crash · 2026-03-23T16:34:43 1774283683

Manually no. It would have to be learned, and making the expert selection predictable would need to be a training metric to minimize.

zozbot234 · 2026-03-23T16:40:08 1774284008

Making the expert selection more predictable also means making it less effective. There's no real free lunch.

mannyv · 2026-03-22T16:29:58 1774196998

Everyone is focused on the bad 2 bit result but who cares? He says don’t use it because it’s bad.

Aurornis · 2026-03-22T18:32:44 1774204364

If you don’t care about the output, why not reduce to 1-bit and only 1 active expert? It will be completely useless but it will be faster!

mannyv · 2026-03-22T02:03:16 1774144996

The structure of most residential construction in the US is standardized. Foundation (or slab), wood framing, etc. There are different levels of quality, but codes and standards mean that standardization is the norm.

mannyv · 2026-03-16T00:14:27 1773620067

It's fun, but testing has become more of a PITA. When I write code I test and understand each piece. With AI generated code I need to figure out how it works and why it isn't working.

mannyv · 2026-03-13T17:24:40 1773422680

Why bother transcoding on the fly? Storage is cheaper than CPU and the work it takes to determine what needs encoding is excessive.

It implies that you guys are generating the playlists on the fly, tracking the client requests, then feeding that over to your transcoder - which then needs to get the original, seek, and transcode. Why bother?

jon_dahl · 2026-03-13T17:45:16 1773423916

Mux founder here :wave:

Two answers.

First, it does save money. A meaningful percentage of videos on the internet are never watched in the first place, and an even larger percentage are watched soon after upload and never watched again. We're able to prune unwatched renditions, and if they happen to be requested years later, they're still playable. Transcoding on the fly lets us save both CPU and storage.

Second, it is ridiculously fast. Our median time-to-publish for a 5-20 minute video is 9 seconds. We had a customer (God bless them) complaining a few months ago that it took us something like 40 seconds to transcode a 40 minute video, which actually was slower than normal for us. If you do an async transcode up front, you're looking at 20 minutes, not <1 minute.

Blog post on this: https://www.mux.com/blog/how-to-transcode-video-100x-faster-...

steve_adams_86 · 2026-03-13T19:35:24 1773430524

> A meaningful percentage of videos on the internet are never watched in the first place [...] We're able to prune unwatched renditions, and if they happen to be requested years later, they're still playable.

I worked on something similar a while back, and the data that helped me make a call on whether or not we should transcode on the fly or store renditions was looking at analytics for how often the files are accessed.

I figured out that a large file being transcoded and stored would use more compute resources in ~15 minutes than it was likely to use over the span of _several years_ if it was transcoded on the fly. In a situation where you don't know if the company will exist in several years... You opt for the choice which allows you to stack on the storage later if it's necessary.

That's probably one of very few times I've ever applied YAGNI properly. That was ripe for over engineering

mannyv · 2026-03-14T00:03:25 1773446605

But can you still seek on a video that's being instantly transcoded? To be honest I don't know if anyone does that except YouTube, and it jumps to the time so theoretically you have about a second or two when the request comes into pull the file and start encoding. It sounds like the mezzanine file is chunked, so the time to pull it down is pretty fast.

Since it's your own player you can hint to the backend.

Do you dynamically generate the manifests too? Or do things get transcoded on request?

mannyv · 2026-03-14T00:30:21 1773448221

Oh, never mind - yeah, by using the access logs of segments you can effectively anticipate and pre-encode when you need to. And once the hls or cmaf stabilizes you can just encode one resolution. And the player will tell you it wants to move up or down, so you can trigger the encode it wants.

It's interesting your customers want the video immediately; ours don't care about that. But you guys can really build your manifest files and encode immediately, since you're making those mezzanine files.

Then the encoded files are basically a cache that you can evict whenever.

How long did it take you guys to prove that design out?

jon_dahl · 2026-03-16T14:34:36 1773671676

Yes and yes: you can immediately seek wherever you want. If it's the first time a rendition has been watched, and if we haven't chosen to pre-encode the later segments (which we do sometimes), you might see a slightly longer seek time as we create the first segment after seek, but the difference is marginal and goes away on second view.

> It's interesting your customers want the video immediately

Yep. Some do, some don't. User-uploaded workflows usually care about this; imagine uploading a video to post to a social network and then waiting 20 minutes for the post to go live. (News and sports care about fast publishing too.) Premium media usually doesn't; if you spend a few hours recording a lecture or a yoga class, you don't care if it takes 10 seconds or 10 minutes to publish.

> How long did it take you guys to prove that design out?

You don't want to know. It wasn't easy. The biggest challenge is the ongoing tax; other additions to our transcoding layer have an added degree of complexity. But it's been absolutely worth it for us.

mannyv · 2026-03-13T17:11:34 1773421894

Talent is definitely not evenly distributed...like luck.

mannyv · 2026-03-13T04:35:33 1773376533

It would be just as interesting to see how things have changed over the time - from the 1910s to now.

mannyv · 2026-03-11T21:21:22 1773264082

Iran has perfected the "full of shit" playbook. The press repeat it because clickbait.

general1465 · 2026-03-11T21:45:02 1773265502

They already hit 3 ships. They don't need to mine the strait, just making it impassable due to insurance cost because you can end up with a hole in a tanker is enough.

cosmicgadget · 2026-03-12T00:49:22 1773276562

Meanwhile on the other side of the conflict we have "war is complete" and "Iran better remove their mines or we will bomb them".

It's shit all the way down.

mannyv · 2026-03-11T00:01:03 1773187263

The correct question is: from whom?

When I was in finance there was a question whether US debt would crowd out other debt instruments. The answer, obviously, is "no." There seems to be an unlimited appetite for zero-risk debt, which makes no sense.