Hacker News | sorenjan's comments

I disagree, I don't want another ffmpeg binary, I already have one. Winget works well, especially since this is already a terminal program.

I don't find trimming videos with ffmpeg particularly difficult, it's basically just `-ss xx -to xx -c copy`. Sure, you need to get those timestamps using a media player, but you probably already have one, so that isn't really an issue.
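A minimal sketch of that stream-copy trim. The synthetic source and the forced short GOP (`-g 25`) are only there to make the example reproducible; with `-c copy` the cut snaps to the nearest keyframe, so real footage with sparse keyframes will cut less precisely:

```shell
# Build a 10 s test clip with a keyframe every second, then stream-copy a trim.
ffmpeg -y -v error -f lavfi -i testsrc=duration=10:rate=25 \
  -pix_fmt yuv420p -g 25 in.mp4
# Trim [2,5): fast, no re-encode, but keyframe-aligned.
ffmpeg -y -v error -ss 2 -to 5 -i in.mp4 -c copy out.mp4
```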

What I've found to be trickier is dividing a video into multiple clips, where one clip can start at the end of another, but not necessarily.


> I don't find sharing files with people very difficult, just log in to your FTP and give an account to another user.

- Person commenting on OneDrive

Missed opportunity to reference the famous Dropbox hn comment.

I just think there are other closely related use cases where a separate program can add more value, especially in the terminal. I wouldn't suggest most people use ffmpeg instead of a GUI, those are too dissimilar. Another example is cutting out a part of a video: with ffmpeg you need to make two temporary videos and then concatenate them, and that process would greatly benefit from a better UX.


Point of order: the Dropbox HN comment is famously misconstrued. People think it was about Dropbox; it was about the Dropbox YC application, and was both well-intentioned and constructive.

> with ffmpeg you need to make two temporary videos and then concatenate them

It can be done in a single command, no temp files needed.
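One way to do it in a single command (a sketch; this re-encodes rather than stream-copies) is the `select` filter, dropping the unwanted time range and rebuilding contiguous timestamps with `setpts`:

```shell
# Make a 6 s test clip, then remove t=[2,4] from it in one pass.
ffmpeg -y -v error -f lavfi -i testsrc=duration=6:rate=25 -pix_fmt yuv420p in.mp4
# select keeps frames outside [2,4]; setpts renumbers so playback is contiguous.
ffmpeg -y -v error -i in.mp4 \
  -vf "select='not(between(t,2,4))',setpts=N/FRAME_RATE/TB" \
  -an cut.mp4
```

For a clip with audio you'd pair this with `aselect`/`asetpts` on the audio stream.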


There's nothing easy about it. Here's a taste.

  # make a 6 second long video that alternates from green to red every second.
  ffmpeg -f lavfi -i "color=red[a];color=green[b];[a][b]overlay='mod(floor(t)\,2)*w'" -t 6 master.mp4; # creates 150 frames @ 25fps.

  # try to make a 1 second clip starting at 0sec. it should be all green.
  ffmpeg -ss 0 -i "master.mp4" -t 1 -c copy "clip1.mp4"; # exports 27 frames. you see some red.
  ffmpeg -ss 0 -t  1 -i "master.mp4" -c copy "clip2.mp4"; # exports 27 frames. you see some red.
  ffmpeg -ss 0 -to 1 -i "master.mp4" -c copy "clip3.mp4"; # exports 27 frames. you see some red.

  # -t and -to stop after the limit, so subtract a frame. but that leaves 26...
  # so perhaps offset the start time so that frame#0 is at 0.04 (ie, list starts at 1)?
  ffmpeg -itsoffset 0.04 -ss 0 -i "master.mp4" -t 0.96 -c copy "clip4.mp4"; # exports 25 frames, all green, time = 1.00. success.

  # try to make another 1 second clip starting at 2sec. it should be all green.
  ffmpeg -itsoffset 0.04 -ss 2 -i "master.mp4" -t 0.96 -c copy "clip5.mp4"; # exports 75 frames, time = 1.08, and you see red-green-red.
  # maybe don't offset the start, and drop 2 at the end?
  ffmpeg -ss 2 -i "master.mp4" -t 0.92 -c copy "clip6.mp4"; # exports 75 frames, time = 1.08, and you see green-red.
  ffmpeg -ss 2 -t 0.92 -i "master.mp4" -c copy "clip7.mp4"; # exports 75 frames, time = 0.92, and you see green-red.
  
  # try something different...
  ffmpeg -ss 2 -i "master.mp4" -c copy -frames 25 "clip8.mp4"; # video is broken.
  ffmpeg -ss 2 -i "master.mp4" -c copy -frames 25 -avoid_negative_ts make_zero "clip9.mp4"; # exports 25 frames, all green, time = 1.00. success?
  # try exporting a red video the same way.
  ffmpeg -ss 3 -i "master.mp4" -c copy -frames 25 -avoid_negative_ts make_zero "clip10.mp4"; # oh no, it's all green!

I've never tried doing frame-perfect clips like that, that does sound annoying. But from a cursory read of the source, I don't think this program will solve that issue either? Because the timestamps in your examples are all correct, and the TUI is using ffmpeg with -ss and -t as well.

  func BuildFFmpegCommand(opts ExportOptions) string {
      output := opts.Output
      if output == "" {
          output = generateOutputName(opts.Input)
      }
      duration := opts.OutPoint - opts.InPoint

      args := []string{"ffmpeg", "-y",
          "-ss", fmt.Sprintf("%.3f", opts.InPoint.Seconds()),
          "-i", filepath.Base(opts.Input),
          "-t", fmt.Sprintf("%.3f", duration.Seconds()),
      }

I think the best way of getting frame-accurate clips like that is putting the starting time after the input (or rather, before the output), which decodes the video up to that time and re-encodes it instead of copying. Both of these commands give the expected output:

  ffmpeg -i master.mp4 -ss 0 -t 1 -c:v libx264 green.mp4
  ffmpeg -i master.mp4 -ss 1 -t 1 -c:v libx264 red.mp4

Yer, I noticed that this tool was just doing `-ss -i -t` from its demo gif, which is what prompted me to reply. I'm sure people will discover all sorts of problems that manifest if they don't start a lossless clip on a keyframe. One such scenario: you make a clip that plays perfectly on your PC, but then you send it to someone over FB Messenger, and all of a sudden there's a few seconds of extra video at the start!

You can't make frame-perfect cuts without re-encoding, unless your cut points just so happen to be keyframe-aligned.

There are incantations that can dump metadata for you about the individual packets a given video stream is made up of, ordered by timecode. That way you can sanity-check things.
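One such incantation is ffprobe's packet listing, which shows each packet's timestamps and flags (a `K` in the flags column marks a keyframe). The generated test clip here is just to make the example self-contained:

```shell
# 2 s test clip with a keyframe every second (-g 25 at 25 fps).
ffmpeg -y -v error -f lavfi -i testsrc=duration=2:rate=25 \
  -pix_fmt yuv420p -g 25 in.mp4
# One CSV row per packet: pts_time, dts_time, flags ("K__" = keyframe).
ffprobe -v error -select_streams v:0 \
  -show_entries packet=pts_time,dts_time,flags \
  -of csv in.mp4 > packets.csv
```

That lets you see exactly which timestamps a `-c copy` cut can legally start on.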

This is terribly frustrating. The paths of least resistance lead either to improper cuts or to wasteful re-encoding. Re-encoding just up to the nearest keyframe is surely also possible, but yeah, this does suck, and the tool above doesn't seem to make it any more accessible either, according to the sibling comment.


> Re-encoding just until the nearest keyframe I'm sure is also possible

Yer, I've done that, and it's a pain to do "manually" (ie, without having a script ready to do it for you). I've also manually sliced the bitstream to re-insert the keyframe, which, if applied to my clip5.mp4 example, could potentially reduce the 50* negative-ts frames to maybe 2 or 3. It would be easier if there were tools that could "unpack" and "repack" the frames within the bitstream, and allow you to modify "pointers"/etc in the process - but I don't know of any such thing.

For frame-perfect cuts you need to re-encode. You can use lossless H264 encoding for intermediary cuts before the final one so that you don't unnecessarily degrade quality.
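A sketch of that workflow: x264 with `-qp 0` is mathematically lossless, so intermediate cuts don't compound generation loss, and only the final export re-encodes lossily. The test source is synthetic just to keep the example self-contained:

```shell
# 4 s test source.
ffmpeg -y -v error -f lavfi -i testsrc=duration=4:rate=25 -pix_fmt yuv420p src.mp4
# Frame-accurate intermediate cut, losslessly encoded (-qp 0).
ffmpeg -y -v error -i src.mp4 -ss 1 -t 2 \
  -c:v libx264 -qp 0 -preset ultrafast cut_lossless.mp4
# Final export, one lossy encode.
ffmpeg -y -v error -i cut_lossless.mp4 -c:v libx264 -crf 23 final.mp4
```

Note the lossless intermediates can get very large, so this is best for short clips.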

I wonder if there is a solution which would just copy the pieces in between the starting and ending points while only re-encoding the first and last piece as required.
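That approach is sometimes called a "smart cut". A rough sketch of the idea, with the keyframe positions forced so the numbers line up; on real footage you'd first read the actual keyframe times with ffprobe instead of assuming them:

```shell
# Source with keyframes at t = 0, 2, 4 (-g 50 at 25 fps). Goal: cut from t=1.
ffmpeg -y -v error -f lavfi -i testsrc=duration=6:rate=25 \
  -pix_fmt yuv420p -g 50 src.mp4
# Re-encode only the head, from the cut point to the next keyframe: [1,2).
ffmpeg -y -v error -i src.mp4 -ss 1 -to 2 -c:v libx264 head.mp4
# Stream-copy the rest, starting exactly on the keyframe at t=2.
ffmpeg -y -v error -ss 2 -i src.mp4 -c copy tail.mp4
# Join the two without another re-encode.
printf "file 'head.mp4'\nfile 'tail.mp4'\n" > list.txt
ffmpeg -y -v error -f concat -safe 0 -i list.txt -c copy joined.mp4
```

The fiddly part in practice is making the re-encoded head bit-compatible with the copied tail (same codec parameters), which is exactly what tools like LosslessCut's experimental smart-cut mode try to automate.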


FWIW, here's a simple command line utility for joining and trimming the multiple video files produced by a video camera.

https://metacpan.org/dist/App-fftrim/view/script/fftrim


I've been trying to cut precise clips from a long mp4 video over the past week or so and learned a lot. I started with ffmpeg on the command line, but between getting accurate timestamps and keyframe/encoding issues it is not trivial. For my needs I want a very precise starting frame, and the best results came from first re-encoding at much higher quality, then marking & batching with LosslessCut, then transcoding my clips down to the desired quality. Even then there's still some manual review and touch-up. It's not crazy-hard, but by no means trivial or simple.

https://github.com/mifi/lossless-cut


I used a plugin in mpv to do it, but I can't find it anymore. You just pressed a key to mark the start and end, and with . and , you could do it at keyframe resolution, not just seconds.

Found a few links to projects that fit this description in an awesome-mpv repo.

https://github.com/stax76/awesome-mpv?tab=readme-ov-file#vid...

Appreciate you mentioning the MPV route for making clips, I might actually go through and process all the game recordings I saved for clips over the years.


There's mpv-webm, which is great, but has no way to make a lossless clip AFAIK.

Both Russia and Ukraine build millions of drones per year, most of them FPV drones that are basically remote-controlled flying grenades. There's plenty of electronic warfare with radio jamming, so in some places they use drone-mounted spools of fiber-optic cable to control them instead. It's probably been the most impactful weapon type in the war for the past few years.

  > uvx --with pillow --with okmain python -c "from PIL import Image; import okmain; print(okmain.colors(Image.open('bluemarble.jpg')))"
  [RGB(r=79, g=87, b=120), RGB(r=27, g=33, b=66), RGB(r=152, g=155, b=175), RGB(r=0, g=0, b=0)]

It would make sense to add an entry point in the pyproject.toml so you can use `uvx okmain` directly.
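Something like the following in pyproject.toml would do it; the module path and function name here are guesses at okmain's layout, so adjust to wherever its CLI entry function actually lives:

```toml
[project.scripts]
okmain = "okmain.__main__:main"
```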

I wonder if one of the LLMs could generate code from a screenshot of a layout designed by this.

Claude Code built a TUI for me last night, in this case to step through nanosecond-timestamped ITCH market data messages and rebuild an order book visual in the terminal. This type of stuff would have taken a day, but it's done in 5 minutes now.

You can right click on it and choose "Show controls", at least in Firefox.

Oh, that's odd, it didn't show up in Chrome when I first tried it, but it does now. I was wondering how they'd managed to hide the video context menu.

It's probably just a <video> element without the "controls" attribute.

https://developer.mozilla.org/en-US/docs/Web/HTML/Reference/...

> controls

> If this attribute is present, the browser will offer controls to allow the user to control video playback, including volume, seeking, and pause/resume playback.
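A minimal sketch of what such a page likely does (the filename and the extra attributes here are illustrative, not taken from the actual site):

```html
<!-- No "controls" attribute, so the browser draws no playback UI;
     autoplay + muted + loop makes it behave like an animated gif. -->
<video src="demo.mp4" autoplay muted loop playsinline></video>
```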

Edit: I misunderstood, you are asking

> how they'd managed to hide the video context menu

Not sure, but it works in FF for me


It's entirely possible I did something to it accidentally that made the context menu not work properly. I had the dev tools open to pull the actual video address when I right-clicked, so I might have messed something up. Or maybe the devs are secretly looking at the comments and fixed it between me and you trying :P

It won't let me reply to the parent's child comment, but I wanted to say:

That is what HN is for!


Planet Labs has a solution specifically for ships.

https://www.planet.com/pulse/illuminate-the-dark-fleet-with-...


Dang, that is hard to do, 4 pixels of orange to work with.

But that's not something you'd use an LLM for. There have been computer vision systems sorting bad peas for more than a decade[0], and of course there are plenty of use cases for very fast inspection systems. But when would you use an LLM for anything like that?

[0] https://www.youtube.com/watch?v=eLDxXPziztw


Nobody said you would use an LLM for that. It's an example of a process where "industrial inspection, in particular, [would] benefit from lower latency in exchange for accuracy".

The point of their comment isn't that you would use an LLM to sort fruit. It was just an illustrative example.


The discussion was about fine-tuned Qwen models, not industrial inspection in general. I would also find it interesting to learn what kind of edge-AI industrial inspection task you could do with fine-tuned LLMs, not some handwavy answer about how sometimes latency is important in real-time systems. Of course it is, which is why you generally don't use models with several billion parameters unless you need to.


The thread you're in broke away from the main discussion topic.

Again: Nobody is using LLMs to (for example) sort fruit. But there are some industrial processes that prioritize latency over reliability.


No, we are literally trying to find a use case where using a lower accuracy LLM makes sense for a vision task.

But fine: what are these industrial processes that prioritize latency over reliability, where using an LLM, as mentioned by the OP, makes sense?


> No, we are literally trying to find a use case where using a lower accuracy LLM makes sense for a vision task.

They're reconfigurable on the fly with little technical expertise and without training data, which is really useful. Personally, in projects for people, I've found these models have fewer unusual edge cases than traditional models, are less sensitive to minor changes in input, and are easier to debug by asking them what they can see.


Seems like using a sledgehammer to hammer in screws, and inviting nondeterminism into important systems. Besides being way larger and more complex than what most specialized industrial processes need, they are also vulnerable to adversarial attacks.

https://www.lakera.ai/blog/visual-prompt-injections

https://www.theverge.com/2021/3/8/22319173/openai-machine-vi...


> Seems like a way to use a sledgehammer to hammer in screws

The lazy analogy the other way is that developing a custom system for these jobs is like hiring a team of experts to spend 2 years designing the perfect crosshead screwdriver that fits exactly one screw (and doesn't work if the screw starts slightly rotated) when you have a flathead one right next to you that'll work, and work right now.

> and inviting nondeterminism in important systems.

Traditional ML is just as non-deterministic.

> they are also vulnerable to adversarial attacks.

Typically not relevant in these kinds of cases, but this is also easily a problem in many traditional ML algos.

Have you worked on things like this?


A flathead screwdriver is not a valid analogy, because LLMs are big, complicated, opaque machines. And while other ML methods are non-deterministic as well, Gaussian processes, decision trees, or even CNNs are easier to make sense of than these huge black boxes.

And I still haven't seen a single example of anyone actually using a fine-tuned Qwen in industrial inspection, which leads me to believe that nobody is actually using it for that, but some people want to because it's their new favorite toy. You don't need a VLM to count cells in microscopy images, find scratches in painted parts, or estimate output from a log in a sawmill. I can see the use case for things like describing a scene from a surveillance camera, finding a car of a certain model and colour, or other tasks that demand more reasoning or description. But in those cases latency is not super important compared to getting the right output, which was the tradeoff discussed from the start of this thread.

The last thing I'd want to deal with is to have a computer say something like "You're absolutely right, it was wrong of me to classify the metal debris as food".


I've used multimodal LLMs for this sort of task, and if a fine-tuned model got reasonable performance compared to frontier models I'd use that. Running things purely locally lets you massively simplify the overall architecture and data-transfer requirements of some of these tasks, if nothing else, and lower latency means you can report problems much faster (vs transferring images off-device and batch processing).

> The last thing I'd want to deal with is to have a computer say something like "You're absolutely right, it was wrong of me to classify the metal debris as food".

The CNN will do that potentially more often, and it can be because it just hasn't seen enough examples of the debris at that angle, or something else equally irrelevant to a human.


You would use a VLM (vision language model). The model analyzes the image and outputs text, along with general context, that can drive intelligent decisions. https://tryolabs.com/blog/llms-leveraging-computer-vision


They worked best when everyone was a farmer and had to get up early and go to bed early. Now most people don't live their lives centered around noon; our free time comes after our work is done, at around 17:00, so having more light in the evening instead of worthless light in the night makes sense.


That's a myth.

Farmers have to wake up early because their animals wake up at sunrise and some tasks are best performed at that time. So they wake up before sunrise regardless of the clock time.

Humans, like farm animals, are better off if they wake up at sunrise and go to sleep in full dark. At the equator that's easy: wake at 6, bed at 10 PM. And standard work hours are 7-3 or 8-4.


So it sounds like you're actually arguing that the numbers are just a construct, and that we should all just use UTC and set work hours to the times that best correlate with the solar day in our region, rather than adjust the clock approximately 1 hour per 15 degrees of longitude and have an International Date Line.

I think this would make way more sense: when they say the Olympic opening ceremony starts at 18:00, it's 18:00 for everyone around the world. No one has to work out which TZ Italy is in, and scheduling meetings with tech support in far-flung locales doesn't require knowing how far ahead or behind IST is.


Yes. https://en.wikipedia.org/wiki/Sandford_Fleming ( https://www.smithsonianmag.com/smithsonian-institution/sandf... )

> He promoted worldwide standard time zones, a prime meridian, and use of the 24-hour clock as key elements to communicating the accurate time, all of which influenced the creation of Coordinated Universal Time.

The one bit where this would be problematic would be "what day is it?" When does today become tomorrow?

There are a lot of systems we've built that depend on that distinction: things like business days, and running end-of-day so that everything that happens on March 2nd is logged as March 2nd. I've encountered fun with Black Friday sales where the store is open over the midnight boundary and the backend system really wants today to be today rather than yesterday (sometimes this has involved unplugging a register from the network so that it doesn't run end-of-day, running EOD on the store systems, then plugging the register back in after it completes and running a reconciliation).

Other than that particular mess of banks and businesses... yeah, running everything on UTC would be something nice in today's world.

---

This is also kind of what happens in China (with a complicated history). https://github.com/eggert/tz/blob/main/asia#L272

https://en.wikipedia.org/wiki/Time_in_China UTC+08:00 is observed throughout the country even though it spans about 60° of longitude.

---

Aside on "changing clocks" (and realizing my flexible-schedule privilege): at a company I worked at, I switched my schedule from 8-4 to 9-5 with the change in daylight savings so that I maintained a consistent "this is the hour I wake up".


China shows why this is impossible.

When people propose switching to UTC, what they are actually proposing is that everyone nominally switches to UTC but still uses local time informally in everyday life, which is a worse system than time zones. At least with time zones there is a way to know what time it is in any given place. With informal time you lose that.


How so?

Eastern parts of China get up at 05:00 and western parts get up at 10:00.

People get used to it.


Local time tells you things like "when is it a good time to call this person?" Unless the person you're calling is in China.


That's a fair point. And a CRM system should take note of it. Not everyone lives on a 9-to-5 schedule.



> arguing that the numbers are just a construct

Yes.

> and that we should all just use UTC and ...

No. that does not follow. Abstraction is useful. Having commonly understood terms (in this case hours of the day) that share certain traits regardless of where you happen to be in the world facilitates communication.


Right, but where I live, sunrise is in the middle of the night in the summer (around 03:30). Using standard time in the summer gives me one less hour of useful sunlight in the evening, and while it doesn't technically disappear, it gets moved to where I can't use it, because that's when I sleep. It's the same for people further south: another bright hour in the early morning before they wake up is a wasted bright hour that would make more sense in the evening, when most modern humans are awake. The argument "noon should coincide with solar noon" is nonsensical to me; the clock is a social construct and should make sense for how most of us live our lives.


But the social construct of work hours shifted later by more than that one hour over the last century, so this is not what people actually prefer, judging by their actions.


Optimizing for summer is silly. Summer gets lots of daylight already. We need to optimize for winter.


People disagree on whether to prioritize mornings or afternoons in the winter. For the summer, only very few people care if the sun rises at four or five (or whatever), but most people like having long summer evenings. Therefore the summer tips the scales.


Then there are also social activities that you just need to wait for in summer, because they can only happen after sunset. Watching a movie (outside), sitting around a fire, having a party: all of these really happen only after sunset.


The extra hour of daylight in the evening on summer time is even more valuable in the winter.


Is this a distillation of Nano Banana Pro?


Gemini 3.1 Flash Image is based on Gemini 3 Flash.

source: https://deepmind.google/models/model-cards/gemini-3-1-flash-...

