More

dr_dshiv · 2025-12-16T11:06:09 1765883169

“The Strength of Ease” — a mantra I tell myself

dr_dshiv · 2025-12-12T00:46:48 1765500408

That’s… too bad

dr_dshiv · 2025-12-12T00:39:29 1765499969

Vibes

dr_dshiv · 2025-12-11T08:45:10 1765442710

> It almost sounds like you’re saying there’s essentially an LLM inside everyone’s brain. Is that what you’re saying?

>Pretty much. I think the language network is very similar in many ways to early LLMs, which learn the regularities of language and how words relate to each other. It’s not so hard to imagine, right?

Yet, completely glosses over the role of rhythm in parsing language. LLMs aren’t rhythmic at all, are they? Maybe each token production is a cycle, though… hmm…

GolDDranks · 2025-12-11T09:24:03 1765445043

I think it's obvious that she means that it's something _like_ LLMs in some aspects. You are correct in that rhythm and intonation are very important in parsing language. (And also an important cue when learning how to parse language!) It's clear that the human language network is not like LLM in that sense. However, it _is_ a bit like an _early_ LLM (remember GPT2?) in the sense that it can produce and parse language, not that it makes much deeper sense in it.

tgv · 2025-12-11T11:48:53 1765453733

However ... language production and perception are quite separated in our heads. There's basically no parallel to LLMs. Note that the article doesn't give any, and is extremely vague about the biological underpinnings of language.

GolDDranks · 2025-12-11T16:31:43 1765470703

> language production and perception are quite separated in our heads

Do you have any evidence for this?

I am a former linguistics student (got my masters), and, after years of absenteeism in academia, interested in the current state of the affairs. So: "quite separated in our heads" Evidence for? against?

tgv · 2025-12-11T18:13:06 1765476786

Afasia, and general measures of "normal" performance.

There are various kinds of afasia, often linked to specific brain areas (Wernicke's and Broca's are well-known). And M/EEG and fMRI research suggests similar distinctions. It is difficult to reconcile with the idea that there is only one language system.

And you will also have noticed that your skills in perception and production differ. You can read/listen better than write/speak. Timing, ambiguity and errors in perception and production differ.

And more logically: the tasks are very different. In perception, you have to perceive the structure and meaning from a highly ambiguous, but ordered input of sound triggering auditory nerves, while during production, meaning is given (in non-linear order), and you have to find a way to fit it in a linear, grammatical order with matching words, which then have to be translated to muscle movements.

GolDDranks · 2025-12-15T10:57:48 1765796268

Ah, totally agreed. At least there is a clear auditory / motor part in the tasks that seems quite separate.

However, I find it also unlikely that the networks are totally separate, and I wonder if there are any evidence of areas that encode the "core/abstract" linguistic de/serialization (multidimensional and messy internal semantic information ←→ linear morphophonological information) both ways, or at least mechanism that manages to use gained input network competence to "train" or "manage" output network competence.

Why? Because even though, as you say, there is a differing performance in perception and production, there is also plenty of evidence of gaining linguistic competence from input, and then managing to convert that to performance in output.

Terretta · 2025-12-11T19:44:49 1765482289

> It's clear that the human language network is not like LLM in that sense.

Is it though? If rhythm or tone changes meaning, then just add symbols for rhythm and tone to LLM input and train it. You'll get not just words out that differ based on those additional symbols wrapping words, but you'll also get the rhythm and tone symbols in the output.

coldtea · 2025-12-13T02:27:09 1765592829

>Yet, completely glosses over the role of rhythm in parsing language.

If you're talking about speech cadence/rhythm, then we also parse written language which doesn't have that. And we're quite capable of parsing a monotone robotic voice speaking with a monotonous mechanical rhythm too.

dr_dshiv · 2025-12-11T08:36:36 1765442196

I used this to integrate image generation and gif generation in my Claude coding — very helpful for making beautiful websites. They have Midjourney API

dr_dshiv · 2025-12-10T15:59:54 1765382394

Come to the Netherlands! It’s awesome. And the visa is easy: just put 5k in a business account. Look up Dutch American friendship treaty.

thayne · 2025-12-11T07:14:36 1765437276

I looked that up. It sounds like you have to start an entrepreneurial business there (in which you invest at least 5k). I have no interest in starting my own business, so that is probably not an option.

busterarm · 2025-12-10T16:07:37 1765382857

But finding housing is brutal.

swat535 · 2025-12-10T16:47:23 1765385243

Genuinely asking, are there any country that doesn't have a housing crisis right now? From my understanding, it's everywhere in Western nations

ponector · 2025-12-10T20:49:37 1765399777

Is housing really unaffordable in US? How many years average Joe needs to work to buy a 1bd apartment in the big city, let's say 500sq ft? Price in annual net salary is much lower in USA than in big EU cities.

stop50 · 2025-12-10T17:26:39 1765387599

At least for my state in germany we have an crisis of housing in big cities. In small vilages and cities it is pretty cheap to buy a house.

jonkoops · 2025-12-10T16:10:31 1765383031

It is. Unfortunately, the most of the EU is affected by the housing crisis.

koakuma-chan · 2025-12-10T16:11:33 1765383093

Which country is not affected by housing crisis?

mothballed · 2025-12-10T16:23:26 1765383806

Kazakhstan, Laos, maybe Romania. Croatia, not terrible.

Ylpertnodi · 2025-12-10T16:14:17 1765383257

Yes! If you are coming over, please, please, go to the Netherlands.

eli_gottlieb · 2025-12-10T16:00:27 1765382427

So a business account is one belonging to a business, right?

searedsteak · 2025-12-10T16:11:47 1765383107

Is there reciprocity for this treaty?

returningfory2 · 2025-12-10T18:43:01 1765392181

Yes: E-2 visa. Though I suspect the investment amount has to be much higher than the Dutch requirement of ~$5k.

dr_dshiv · 2025-12-10T15:58:46 1765382326

Google recently has made efforts to link Gemini to Google Earth data. “Earth AI”

https://ai.google/earth-ai/

Has anyone tried this? I’m not quite sure how to prompt it effectively.

mistrial9 · 2025-12-10T16:53:13 1765385593

arxiv:2510.18318

dr_dshiv · 2025-12-05T01:20:05 1764897605

> I find it very appealing to consider the idea that the world is not somehow running “hidden mathematics”, somewhere and somehow, to solve some complicated equations in a seemingly magical way, but rather, that things are radically simpler, in that the world is simply implementing a set of trivially simple rules. The world is not concerned with, or made with mathematics, mathematics just emerges, with inherent and irreducible complexity, from extreme simplicity.

Wouldn’t those simple rules be mathematics? It’s very hard for me to see how the world isn’t made of math. Then again, I am a Pythagorean.

ui23 · 2025-12-05T17:11:57 1764954717

Mathematics is just a tool, like language, for describing reality, not reality itself.

A cake is not made of numbers like 5 cups flour + 3 eggs, but we can model it as such. In principle we could invent any such system of symbols to describe the physical world but those symbols don’t define it. The physical world only nudges us toward what symbols work and which don’t.

dr_dshiv · 2025-12-05T18:33:43 1764959623

Strawman. You are claiming math isn’t real?

ui23 · 2025-12-05T19:22:20 1764962540

Not a strawman. I’m not stating that math isn’t real. It’s real as an abstract framework that humans create and refine. It’s not, however, foundational to the physical world in the way fundamental particles or gravity is. Numbers and equations don’t push particles around, they simply help us represent that kind of phenomena that we observe.

dr_dshiv · 2025-12-05T20:45:28 1764967528

So spheres are invented by humans? In another universe, there are no spheres or triangles?

mrguyorama · 2025-12-05T01:57:50 1764899870

There is a distinction between "What the universe is made of or how it works just happens to be really compatible with how math describes things" and "The universe is just "running" math and we discovered that math and use it for other things"

But like, words stop working at these levels of rigor.

What the hell does "The universe is made of math" mean? How can something be made of a field of study? Where is the "Addition" particle? How does 1+1=2 give rise to what we see as an electron?

Like it's bad enough dealing with "quantum fields" that might be "real" or maybe are just really nice mathematical objects that happen to be useful for calculating the future.

Does math take up space? Does space take up math? Does blue afraid of seven? Can I eat integrals or will they go straight to my thighs?

If the universe is "made of" math, what is the consequence? For example, the consequence of being made of "quantum fields" in my lay mind is that we get observations like entanglement and the hilarity of whatever is going on in the higgs field.

>Then again, I am a Pythagorean.

Ah, let me just move this sqrt(2) out of the way real quick :P

I want simple rules because I am a simple man, and if those simple rules happen to actually be math, that sucks for me because the "simple rules" are really hard math.

dr_dshiv · 2025-12-05T14:29:21 1764944961

Pythagoras almost certainly was misinterpreted.

Unsayable numbers (the way the Greeks said irrational numbers) can take the wrong meaning. Like, why are they unsayable? Because you’d die before you could say them. Well, it’s not a threat!

Then it turns into this whole ahistorical fabrication impugning Pythagoras who was, otherwise, pretty much the most incredible guy ever.

Now, the “addition particle” is a strawman, but harder to deal with is just numbers. Are numbers real? Are there discrete “things” in the universe? Well, yes there are. Frequencies or quanta do just fine. Now, when there are numbers, they can be added, whether we want to or not.

Another example would be geometries. Are spheres real? Surely! Do they exist on any planet in the universe? It would seem. Are there any perfect spheres? Nope. Do they precede matter and energy? It would seem.

I think we are saying the same thing. Unfortunately, these beliefs are slippery and metaphysical. I take pride, though, in the pythagoreanness of so many of the scientific greats, from Newton to Penrose.

dr_dshiv · 2025-12-04T18:50:09 1764874209

Autism was split into autism and Asperger’s.

But calling people with social challenges “Assburgers,” I mean, wow. Just wow.

lcnPylGDnU4H9OF · 2025-12-04T19:41:50 1764877310

> Asperger’s

This term has a complicated history so people use "high-functioning" now. Many refer to Hans Asperger as a Nazi eugenicist. Reasonably, I'd say.

https://en.wikipedia.org/wiki/Hans_Asperger#Children_sent_to...

https://en.wikipedia.org/wiki/Am_Spiegelgrund_clinic#Experim...

> Just as the physician must often make painful incisions during the treatment of individuals, we must also make incisions in the national body, out of a sense of responsibility: we must make sure that those patients who would pass on their diseases to distant generations, to the detriment of the individual and of the Volk, are prevented from passing on their diseased hereditary material

Der_Einzige · 2025-12-04T23:12:35 1764889955

Same stuff with lisp being the name of a condition that people with lisps can’t pronounce. They did it on purpose and they know what they did.

The same folks who cause all the Mandela effects did this cus they thought it’s funny. We are all job in his biblical story.

dr_dshiv · 2025-12-04T09:54:43 1764842083

Prediction: Vibe coding systems will be better at security in 2 years than 90% of devs.

input_sh · 2025-12-04T10:02:46 1764842566

Prediction: it won't.

You can't fit every security consideration into the context window.

dr_dshiv · 2025-12-04T11:12:14 1764846734

90% of human devs are not aware of every security consideration.

input_sh · 2025-12-04T11:47:22 1764848842

90% of human devs can fit more than 3-5 files into their short-term memory.

They also know not to, say, temporarily disable auth to be able to look at the changes they've made on a page hidden behind auth, which is what I observed Gemini 3 Pro doing just yesterday.

dr_dshiv · 2025-12-04T12:33:52 1764851632

Ok, and that’s your prediction for 2 years from now? It’d be quite remarkable if humans had a bigger short term memory than LLMs in 2 years. Or that the kind of dumb security mistakes LLMs make today don’t trigger major, rapid improvements.

input_sh · 2025-12-04T12:51:58 1764852718

Do you understand what the term "context window" means? Have you ever tried using an LLM to program anything even remotely complex? Have you observed how the quality of the output drastically reduces the longer the coversation gets?

That's what makes it bad at security. It cannot comprehend more than a floppy drive worth of data before it reverts to absolute gibberish.

eric-burel · 2025-12-04T12:41:48 1764852108

You may want to read about agentic AI, you can for instance call an LLM multiple times with different security consideration everytime.

input_sh · 2025-12-04T14:43:24 1764859404

There's about a dozen workarounds around context limits, agents being one of them, MCP servers being another one, AGENTS.md being the third one, but none of them actually solve the issue of a context window being so small that it's useless for anything even remotely complex.

Let's imagine a codebase that can fit onto a revolutionary piece of technology known as a floppy drive. As we all know, a floppy drive can store <2 megabytes of storage. But a 100k tokens is only about 400 kilobytes. So, to process the whole codebase that can fit onto a floppy drive, you need 5 agents plus the sixth "parent process" that those 5 agents will report to.

Those five agents can report "no security issues found" in their own little chunk of the codebase to the parent process, and that parent process will still be none the wiser about how those different chunks interact with each other.

eric-burel · 2025-12-06T18:02:14 1765044134

You can have an agent that focuses on studying the interactions. What you're saying is that an AI cannot find every security issue but neither do humans otherwise we wouldn't have security breaches in the first place. You are describing a relatively basic agentic setup mostly using your AI-assisted text editor but a commercial security bot is a much more complex beast hopefully. You replace context by memory and synthesis for instance, the same way our brain works.

joshribakoff · 2025-12-04T15:55:35 1764863735

In one instance it could not even describe why a test is bad unit test (asserting true is equal to true), which doesn’t even require context or multi file reasoning.

Its almost as if it has additional problems beyond the context limits :)

eric-burel · 2025-12-06T18:04:02 1765044242

In an agentic setup you are still dependent on having relatively smart models that's true.

joshribakoff · 2025-12-04T15:52:40 1764863560

You may want to try using it, anecdotes often differ from theories, especially when they are being sold to you for profit. It takes maybe a few days to see a pattern of ignoring simple instructions even when context is clean. Or one prompt fixes one issue and causes new issues, rinse and repeat. It requires human guidance in practice.

ethbr1 · 2025-12-04T16:17:40 1764865060

Strongman: LLMs aren't a tool, they're fuzzy automation.

And what keeps security problems from making it into prod in the real world?

Code review, testing, static and dynamic code scanning, and fuzzing.

Why aren't these things done?

Because there isn't enough people-time and expertise.

So in order for LLMs to improve security, they need to be able to improve our ability to do one of: code review, testing, static and dynamic code scanning, and fuzzing.

It seems very unlikely those forms of automation won't be improved in the near future by even the dumbest form of LLMs.

And if you offered CISOs a "pay to scan" service that actually worked cross-language and -platform (in contrast to most "only supported languages" scanners), they'd jump at it.

joshribakoff · 2025-12-05T00:05:47 1764893147

There is an argument here that the LLM is a tool that can multiply the addition or removal of the defects depending on how it is wielded.

ethbr1 · 2025-12-05T00:12:00 1764893520

I think the father figure of a developer who was bitten by a radioactive spider once made a similar quip.

windexh8er · 2025-12-04T13:37:37 1764855457

And that buys you what, exactly? Your point is 100% correct and why LLMs are no where near able to manage / build complete simple systems and surely not complex ones.

Why? Context. LLMs, today, go off the rails fairly easily. As I've mentioned in prior comments I've been working a lot with different models and agentic coding systems. When a code base starts to approach 5k lines (building the entire codebase with an agent) things start to get very rough. First of all, the agent cannot wrap it's context (it has no brain) around the code in a complete way. Even when everything is very well documented as part of the build and outlined so the LLM has indicators of where to pull in code - it almost always cannot keep schemas, requirements, or patterns in line. I've had instances where APIs that were being developed were to follow a specific schema, should require specific tests and should abide by specific constraints for integration. Almost always, in that relatively small codebase, the agentic system gets something wrong - but because of sycophancy - it gleefully informs me all the work is done and everything is A-OK! The kicker here is that when you show it why / where it's wrong you're continuously in a loop of burning tokens trying to put that train back on the track. LLMs can't be efficient with new(ish) code bases because they're always having to go lookup new documentation and burning through more context beyond what it's targeting to build / update / refactor / etc.

So, sure. You can "call an LLM multiple times". But this is hugely missing the point with how these systems work. Because when you actually start to use them you'll find these issues almost immediately.

joshribakoff · 2025-12-04T15:59:18 1764863958

To add onto this, it is a characteristic of their design to statistically pick things that would be bad choices, because humans do too. It’s not more reliable than just taking a random person off the street of SF and giving them instructions on what to copy paste without any context. They might also change unrelated things or get sidetracked when they encounter friction. My point is that when you try to compensate by prompting repeatedly, you are just adding more chances for entropy to leak in — so I am agreeing with you.

windexh8er · 2025-12-04T20:43:24 1764881004

> To add onto this, it is a characteristic of their design to statistically pick things that would be bad choices, because humans do too.

Spot on. If we look at, historically, "AI" (pre-LLM) the data sets were much more curated, cleaned and labeled. Look at CV, for example. Computer Vision is a prime example of how AI can easily go off the rails with respect to 1) garbage input data 2) biased input data. LLMs have these two as inputs in spades and in vast quantities. Has everyone forgotten about Google's classification of African American people in images [0]? Or, more hilariously - the fix [1]? Most people I talk to who are using LLMs think that the data being strung into these models has been fine tuned, hand picked, etc. In some cases for small models that were explicitly curated, sure. But in the context (no pun) of all the popular frontier models: no way in hell.

The one thing I'm really surprised nobody is talking about is the system prompt. Not in the manner of jailbreaking it or even extracting it. But I can't imagine that these system prompts aren't collecting mass tech debt at this point. I'm sure there's band aid after band aid of simple fixes to nudge the model in ever so different directions based on things that are, ultimately, out of the control of such a large culmination of random data. I can't wait to see how these long term issues crop and and duct taped for the quick fixes these tech behemoths are becoming known for.

[0] https://www.bbc.com/news/technology-33347866 [1] https://www.theguardian.com/technology/2018/jan/12/google-ra...

eric-burel · 2025-12-06T18:08:35 1765044515

Talking about the debt of a system prompt feels really weird. A system prompt tied to an LLM is the equivalent of crafting a new model in the pre-LLM era. You measure their success using various quality metrics. And you improve the system prompt progressively to raise these metrics. So it feels like bandaid but that's actually how it's supposed to work and totally equivalent to "fixing" a machine learning model by improving the dataset.

aduwah · 2025-12-04T12:24:20 1764851060

This will age badly

dr_dshiv · 2025-12-04T12:31:17 1764851477

That’s why we make concrete measurable predictions.

MangoToupe · 2025-12-04T13:36:41 1764855401

Agreed, but "vibe coding will be better at security" is not one of them. Better by which metric, against which threat model, with which stakes? What security even means for greenfield projects is inherently different than for hardened systems. Vibe coding is sufficient for security today because it's not used for anything that matters.

Cthulhu_ · 2025-12-04T12:36:22 1764851782

It'll play a role in both securing and security research I'm sure, but I'm not confident it'll be better.

But also, you'd need to have some metrics - how good are developers at security already? What if the bar is on the floor and LLM code generators are already better?

wizzledonker · 2025-12-04T12:49:45 1764852585

Only if they work in a fundamentally different manner. We can't solve that problem the way we are building LLMs now.