Hacker Newsnew | past | comments | ask | show | jobs | submit | villgax's commentslogin

The fact that I wasn’t able to link llama.cpp server locally without fuss kinda beats the whole open point. Open for proprietary APIs only?

Lol do your DD properly before posting

Not only that, but unfiltered, unedited LLM responses to the comments.

shudder, I'd rather talk to the LLM directly than to this wetware shell :)


lol certifications for a proprietary model stack is not worth the storage or paper

> lol certifications for a proprietary model stack is not worth the storage or paper

Are you sure? What about all those AWS, Azure, etc certifications that many places require their engineers to have?


That’s literally to go and do that thing that AWS promises to offer off-the-shelf.

Anthropic is trying to push architects for something which changes behavior every month pretty much so what works today may not work the same way in a quarter even ignoring determinism across hardware for the sake of it.

Not one startup goes about trying to hire people with certifications. It’s usually body shops offering SMEs on specific technologies. Unless anthropc is offering an ever evolving architect then I’d be wrong.


just waiting on whatsapp to rug pull as well & then bye bye privacy & meta from my life

Wouldn't bye bye meta be hello privacy into your life?

To be online is gonna just become pointless. Only literal need is for news or payments.

NSFW stuff doesn’t need the internet anymore. Critique of regimes aren’t safe either so being online is just a crutch and thats so sad.


You should travel, the whole point of being remote is to enjoy life without being tagged to a location, since you have pets depending on how comfortable you are with them being in hostels you should definitely be traveling a lot more and then be able to meet folks and have a more filling life, start local and then go abroad often

This has sparked a discussion


I think they did work with a few state governments and defence entities. So something like micro-Anthropic X Palantir.


Literally admitting to theft & whining about the modus which got them caught lol


Got nuked on day zero by Qwen models at tenth or so of params.

Does not handle critical inputs even for moderation tasks

These guys did not even bother with an official huggingface space

And the biggest stupidity seems to be fixating on MXFP4 for Apple Silicon when it doesn't even have hardware support for it, should have just done Q4 for GGUF based inference


> These guys did not even bother with an official huggingface space

https://huggingface.co/sarvamai


That is their profile not a HF Space


What do you mean? I can see the files, download count, deploy/use this model options etc.


What part of a HuggingFace Space do you not understand?

They’ve also not bothered with upstreaming the model arch to transformers and require remote code for their modeling code to run……


Responding to my question with your own is not an answer. So again; what do you mean by "official huggingface space"? Their profile page does list the various models and their weights. Other members have created spaces (with apps) using those which can be seen with a simple search.

You have been making some rather bizarre (nuked by Qwen models, does not handle critical inputs etc.) statements which make no sense.

Have you actually downloaded/used/played-with the models? Can you share what you exactly tried out?


Got to start somewhere.

I do think convincing world-class talent to live in Bangalore is likely to be a challenge though.


Indians deep-down often aren't comfortable in the West given the subtle racism and general social-rejection (last year's anti-Indian hate on X remains fresh in memory).

BLR has of late become a sort of "refuge" of tech retunees (with horrible third-world government and infrastructure, though). And it shows - the Matryoshka Embeddings being used in Gemini on-device / embedded models, came out of Deepmind BLR.


For sure, there’s no place like home, and people have families and networks they can’t take with them. Still, getting that Western passport is a draw, and there’s always Abu Dhabi if you want quite close to home and a decent biryani, but also want world-class infrastructure and high (although not quite US) wages


Bigger issue here is why the government is involved with select companies for subsidizing compute. There’s no pre or post criterion to assess success, it should have just been an open market for people with money to purchase compute instead of 10 companies with no prior experience in making models of any kind.

Public funds should beget public datasets and training scripts to see how it is being aligned as well and not just pandering to a particular govt.


> Bigger issue here is why the government is involved with select companies for subsidizing compute.

Government-choosing-winners has worked much better, in many such cases, than free-market absolutists would have you believe…


it's clearly labeled a research project, feel free to DIY


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: