OpenAI is growing fast, and pivoting is only to be expected. It's something HN folk would normally value and encourage in startups.
They have clearly been lacking focus, and now they finally seem to be converging on a narrower direction, which investors usually value highly.
This article doesn't explore the depth of the decisions; it just regurgitates what you might find your neighbor complaining about on X, but with a better stylesheet.
Of course the "wait 7 days" rule is not a silver bullet, but it gives automated scanners plenty of time to do their work. Those scanners will surely catch the `eval(base64.decode("..."))` stuff that some of these attacks used, so in my book the dependency cooldown is a net win. I guess skilled malicious actors will then up their game, but I think it's okay to kick off an arms race between them and the security scanners in the dependency world.
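For what it's worth, the kind of pattern those scanners would flag is easy to sketch. Here's a toy heuristic (nothing like a production scanner, and the function name is made up) that walks a module's AST looking for `eval`/`exec` calls fed by a base64 decode:

```python
import ast

def flag_dynamic_exec(source: str) -> list[str]:
    """Toy heuristic: flag eval()/exec() calls whose arguments
    involve a base64-style decode somewhere inside them."""
    findings = []
    for node in ast.walk(ast.parse(source)):
        if (isinstance(node, ast.Call)
                and isinstance(node.func, ast.Name)
                and node.func.id in {"eval", "exec"}):
            # Look for a decode call anywhere inside this call's subtree.
            for inner in ast.walk(node):
                if (isinstance(inner, ast.Call)
                        and isinstance(inner.func, ast.Attribute)
                        and inner.func.attr in {"b64decode", "decode"}):
                    findings.append(
                        f"line {node.lineno}: {node.func.id} over decoded payload")
                    break
    return findings

print(flag_dynamic_exec('eval(base64.b64decode("cHJpbnQoMSk="))'))
```

Real scanners obviously go much further (obfuscated string assembly, install-time hooks, network calls in setup.py), but even this level of static matching is cheap enough to run on every new upload during a cooldown window.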
That's a good point. On some level I'd prefer the delay to happen at publication of the package itself. Do any of these scanners offer cryptographic attestations or similar?
Writing down specs for technical projects is a transformational skill.
I've had projects that seemed tedious or obvious in my head, only to discover hidden complexity when I tried to put their supposed triviality into written words. It really is a sort of meditation on the problem.
In the most important AI-assisted project I've shipped so far, I wrote the entire spec myself first. But feeding it through an LLM feedback loop felt just as transformational: it not only gave me an easier-to-parse document, it helped me understand both the problem and my own solution from multiple angles and let me address gaps early on.
So both this and litellm went straight to PyPI without going to GitHub first.
Is there any way to set up PyPI to only publish packages that come from a certain pattern of tag in GitHub? Would such a measure help at all here?
Yes: if you use a Trusted Publisher with PyPI, you can constrain it to an environment. Then, on GitHub, you can configure that environment with a tag or branch protection rule that only allows the environment to be activated if the ref matches. You can also configure required approvers on the environment, to prevent anyone except your account (and potentially other maintainers you’d like) from activating the environment.
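Concretely, the GitHub side of this looks something like the sketch below (the environment name `pypi` and tag pattern are examples; the protection rules themselves are configured in the repo's environment settings, not in the workflow file):

```yaml
# .github/workflows/publish.yml -- sketch, not a drop-in config
name: publish
on:
  push:
    tags: ["v*"]

jobs:
  pypi:
    runs-on: ubuntu-latest
    # Protect this environment in repo settings with a tag rule
    # (e.g. only refs matching v*) and required reviewers.
    environment: pypi
    permissions:
      id-token: write   # required for Trusted Publishing (OIDC)
    steps:
      - uses: actions/checkout@v4
      - run: python -m pip install build && python -m build
      - uses: pypa/gh-action-pypi-publish@release/v1
```

On the PyPI side, the Trusted Publisher entry for the project would then be pinned to this repository, workflow filename, and the `pypi` environment.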
If they have compromised the token, wouldn't that mean the developer is compromised, and such access could be used to just put `curl whatever` into the build and publish that payload on PyPI?
On Debian, all builds happen without internet access, so whatever ends up in the .deb file is either contained in the dependencies or in the orig tarball.
Is anything similar done for builds that create artifacts for PyPI, so that a certain correspondence between binary file and sources exists? Or is there unrestricted internet access, so that what actually ends up on PyPI can come from anywhere and vetting the sources is of little help?
That’s a nice property of centralized package management systems; I don’t think anything exactly like that exists for PyPI. The closest thing would be a cryptographic attestation.
(If I wanted to taxonomize these things, I'd say the Debian model is effectively a pinky promise that the source artifacts correspond to the built product, except it's a better pinky promise because it's one-to-many instead of many-to-many, as language package managers generally are. You can then formalize that pinky promise with keys and signatures, but at the end of the day you're still essentially binding a promise.)
Depends on what you mean by "this." If you mean build provenance, yes; if you mean transmuting PyPI into the kind of trust topology that Debian (for example) has, no.
(I think PEP 740 largely succeeds at providing build provenance; having downstream tooling actually do useful things with that provenance is harder for mostly engineering coordination reasons.)
Don't keep the token in your hands at all. Use OIDC ideally, or else store it carefully as a repository secret. Ensure the workflow runs in a read-only-permission, minimal-dependency environment.
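If OIDC isn't an option, the fallback looks roughly like this: the token lives only in a repository secret, and the job's default token is locked down to read-only (the secret name here is an example):

```yaml
# Sketch of the non-OIDC fallback: secret-held token, minimal permissions
jobs:
  publish:
    runs-on: ubuntu-latest
    permissions:
      contents: read   # read-only GITHUB_TOKEN; grant nothing else
    steps:
      - uses: actions/checkout@v4
      - run: python -m pip install build && python -m build
      - uses: pypa/gh-action-pypi-publish@release/v1
        with:
          password: ${{ secrets.PYPI_API_TOKEN }}  # hypothetical secret name
```

The point of the minimal-dependency part is that every third-party action in the job can read that secret once it's exposed to a step, so the fewer actions the workflow pulls in, the smaller the blast radius.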
The issue with OIDC is that it doesn't work with nested workflows, because GitHub doesn't propagate the claims.
They stacked the deck. Where v2 was still rule inference plus spatial reasoning, a bit like juiced-up Raven's Progressive Matrices, v3 adds a whole new multi-turn explore/exploit agentic dimension.
Given how hard even pure v2 was for modern LLMs, I'm not surprised to see v3 crush them. But that won't last.