x_may · 2026-03-25T11:48:08 1774439288

It's KV cache compression, so it affects how much memory the model needs as its context grows. It does not affect the weight size.
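For a rough sense of scale, here is a back-of-the-envelope sketch. The model shape and the 4-bit cache width are assumptions picked for illustration, not figures from any particular model or compression scheme. The formula is the usual one for standard attention: one K and one V tensor per layer, each growing linearly with the number of tokens.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   bytes_per_value=2, batch_size=1):
    """Approximate KV cache size: one K and one V tensor per layer."""
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * bytes_per_value * batch_size)


# Hypothetical model shape, chosen only for illustration.
LAYERS, KV_HEADS, HEAD_DIM = 32, 8, 128

for tokens in (8_192, 131_072):
    # 16-bit cache entries (2 bytes per value).
    full = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, tokens)
    # Assumed 4-bit cache entries (0.5 bytes per value).
    small = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, tokens, 0.5)
    print(f"{tokens:>7} tokens: {full / 2**30:.2f} GiB -> {small / 2**30:.2f} GiB")
```

Nothing in the formula depends on the parameter count, which is why shrinking the cache leaves the weight size untouched.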

x_may · 2026-02-23T13:30:15 1771853415

Isn’t there also basically 0 American DRAM?

UltraSane · 2026-02-23T14:09:20 1771855760

Micron Technology, Inc. is an American semiconductor company that manufactures computer memory.

input_sh · 2026-02-23T14:16:58 1771856218

They don't produce them within the US. They're building some factories to do so in the future, but as of now their output is 0.

okanat · 2026-02-23T20:06:32 1771877192

However, the US government has / can have control over Micron's production. They are headquartered in the US. They have the intellectual property and know-how to erect a vertically integrated supply chain. Europe doesn't have this strategic investment.

Fnoord · 2026-02-25T01:10:23 1771981823

Micron/Crucial has bailed on the consumer market. Enterprise only now, FWIW.

x_may · 2026-02-16T14:52:11 1771253531

The 80/20 rule always wins

x_may · 2026-02-15T13:28:05 1771162085

Wait, sorry, how did you use and expose seeds? That’s the most interesting part of your post.

anonymous908213 · 2026-02-15T15:43:58 1771170238

We were not a ChatGPT wrapper; we used a finetuned open-source model running on our own hardware, so we naturally had full control of the input parameters. I apologize if my language was ambiguous, but by "expose seeds" I simply meant users can see the seed used for each prompt and input their own in the UI, rather than "exposing secrets" of the frontier LLM APIs, if that's what you took it to mean.

x_may · 2026-02-15T17:41:47 1771177307

I just wanted deterministic outputs and was curious how you were doing it. Sounds like probably temp = 0, which major providers no longer offer. Thanks for your response.

anonymous908213 · 2026-02-15T18:17:50 1771179470

No, seed and temperature are separate parameters accepted by the inference engine. You can still get deterministic outputs with high temp if you're using the same seed, provided the inference engine itself operates in a deterministic manner, and the hardware is deterministic (in testing, we did observe small non-deterministic variations when running the same prompt on the same stack but a different model of GPU).
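A toy sketch of that distinction, using only Python's standard library rather than any real inference engine. The logits, function names, and seed values are made up for illustration. Temperature reshapes the distribution; the seed fixes the random draws taken from it.

```python
import math
import random


def sample_token(logits, temperature, rng):
    """Draw one index from a temperature-scaled softmax over the logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


def generate(seed, temperature, steps=20):
    """Sample a short sequence; all randomness comes from the seeded RNG."""
    rng = random.Random(seed)
    logits = [2.0, 1.5, 1.0, 0.5, 0.0]  # hypothetical fixed logits
    return [sample_token(logits, temperature, rng) for _ in range(steps)]


first = generate(seed=1234, temperature=1.5)
second = generate(seed=1234, temperature=1.5)
other = generate(seed=4321, temperature=1.5)

print(first == second)  # same seed at high temperature: identical sequences
print(first == other)   # a different seed will usually give a different sequence
```

This only covers the sampling step. It says nothing about the hardware-level variation mentioned above, which happens while computing the logits themselves.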

x_may · 2026-02-02T03:21:23 1770002483

It might have been explicitly targeted, but they did say that there were older versions of Notepad++ with "insufficient update verification controls", so it might just be that only a subset of users was actually susceptible to this.

pavon · 2026-02-02T03:26:20 1770002780

No, the additional update verification was added after this attack was discovered. All Notepad++ installations were vulnerable during the time of the hijacking campaign.

x_may · 2026-01-18T14:09:48 1768745388

I believe what the parent comment was referring to is the advice not to praise character, but instead to praise hard work.

“You’re so smart” leaves room for failure when they encounter something that challenges their image of being smart. Praising the amount of effort they put in is not something that is taken away or challenged regardless of the outcome.

durandal1 · 2026-01-18T17:24:43 1768757083

One of my kids is particularly brilliant, and what I found is that the combination works best: "you are smart, therefore I have high expectations" AND "without doing the work, being smart doesn't matter". Together this creates a self-image of the capable doer.

x_may · 2025-07-06T16:50:43 1751820643

I think it’s also largely driven by the apparent cheapness of turning the CapEx of buying servers into the OpEx of renting cloud capacity. There's less up-front investment, and auditing/access controls for SOC 2 compliance are so much easier.

x_may · 2025-05-27T12:41:24 1748349684

Unfortunate name collision on that one

neutronicus · 2025-05-27T14:38:35 1748356715

I do donate to NVDA indirectly via the S&P500

x_may · 2025-05-22T19:37:33 1747942653

Obviously it's not at the scale of the top auto-regressive models yet, but there are some OSS models: https://github.com/dllm-reasoning/d1

x_may · 2025-04-18T11:41:51 1744976511

It may be that it was time for the hardware that was previously running Arxiv to be retired and this is just another Capex -> Opex decision being made by so many tech companies.

I'd like to know if GCP is covering part of the bill? Or will Cornell be paying all of it? The new architecture smells of "[GCP] will pay/credit all of these new services if you agree to let one of our architects work with you". If GCP is helping, stay tuned for a blog post from google some time around the completion of the migration with a title like "Reaffirming our commitment to science" or something similarly self affirming.

khuey · 2025-04-18T16:02:26 1744992146

> If GCP is helping, stay tuned for a blog post from google some time around the completion of the migration with a title like "Reaffirming our commitment to science" or something similarly self affirming.

"Google pays to run an enormous intellectual resource in exchange for a self-congratulatory blogpost" seems like a perfectly acceptable outcome for society here.

stonogo · 2025-04-18T21:14:07 1745010847

It wasn't when it happened to Usenet.

toomuchtodo · 2025-04-19T01:11:37 1745025097

Frequent backups to the Internet Archive for rehydration when needed. RIP Dejanews. Hopefully we’ve learned from past experience.

mistrial9 · 2025-04-19T02:35:45 1745030145

mirrors, please

yumraj · 2025-04-18T21:40:08 1745012408

> If GCP is helping, stay tuned for a blog post from google some time around the completion of the migration with a title like "Reaffirming our commitment to science" or something similarly self affirming.

This is an odd criticism. If a company is footing the bill, it can’t even talk about it to gain some publicity/good will?

nophunphil · 2025-04-19T03:29:43 1745033383

Footing the bill for how long?

helsinki · 2025-04-19T15:21:09 1745076069

How much is the bill for running Arxiv? $1000 - $3000/month? Yeah, I don't think Google deserves any recognition for footing that bill. Likely just another self-congratulatory bullshit move on behalf of big G.

flakiness · 2025-04-18T15:58:57 1744991937

https://info.arxiv.org/about/supporters.html

  Our Supporters
  ...
  Gold Sponsors
  Google, Inc (USA)

TZubiri · 2025-04-18T22:38:42 1745015922

> "Reaffirming our commitment to science" or something similarly self affirming.

While I understand that something is more genuine if done in secret, it doesn't stop being a real commitment to science just because you make a PR post about it.

If company X contributes to open source foundation Y, that's real and they get to claim clout. Nobody cares about a post anyway.