I realize that this is essentially a ridiculous question, but has anyone offered...

p_j_w · on Dec 6, 2023

>Like, I feel that GPT-4 (pre-turbo) was an extremely powerful model for almost anything I wanted help with. Whereas I feel like Bard is not great. So does this mean that my experience aligns with "HellaSwag"?

It doesn't mean that at all because Gemini Turbo isn't available in Bard yet.

carbocation · on Dec 7, 2023

I am not sure what Gemini Turbo is. Perhaps you meant Gemini Ultra? Because Gemini Pro (which is in this table) is currently accessible in Bard.

p_j_w · on Dec 7, 2023

Yes, that's what I meant.

tarruda · on Dec 6, 2023

I get what you mean, but what would such "qualitative evaluation" look like?

carbocation · on Dec 6, 2023

I think my ideal might be as simple as a few people who spend a lot of time with various models describing their experiences in separate blog posts.

tarruda · on Dec 6, 2023

I see.

I can't give any anecdotal evidence on ChatGPT/Gemini/Bard, but I've been running small LLMs locally over the past few months and have had an amazing experience with these two models:

- https://huggingface.co/mlabonne/NeuralHermes-2.5-Mistral-7B (general usage)

- https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-instr... (coding)

OpenChat 3.5 is also very good for general usage, but IMO NeuralHermes surpassed it significantly, so I switched a few days ago.

fasttransients · on Dec 12, 2023

Thank you for the suggestions – really helpful for my hobby project. Can't run anything bigger than 7B on my local setup, which is a fun constraint to play with.

carbocation · on Dec 6, 2023

Thanks! I’ve had a good experience with deepseek-coder:33b, so maybe they’re on to something.