HyperTech News Report #0003 - Expanding Horizons

Blaed@lemmy.world · 10 months ago

HyperTech News Report #0003 - Expanding Horizons

Blaed@lemmy.world · 10 months ago

HyperTech News Report #0002 - A New Challenger Approaches!

Blaed@lemmy.world · 11 months ago

HyperTech News Report #0001 - Happy FOSAI Friday!

Blaed@lemmy.world · 11 months ago

HyperTech News Report #0001 - Happy FOSAI Friday!

Blaed@lemmy.world · 1 year ago

Beating GPT-4 on HumanEval with a Fine-Tuned CodeLlama-34B

Blaed@lemmy.world · 1 year ago

Cheetor - A New Multi-Modal LLM Strategy Empowered by Controllable Knowledge Re-Injection

Blaed@lemmy.world · 1 year ago

Incognito Pilot: The Next-Gen AI Code Interpreter for Sensitive Data

Blaed@lemmy.world · edit-2 1 year ago

I used to feel the same way until I found some very interesting performance results from 3B and 7B parameter models.

Granted, it wasn’t anything I’d deploy to production - but using the smaller models to prototype quick ideas is great before having to rent a gpu and spend time working with the bigger models.

Give a few models a try! You might be pleasantly surprised. There’s plenty to choose from too. You will get wildly different results depending on your use case and prompting approach.

Let us know if you end up finding one you like! I think it is only a matter of time before we’re running 40B+ parameters at home (casually).

Blaed@lemmy.world · 1 year ago

Vicuna v1.5 Has Been Released!

Blaed@lemmy.world · 1 year ago

Free Open-Source AI LLM Guide

Blaed@lemmy.world · 1 year ago

Free Open-Source AI LLM Guide

Blaed@lemmy.world · 1 year ago

Llama-2 FOSAI & LLM Roundup Series! (Summer 2023 Edition)

Blaed@lemmy.world · 1 year ago

Llama-2 FOSAI & LLM Roundup Series! (Summer 2023 Edition)

Blaed@lemmy.world · 1 year ago

Introducing Llama 2 - Meta's Next-Generation Commercially Viable Open-Source AI & LLM

Blaed@lemmy.world · edit-2 1 year ago

New AI/LLM Breakthrough - FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Blaed@lemmy.world · edit-2 1 year ago

New AI/LLM Breakthrough - FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Blaed@lemmy.world · edit-2 1 year ago

Petals - Run large language models at home, BitTorrent‑style

Blaed@lemmy.world · 1 year ago

Mark Zuckerberg & Meta to Release Commercial Version of its AI/LLM (LLaMA) In Effort to Catch Rivals

Blaed@lemmy.world · 1 year ago

New LLM OpenOrca-Preview1-13B Released!

Blaed@lemmy.world · 1 year ago

Thanks for sharing this!

Blaed@lemmy.world · 1 year ago

Good bot, I will do that next time.

Blaed@lemmy.world · edit-2 1 year ago

Come hangout with us at [email protected]

I run this show solo at the moment, but do my best to keep everyone informed. I have much more content on the horizon. Would love to have you if we have what you’re looking for.

FOSAI Posts:

Blaed@lemmy.world · 1 year ago

Introducing OpenLLaMA: An Open-Source Reproduction of Meta's LLaMA

Blaed@lemmy.world · 1 year ago

Microsoft Announces a New Breakthrough in AI: LongNet: Scaling LLM Transformers to 1,000,000,000 Tokens & Context Length

Blaed@lemmy.world · 1 year ago

For anyone unaware, this is probably one of the better short and sweet explanations in regards to what HuggingFace is.

It is a hub for many code repositories hosting AI specific files and configurations, which has become a core ecosystem of many artificial intelligence breakthroughs, platforms, and applications.

Blaed@lemmy.world · 1 year ago

🤗

Blaed@lemmy.world · 1 year ago

FWIW, it’s a new term I am trying to coin in FOSS communities (Free, Open-Source Software communities). It’s a spin off of ‘FOSS’, but for AI.

There’s literally nothing wrong with FOSS as an acronym, I just wanted to use one more focused in regards to AI tech to set the right expectations for everything shared in /c/FOSAI

I felt it was a term worth coining given the varied requirements and dependancies AI/LLMs tend to have compared to typical FOSS stacks. Making this differentiation is important in some of the semantics these conversations carry.

Blaed@lemmy.world · 1 year ago

Big brain moment.

Ironically, I think using this technology to do exactly that is one of its greatest strengths…

GL, HF!

Blaed@lemmy.world · 1 year ago

Lol, you had me in the first half not gonna lie. Well done, you almost fooled me!

Glad you had some fun! gpt4all is by far the easiest to get going with imo.

I suggest trying any of the GGML models if you haven’t already! They outperform almost every other model format at the moment.

If you’re looking for more models, TheBloke and KoboldAI are doing a ton for the community in this regard. Eric Hartford, too. Although TheBloke is typically the one who converts these into more accessible formats for the masses.

Blaed@lemmy.world · edit-2 1 year ago

Thank you! I appreciate the kind words. Please consider subscribing to /c/FOSAI if you want to stay in the loop with the latest and greatest news for AI.

This stuff is developing at breakneck speeds. Very excited to see what the landscape will look like by the end of this year.

Blaed@lemmy.world · 1 year ago

Absolutely! I’m having a blast launching /c/FOSAI over at Lemmy.world. I’ll do my best to consistently cross-post to everyone over here too!