Skip to main content
Go to side pane
PieFed
Home
Home
Popular
All posts
Topics
Browse by topic
All communities
Log in
Register
Donate
Home
Communities
LocalLLaMA@sh.itjust.works
LocalLLaMA
Create post
Hot
Top
New
Active
List
Tile
Wide tile
0
0
Beginner questions thread
by
noneabove1182
2023-10-02T15:30:18Z
2
0
0
Free Open-Source AI LLM Guide
by
Blaed
2023-07-27T00:07:53Z
0
14
0
Best Upgrade Path for my Desktop
by
projectmoon
2024-05-16T17:11:20Z
5
24
10
I'm I the only one blown away by AI?
by
Possibly linux
2024-05-14T05:10:05Z
16
27
1
Mozilla's Llamafile 0.8.2 Scores Big With New AVX2 Performance Optimizations
(
phoronix.com
)
by
ylai
2024-05-10T06:34:40Z
0
27
2
Llama 3 Establishes Meta as the Leader in “Open” AI
(
spectrum.ieee.org
)
by
ylai
2024-04-25T19:42:28Z
2
9
3
Localllama setup for $100k.
by
Timely_Jellyfish_2077
2024-04-20T22:24:54Z
12
13
2
Eric Hartford on X: "I am super excited to announce that I've accepted a position with @TensorWaveCloud - focused on training AI models with @AMDInstinct technologies!"
(
twitter.com
)
by
OpticalMoose
2024-04-20T13:49:49Z
0
21
13
Meta's Llama 3 will force OpenAI and other AI giants to up their game
(
itpro.com
)
by
ylai
2024-04-20T04:24:59Z
10
36
3
Meta releases Llama 3, claims it's among the best open models available
(
yahoo.com
)
by
veer66
2024-04-18T19:06:11Z
2
29
1
New Mistral model is out
(
twitter.com
)
by
The Hobbyist
2024-04-10T09:05:32Z
7
71
5
Meta confirms that its Llama 3 open source LLM is coming in the next month
(
techcrunch.com
)
by
ylai
2024-04-10T02:57:45Z
4
34
0
LLaMA Now Goes Faster on CPUs
(
justine.lol
)
by
ylai
2024-04-08T19:23:03Z
1
8
0
What's the current recommendation for an anime oriented model?
by
Toes♀
2024-04-05T22:34:29Z
0
7
1
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to ach
(
github.com
)
by
suoko
2024-04-04T15:40:39Z
0
14
0
Dock GPU to Laptop or to small SOC?
by
Pantherina
2024-03-31T23:44:29Z
5
5
0
Dock GPU to Laptop or to small SOC?
by
Pantherina
2024-03-31T23:44:29Z
2
25
7
AnythingLLM | The ultimate AI business intelligence tool
(
useanything.com
)
by
suoko
2024-03-31T07:35:31Z
9
9
1
Open web UI - a web UI primarily for ollama that has a bunch of useful functionally
(
github.com
)
by
Possibly linux
2024-03-28T05:19:39Z
1
0
0
Evolving New Foundation Models: Unleashing the Power of Automating Model Development
(
sakana.ai
)
by
ylai
2024-03-25T09:19:54Z
0
0
0
GaLore: Advancing Large Model Training on Consumer-grade Hardware
(
huggingface.co
)
by
ylai
2024-03-25T08:15:23Z
0
1
0
Mistral 7B v0.2 Base (released at SHACK15sf hackathon)
(
github.com
)
by
ylai
2024-03-25T06:51:54Z
0
4
0
Ollama now supports AMD graphics cards
(
ollama.com
)
by
turkishdelight
2024-03-16T18:25:59Z
0
0
0
T-Ragx - Enhancing Translation with RAG-Powered LLMs
(
github.com
)
by
rayliuca
2024-03-06T14:54:32Z
0
0
0
My personal collection of interesting models I've quantized from the past week (yes, just week)
(
twitter.com
)
by
noneabove1182
2024-02-29T19:38:55Z
0
0
0
[Paper] The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
(
huggingface.co
)
by
rufus
2024-02-28T14:09:59Z
0
0
0
Gemma 2B vs Phi-2
by
acec
2024-02-23T09:07:21Z
0
0
0
NVIDIA Chat With RTX
(
nvidia.com
)
by
OpticalMoose
2024-02-14T12:27:26Z
0
0
0
Meet ‘Smaug-72B’: The new king of open-source AI
(
venturebeat.com
)
by
ylai
2024-02-07T17:03:23Z
0
0
0
itsme2417/PolyMind: A multimodal, function calling powered LLM webui.
(
github.com
)
by
noneabove1182
2024-02-07T15:35:04Z
0
0
0
Introducing Nomic Embed: A Truly Open Embedding Model
(
blog.nomic.ai
)
by
noneabove1182
2024-02-07T14:06:28Z
0
0
0
Uncensored Mixtral 8x7B with 4 GB of VRAM
by
glibg10b
2024-02-04T14:14:08Z
0
0
0
Mistral CEO confirms ‘leak’ of new open source AI model nearing GPT-4 performance
(
venturebeat.com
)
by
ylai
2024-02-01T08:27:08Z
0
0
0
Meta releases ‘Code Llama 70B’, an open-source behemoth to rival private AI development
(
venturebeat.com
)
by
ylai
2024-01-29T21:25:05Z
0
1
0
A question about running LLMs with an AMD card
by
Gunpachi
2024-01-27T13:22:47Z
0
0
0
Noob here, what's the best overall model for getting started with ?
by
Gunpachi
2024-01-25T07:07:42Z
0
0
0
Zuckerberg wants to build artificial general intelligence with 350K Nvidia H100 GPUs
(
theregister.com
)
by
ylai
2024-01-20T20:06:08Z
0
0
0
InternLM2 models llama-fied
by
noneabove1182
2024-01-18T22:59:29Z
0
0
0
Building a fully local LLM voice assistant to control my smart home
(
johnthenerd.com
)
by
exu
2024-01-14T08:04:20Z
0
0
0
argilla released distilabeled-Hermes-2.5-Mistral-7B
(
huggingface.co
)
by
Reddy
2024-01-10T18:42:51Z
0
0
0
Mixtral of Experts
(
arxiv.org
)
by
ylai
2024-01-10T17:24:26Z
0
0
0
WizardLM/WizardCoder-33B-V1.1 released!
(
huggingface.co
)
by
noneabove1182
2024-01-04T15:42:44Z
0
0
0
Microsoft announces WaveCoder
(
twitter.com
)
by
noneabove1182
2023-12-26T04:13:30Z
0
0
0
How to make LLMs go fast
(
vgel.me
)
by
Alex
2023-12-22T21:49:56Z
0
0
0
GitHub - SJTU-IPADS/PowerInfer: High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
(
github.com
)
by
ylai
2023-12-20T16:18:58Z
0
0
0
2023, year of open LLMs
(
huggingface.co
)
by
ylai
2023-12-18T18:00:57Z
0
0
0
A Systems Programmer's Perspectives on Generative AI
(
bennee.com
)
by
Alex
2023-12-15T08:18:24Z
0
0
0
Training a model without a GPU
by
Matburnx
2023-12-14T16:31:24Z
0
0
0
Looking for a low-end setup
by
Rez
2023-12-13T15:35:29Z
0
0
0
Mistral shocks AI community as latest open source model eclipses GPT-3.5 performance
(
venturebeat.com
)
by
ylai
2023-12-11T23:59:59Z
0
0
0
Mixture of Experts Explained (Huggingface blog)
(
huggingface.co
)
by
noneabove1182
2023-12-11T23:01:12Z
0
0
0
Mistral releases version 0.2 of their 7B model
(
mistral.ai
)
by
noneabove1182
2023-12-11T22:59:35Z
0
0
0
QuIP#: SOTA 2 bit LLMs
(
github.com
)
by
Even_Adder
2023-12-09T17:06:05Z
0
0
0
ByteDance AI Promises Stronger than Gemini Open Weight GPT Dropping Soon
(
reddit.com
)
by
ylai
2023-12-08T19:35:51Z
0
0
0
Mistral drops a new magnet download
(
twitter.com
)
by
noneabove1182
2023-12-08T16:04:22Z
0
0
0
Inside the A.I. Arms Race That Changed Silicon Valley Forever
(
archive.ph
)
by
ylai
2023-12-06T21:52:40Z
0
0
0
LLMs made into single-file executables with llamafile
(
hacks.mozilla.org
)
by
meteokr
2023-12-03T02:09:41Z
0
0
0
Unsloth: 80% faster 50% less memory LLM finetuning
(
github.com
)
by
Even_Adder
2023-12-01T18:27:49Z
0
0
0
I'm having a fantastic time with this model.
(
huggingface.co
)
by
undermine
2023-11-28T20:33:31Z
0
0
0
Orca 2: Teaching Small Language Models How to Reason
(
microsoft.com
)
by
noneabove1182
2023-11-21T07:29:13Z
0
0
0
Hundreds of OpenAI employees threaten to resign and join Microsoft
(
theverge.com
)
by
noneabove1182
2023-11-20T15:32:03Z
0
0
0
Catch me if you can! How to beat GPT-4 with a 13B model | LMSYS Org
(
lmsys.org
)
by
noneabove1182
2023-11-15T18:58:44Z
0
0
0
TensorRT-LLM evaluation of the new H200 GPU achieves 11,819 tokens/s on Llama2-13B
(
github.com
)
by
noneabove1182
2023-11-14T00:21:56Z
0
0
0
...
by
CoderSupreme
2023-11-07T18:33:44Z
0
0
0
ExUI - a lightweight web UI for ExLlamaV2 by turboderp
(
github.com
)
by
noneabove1182
2023-11-06T16:51:37Z
0
0
0
...
by
CoderSupreme
2023-11-05T14:14:55Z
0