This story talks about MLX and Ollama but doesn't mention LM Studio - https://lmstudio.ai/
LM Studio can run both MLX and GGUF models, but does so from an Ollama-style (though more full-featured) macOS GUI. They also have a very actively maintained model catalog at https://lmstudio.ai/models
I suspect Ollama is at least partly moving away from open source as they look to raise capital; when they released their replacement desktop app, they did so as closed source. You're absolutely right that people should be using llama.cpp - not only is it truly open source, it's also significantly faster, has better model support and many more features, is better maintained, and its development community is far more active.
The only issue I've found with llama.cpp is getting it working with my AMD GPU. Ollama almost works out of the box, both in Docker and directly on my Linux box.
ik_llama is almost always faster when tuned. Untuned, though, I've found the two to be very similar in performance, with mixed results as to which comes out ahead.
But vLLM and SGLang tend to be faster than both of those.
LM Studio? No, it's the easiest way to run an LLM locally that I've seen, to the point where I've stopped looking at alternatives.
It's cross-platform (Win/Mac/Linux), detects the most appropriate GPU in your system, and tells you whether the model you want to download will run within its RAM footprint.
It lets you set up a local server that you can access through API calls as if you were remotely connected to an online service.
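For what it's worth, that local server speaks the OpenAI API, so the usual clients work against it. A minimal sketch with the openai Python package, assuming LM Studio's server is running on its default port (1234) and a model is already loaded (the model ID below is just a placeholder):

  # Point the standard openai client at LM Studio's local server.
  from openai import OpenAI

  client = OpenAI(
      base_url="http://localhost:1234/v1",  # LM Studio's default local endpoint
      api_key="not-needed",                 # any string; the local server ignores it
  )

  resp = client.chat.completions.create(
      model="local-model",  # placeholder: use whatever model ID LM Studio shows
      messages=[{"role": "user", "content": "Say hello in one sentence."}],
  )
  print(resp.choices[0].message.content)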
The tradeoff is a somewhat higher learning curve, since you need to manually browse the model library and choose the model/quantization that best fits your workflow and hardware. OTOH, it's also open source, unlike LM Studio, which is proprietary.
Just use llama.cpp. Ollama tried to force their custom API (not the OpenAI standard), they obscure the downloaded models in a way that makes them a pain to use with other implementations, they blatantly used llama.cpp as a thin wrapper without properly communicating it, and now they have to differentiate somehow to start making money.
If you've ever used a terminal, use llama.cpp. You can also run models directly from llama.cpp, afaik.
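If you'd rather script it than use the CLI, the llama-cpp-python bindings wrap the same engine. A rough sketch, assuming you've installed the bindings and have a GGUF file on disk (the path and parameters are placeholders; tune them for your hardware):

  # Load a local GGUF and run one chat turn via llama-cpp-python.
  from llama_cpp import Llama

  llm = Llama(
      model_path="./models/your-model.gguf",  # placeholder path to your GGUF
      n_ctx=4096,        # context window size
      n_gpu_layers=-1,   # offload as many layers as the backend supports
  )

  out = llm.create_chat_completion(
      messages=[{"role": "user", "content": "What does llama.cpp do?"}],
      max_tokens=128,
  )
  print(out["choices"][0]["message"]["content"])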
Yes, I've wanted to try it for a while, but setting up an environment with an MI50 was a bit tricky, so I started with something I knew first. Now that I have Ollama running I'll give llama.cpp a shot.
Ooh, I have experience with it. If you're on Linux, just use Vulkan. If you run into any other issues, google my username + "MI50 32GB vbios reddit". It depends on which vBIOS you have, but that Reddit post has most of the info you may need. Good luck!
Most LLM sites now offer free plans, and they're usually better than what you can run locally, so I think people running local models are doing it for privacy 99% of the time.