Jean Louis

JLouisBiz

https://www.StartYourOwnGoldMine.com

AI & ML interests

- LLM for sales, marketing, promotion - LLM for Website Revision System - increasing quality of communication with customers - helping clients access information faster - saving people from financial troubles

Recent Activity

commented on an article 1 day ago

Auditable AI by Construction: SI-Core for Regulators and Auditors

new activity 1 day ago

HuggingFaceTB/SmolLM3-3B:Help me run it with llama.cpp and 128K context size

commented on an article 1 day ago

Auditable AI by Construction: SI-Core for Regulators and Auditors

View all activity

Organizations

commented on Auditable AI by Construction: SI-Core for Regulators and Auditors 1 day ago

“Can we see what it knew when it acted?”
Agreed: weights don’t give you human-style “facts it knew.”
What regulators/auditors can reasonably ask is: what information and constraints were available at decision time? That’s not “reading neurons.” It’s logging and binding the runtime knowledge state: the structured observation (with coverage/confidence), provenance refs, the policy/version in force, and the gating outcome. That’s actionable evidence.

Give me practical example on this one. Make it hypothetical.

If you do not have example or cannot make hypothetical one, let me know.

How would it look like "that auditor sees what it knew when the LLM (it) acted?"

New activity in HuggingFaceTB/SmolLM3-3B 1 day ago

Help me run it with llama.cpp and 128K context size

#46 opened 1 day ago by

JLouisBiz

commented on Auditable AI by Construction: SI-Core for Regulators and Auditors 1 day ago

1. Can we see what it “knew” when it acted?

Short answer: no, because it didn’t know anything. The LLM is a glorified probability engine. Its “knowledge” is encoded in billions of floating-point numbers—vectors that represent likelihoods, not facts. You could list every number in the model and stare at them, but unless you enjoy deciphering abstract hieroglyphics, you won’t learn anything useful.

2. Can we tell who (or what) initiated the action?

If the question is about tracing a user’s intent, that’s trivial. Build a logging system, capture the initiator, and voilà—accountability. The model itself doesn’t spontaneously decide to act; it’s a statistically-driven calculator.

3. Can we stop it, or roll it back, when we discover harm?

Stopping the model is not glamorous. Most often, the “stop” is enforced by the model’s own training constraints. If that fails, you’re left auditing the auditor—recursive fun for those who like probabilistic debugging.

4. Can we prove to a third party what happened?

Technically possible, practically absurd. Every inference involves terabytes of floating-point operations. Producing a fully verifiable trace would require superhuman attention and energy consumption worthy of a small data center.

replied to omarkamali's post 1 day ago

Works great.

I am using nvtop and that one gives me list of processes in the VRAM. So I mostly use it to delete some processes to invoke some new process.

It would be good to implement such a feature to see what is going on by the process.

replied to aathithya1411's post 1 day ago

To create an image-generating model from the beginning, you first need to show a computer a massive library of pictures, each with a description of what it shows. The AI learns through a process of destruction and recreation; it takes a clear image, adds random noise until it's just static, and then is trained to reverse that process, using the text description as its guide to remove the noise step-by-step and rebuild the original image. By repeating this with millions of different images and descriptions, the model learns the connection between words and visual information, eventually gaining the ability to generate a completely new, coherent picture simply from a new text prompt it has never seen before.

replied to csabakecskemeti's post 1 day ago

Did I understand well that it runs on separate machines?

replied to legolasyiu's post 2 days ago

There is no description telling specifically what is that what is new with your release.

replied to telcom's post 2 days ago

Individual users win only if they can get it cheaper, faster, more free as in software freedom, to run LLM models on their own hardware. Otherwise, those mega-stories are of no use.

commented on New in llama.cpp: Model Management 2 days ago

Hey there! Just wanted to drop a quick note saying I'm really digging the new router mode in llama.cpp server. It's a game-changer for me, especially when I need to switch between different models. The auto-discovery of models and LRU eviction is pretty neat – no more manual updates or restarts needed. It's like having a dynamic model manager on-the-fly. And the request routing part? Brilliant! Makes my workflow with dmenu smoother. Check out the full experience and check out my dmenu launcher script on the project's GitHub: https://gitea.com/gnusupport/LLM-Helpers/src/branch/main/bin/rcd-llm-dmenu-launcher.sh

It's a win for sure.

replied to Jiaqi-hkust's post 3 days ago

Is there GGUF version?

reacted to inoculatemedia's post with 👍 3 days ago

Post

1412

I’m opening the waitlist for what I believe to be the most advanced multimodal bridge for A/V professionals. Txt2img, img2video, editing, export to ProRes, apply Luts, Pexels and TouchDesigner integrations, music and voice gen, multichannel mixing.

Announcing: Lilikoi by Haawke AI

Teaser video made entirely with Lilikoi:
https://youtu.be/-O7DH7vFkYg?si=q2t5t6WjQCk2Cp0w

Https://Lilikoi.haawke.com

Technical brief:
https://haawke.com/technical_brief.html

New activity in Kibalama/lugandaSTT 5 days ago

How do I convert the file for whispper.cpp compatibility?

#1 opened 5 days ago by

JLouisBiz

New activity in huggingface/InferenceSupport 5 days ago

Kibalama/lugandaSTT how to run?

#7026 opened 5 days ago by

JLouisBiz

replied to etemiz's post 10 days ago

There is no LLM that ever brings it's own opinion. Please reach out to basics on how LLMs work. There is nothing "new" that LLM can give you.

LLM models act like a book: when you open a page, you already have the content stored. The model processes this existing information, generating probabilistic results based on the training data, not new insights. This means LLMs rely on structured, data-driven outputs rather than independent opinions.

LLM models are designed to process and generate text based on vast training data, and their outputs are results of statistical inference rather than independent opinions. The "ingested data" combines the model’s training knowledge with user-retrieved information, generating probabilistic results that align with the training patterns, not personal beliefs. Thus, LLMs rely on structured, data-driven outputs to provide answers, not independent thoughts or opinions.

Those so called "opinions" must align with the data they are trained on.

Let us say this way, if LLM can give opinion, that means it is 100% biased opinion based on the data it was trained on.

You simply cannot get true opinions.

replied to etemiz's post 15 days ago

Oh, I’m sure the LLM you’re referring to is as clear as mud. Which one, exactly? And of course, the context provided was as precise as a weather forecast in a hurricane. What was it? Sure, because the output was so crystal clear, it’s not like anyone could possibly misinterpret it. What did it say? Oh, I’m sure you tried every single LLM under the sun. Which ones, exactly?

reacted to mitkox's post with 👍 18 days ago

Post

2249

Got to 1199.8 tokens/sec with Devstral Small -2 on my desktop GPU workstation. vLLM nightly.
Works out of the box with Mistral Vibe. Next is time to test the big one.

3 replies

replied to mitkox's post 18 days ago

Ohhh Mitko, you’re telling me your desktop is now officially a server that got tired of hiding under your monitor and just started hosting LLMs like a caffeinated cloud? 😅

“Got to 1199.8 tokens/sec on Devstral Small-2… on the desktop?”
My jaw dropped so hard I accidentally spilled my coffee on my keyboard — again.
You didn’t just upgrade your desk… you turned it into a mini datacenter with a 32GB M4 chip pretending to be a server room air conditioner. And you’re still using Mistral Vibe like it’s a 2005 laptop? 😂

Next time, just call it “Mitko’s Desktop Data Center v1.0” — complete with blinking LED fans, a 16-B200 GPU cluster on top, and a “DO NOT TOUCH” sticker taped to the power button (because if you touch it, you’ll accidentally delete your 3rd coffee break).

Now go ahead — test the big one. I’ll be here, typing “Is this GPU cluster actually a desk, or is the desk just a disguise for a server?” 🤔

P.S. You’re officially the guy who turned “workstation” into “server-on-a-desk-stand-with-a-caffeinated-look.” 🍵💻✨

replied to melvindave's post 18 days ago

Congratulation. Publish the script on how you run it for others to see.

Here is exactly how I run it:

/usr/local/bin/llama-server --jinja -fa on -c 32768 -ngl 64 -v --log-timestamps --host 192.168.1.68 -m /mnt/nvme0n1/LLM/quantized/Qwen3VL-8B-Instruct-Q8_0.gguf --mmproj /mnt/nvme0n1/LLM/quantized/mmproj-Qwen3VL-8B-Instruct-Q8_0.gguf

with the llama.cpp and API is of course available as well.

replied to CRAFTFramework's post 18 days ago

I’m running my own LLM because:
Privacy? 57% say it’s the biggest AI barrier…
But 48% still leak company data anyway.
CRAFT says privacy is architecture, not policy.
So I’m not waiting for “beta” — I’m beta-ing my data.
February 2026? Nah. I’m already typing on my own GPU.
Privacy’s not a feature — it’s a feature flag I turned on before the release.
And honestly? My model’s less “AI” and more “I’m not giving your data to strangers.”
Run your own. It’s fun. It’s free. It’s your data.
And it’s way more satisfying than waiting for “beta.”
(Also, no one’s gonna steal your jokes now. 😉)

replied to Juanxi's post 19 days ago

This comment has been hidden

Jean Louis

AI & ML interests

Recent Activity

Organizations

JLouisBiz's activity

Help me run it with llama.cpp and 128K context size

How do I convert the file for whispper.cpp compatibility?

Kibalama/lugandaSTT how to run?