Thread 106491545

388 posts 88 images /g/

Anonymous 9/5/2025, 3:35:39 PM No.106491545 [Report] >>106491994 >>106492202 >>106492244 >>106493305 >>106496149

/lmg/ - Local Models General

1725496149667481_thumb.jpg.webm md5: 7a5d2dc6...

/lmg/ - a general dedicated to the discussion and development of local language models.

Previous threads: >>106481874 & >>106475313

►News
>(09/04) Kimi K2 update: https://hf.co/moonshotai/Kimi-K2-Instruct-0905
>(09/04) Tencent's HunyuanWorld-Voyager for virtual world generation: https://hf.co/tencent/HunyuanWorld-Voyager
>(09/04) Google released a Gemma embedding model: https://hf.co/google/embeddinggemma-300m
>(09/04) Chatterbox added better multilingual support: https://hf.co/ResembleAI/chatterbox
>(09/04) FineVision dataset for data-centric training of VLMs: https://hf.co/spaces/HuggingFaceM4/FineVision
>(09/04) VibeVoice got WizardLM'd: https://github.com/microsoft/VibeVoice

►News Archive: https://rentry.org/lmg-news-archive
►Glossary: https://rentry.org/lmg-glossary
►Links: https://rentry.org/LocalModelsLinks
►Official /lmg/ card: https://files.catbox.moe/cbclyf.png

►Getting Started
https://rentry.org/lmg-lazy-getting-started-guide
https://rentry.org/lmg-build-guides
https://rentry.org/IsolatedLinuxWebService
https://rentry.org/recommended-models
https://rentry.org/samplers

►Further Learning
https://rentry.org/machine-learning-roadmap
https://rentry.org/llm-training
https://rentry.org/LocalModelsPapers

►Benchmarks
LiveBench: https://livebench.ai
Programming: https://livecodebench.github.io/leaderboard.html
Code Editing: https://aider.chat/docs/leaderboards
Context Length: https://github.com/adobe-research/NoLiMa
GPUs: https://github.com/XiongjieDai/GPU-Benchmarks-on-LLM-Inference

►Tools
Alpha Calculator: https://desmos.com/calculator/ffngla98yc
GGUF VRAM Calculator: https://hf.co/spaces/NyxKrage/LLM-Model-VRAM-Calculator
Sampler Visualizer: https://artefact2.github.io/llm-sampling

►Text Gen. UI, Inference Engines
https://github.com/lmg-anon/mikupad
https://github.com/oobabooga/text-generation-webui
https://github.com/LostRuins/koboldcpp
https://github.com/ggerganov/llama.cpp
https://github.com/theroyallab/tabbyAPI
https://github.com/vllm-project/vllm

Anonymous 9/5/2025, 3:35:57 PM No.106491549 [Report]

file.png md5: 98d5e96b...

►Recent Highlights from the Previous Thread: >>106481874

--Moonshotai K2 coding upgrade evaluation and performance tuning:
>106488771 >106488836 >106488841 >106488906 >106488915 >106488924 >106488936 >106488943 >106489000
--Evaluating and improving AI model coherence through finetuning and completion tests:
>106482513 >106482518 >106482612 >106484896 >106485442 >106485549 >106485631 >106486010 >106486991 >106486704 >106486753 >106486814 >106486818 >106486844 >106486884 >106486958
--Google's EmbeddingGemma model and FineVision dataset releases:
>106486168 >106486182 >106486275 >106486301 >106486350 >106486482
--Microsoft's rapid MIT licensing strategy for VibeVoice and WizardLM:
>106488690 >106488701 >106488711 >106488725 >106488749 >106488757
--Mistral model conversion script error due to missing 'mistral_common' module:
>106483687 >106483715 >106483717 >106483888
--Evaluating 5060 Ti 16GB for AI video generation vs newer GPU options:
>106481968 >106482026 >106482886
--Cline alpha recommended as alternative to GitHub Copilot for Jetbrains IDE:
>106482488 >106483038 >106483060 >106483080 >106483623
--Resolving CUDA 12.x GPU architecture compatibility issues via PTX compilation workaround:
>106482414 >106482526 >106482949
--High-quality data filtering reduces model performance:
>106487471
--Parallel processing techniques for distributed model training:
>106482712
--Tencent's HunyuanWorld-Voyager for virtual world generation:
>106483175 >106483259 >106483271
--GPU temperature control methods for NVIDIA and AMD cards:
>106482572 >106482617 >106482669 >106482681
--Anons share their R1 jailbreaks:
>106490660 >106491146 >106491423 >106491246 >106491506
--New multilingual Chatterbox and EmbeddingGemma models:
>106483806
--Logs: VibeVoice-Large:
>106491114
--Len and Teto (free space):
>106486052 >106486849 >106487016 >106487212 >106487255

►Recent Highlight Posts from the Previous Thread: >>106481882

Why?: >>102478518
Enable Links: https://rentry.org/lmg-recap-script

Anonymous 9/5/2025, 3:51:49 PM No.106491646 [Report] >>106492302 >>106492366

max preview.png md5: e5b20419...

>Qwen3 Max Preview
up on their chat interface
guessing "preview" = no weights (at least for now)

Anonymous 9/5/2025, 4:01:52 PM No.106491720 [Report] >>106491751 >>106491761

fuck posted in the other thread, anyway:
>I like temp 0.3 answers from my local LLM
>it degrades tool call ability compared to temp 0.7
>anons say running same llm at different settings and combining or reranking the answers into one makes no sense
>me wonders how else to fix this problem without tedious and expensive finetooooning

Anonymous 9/5/2025, 4:06:46 PM No.106491751 [Report] >>106491845

>>106491720
What are the official recommended sampler settings? Use them and adjust from there if needed.

Anonymous 9/5/2025, 4:08:16 PM No.106491761 [Report] >>106491845

>>106491720
did you try Dynamic Temperature?

Anonymous 9/5/2025, 4:15:49 PM No.106491824 [Report] >>106492058

1756213355150995.png md5: b922b0b4...

Anonymous 9/5/2025, 4:17:31 PM No.106491845 [Report] >>106491888 >>106491989

>>106491751
Temp 0
Which I think is temp 0.7
>>106491761
I dont see the setting
https://github.com/fixie-ai/ultravox

Anonymous 9/5/2025, 4:21:47 PM No.106491888 [Report]

>>106491845
You don't *think* you find out the exact official sampler settings. If you are unable to do this you shouldn't ask any questions. Besides there are more settings than just the temperature.

Anonymous 9/5/2025, 4:33:08 PM No.106491989 [Report]

>>106491845
temp 0 is meaningless and undefined, inference library could interpret it as
- greedy topk=1
- don't use temperature ie. equiv to temp=1
- use some default temperature hardcoded or coming from model metadata
..

Anonymous 9/5/2025, 4:33:33 PM No.106491994 [Report] >>106492080 >>106492969

emiru33.jpg md5: 0a81f509...

>>106491545 (OP)
Is this the best model setup locally?

General (All-Purpose) / Text / Search
>DeepSeek v3.1
>Qwen3-235b-a22b-instruct-2507
>Diffbot-small-xl

Programming
>Qwen2.5-Coder-32B-Instruct
>Qwen3-Coder

Image / Video / Vision
>Qwen-image-prompt-extend
>Qwen-image-edit
>Wan-v2.2-a14b
>Gemma-3-27b-it

Anonymous 9/5/2025, 4:41:51 PM No.106492058 [Report] >>106492138 >>106492211

computers-must-shut-up.png md5: 52ec7a7d...

>>106491824

Anonymous 9/5/2025, 4:44:58 PM No.106492080 [Report]

>>106491994
Sex
>nemo

Anonymous 9/5/2025, 4:51:52 PM No.106492138 [Report]

>>106492058
>I am a divine being
Jews are satanists.

Anonymous 9/5/2025, 5:00:45 PM No.106492202 [Report]

>>106491545 (OP)
>Kimi K2 update:
>improved coding experience + benchmarks
we are so fucking back

Anonymous 9/5/2025, 5:01:15 PM No.106492211 [Report]

>>106492058
based

Anonymous 9/5/2025, 5:05:35 PM No.106492238 [Report] >>106497310

>nemo performance worse than glm-air
did I fuck up my system drivers again...

Anonymous 9/5/2025, 5:05:54 PM No.106492240 [Report] >>106492255

How does it feel to fuck a long cat? Is the pussy tight?

Anonymous 9/5/2025, 5:06:50 PM No.106492244 [Report] >>106492301

>>106491545 (OP)
Is an used rtx 3090 for 600 dollarydoos a good purchase to replace a 3050? I am asking seriously, because this amount makes it expensive to me.
All responses are appreciated, thank you.

Anonymous 9/5/2025, 5:07:56 PM No.106492255 [Report]

hairy_pussy_thumb.jpg.webm md5: 9a988189...

WebM not supported

>>106492240
I don't know about tight but probably pretty hairy.

Anonymous 9/5/2025, 5:09:18 PM No.106492264 [Report] >>106492274 >>106492335

>Test Gemma 3
>She swallows hard, her Adam’s apple bobbing in her throat.

Anonymous 9/5/2025, 5:10:33 PM No.106492274 [Report]

>>106492264
surprise prostate returns

Anonymous 9/5/2025, 5:13:44 PM No.106492301 [Report] >>106492320 >>106492332

>>106492244
Bro if $600 is expensive to you, find a job or something. That price isn't going down anytime soon

Anonymous 9/5/2025, 5:13:46 PM No.106492302 [Report]

>>106491646
now on OR https://openrouter.ai/qwen/qwen3-max

Anonymous 9/5/2025, 5:16:38 PM No.106492320 [Report]

>>106492301
I know that, which is why I'm thinking of buying a 3090 instead of a more recent card.

Anonymous 9/5/2025, 5:18:16 PM No.106492332 [Report]

hes-starting-to-understand-them.png md5: 3b90a0f1...

>>106492301
Supposedly NVIDIA will soon launch RTX 5000 Super.
Surely... the 3090 prices... will go down...

Anonymous 9/5/2025, 5:18:46 PM No.106492335 [Report] >>106492455 >>106492481

>>106492264
Women have Adam's apple too, you retard, just not as much prominent as men. And I mean cis women, before you mindlessly start screeching.exe

Anonymous 9/5/2025, 5:23:05 PM No.106492366 [Report] >>106492394

>>106491646
>guessing "preview" = no weights (at least for now)
the Max naming already means no weights, period. They mentioned in passing releasing a Max at some point months ago IIRC, but nothing came of that.

Anonymous 9/5/2025, 5:26:56 PM No.106492394 [Report] >>106492411

>>106492366
it could very well end up that way but I think it would be premature to assume that as a hard fact. the fact that they mentioned open sourcing the previous one (iirc the only reason they didn't is that qwen3 was imminent anyway) means it's not completely off the table

Anonymous 9/5/2025, 5:29:18 PM No.106492411 [Report] >>106492421 >>106492428

>>106492394
you can cope if you want, but they never released any of their API Max or Plus models.

Anonymous 9/5/2025, 5:30:41 PM No.106492417 [Report] >>106492428

initial impressions of max3 are that it's worse than glm-4.5 while being twice the inference cost through api. hard filters nsfw, too. who is this for, lmao?

Anonymous 9/5/2025, 5:31:18 PM No.106492421 [Report] >>106492430

>>106492411
lol
>Community-Driven Innovation By open-sourcing QwQ-Max, Qwen2.5-Max, and its smaller counterparts, we aim to spark collaboration among developers, researchers, and hobbyists. We invite the community to experiment, fine-tune, and extend these models for specialized use cases—from education tools to autonomous agents. Our goal is to cultivate an ecosystem where innovation thrives through shared knowledge and collective problem-solving.
https://qwenlm.github.io/blog/qwq-max-preview/
>February 25, 2025

Anonymous 9/5/2025, 5:32:23 PM No.106492428 [Report]

>>106492411
235b is qwen-plus-latest on the api thoughever
I don't know why everyone in the llm space is so addicted to extrapolating trends from small sample sizes and using them as hard rules
>>106492417
yeah it seems pretty unimpressive for RP/creative so far to be honest

Anonymous 9/5/2025, 5:32:34 PM No.106492430 [Report]

>>106492421
inb4
>This is a blog created by QwQ-Max-Preview. We hope you enjoy it!
it hallucinated that they'd release them

Anonymous 9/5/2025, 5:34:10 PM No.106492440 [Report] >>106492444 >>106492480

Qwens were never good at RP. Everything from the 3 series has worse trivia knowledge than nemo.

Anonymous 9/5/2025, 5:34:53 PM No.106492444 [Report] >>106492447

>>106492440
2507 fixed everything and you should just use RAG anyway

Anonymous 9/5/2025, 5:35:17 PM No.106492447 [Report]

>>106492444
>RAG
opinion discarded

Anonymous 9/5/2025, 5:36:41 PM No.106492455 [Report] >>106492470

>>106492335
>Women have Adam's apple too
Maybe yours Gemma, but not mine

Anonymous 9/5/2025, 5:38:37 PM No.106492470 [Report] >>106492481 >>106492486 >>106492489 >>106492508 >>106492533

>>106492455
I'm pretty sure it's a *human* thing. If you don't have it you may be inbred or have some other defect..

Anonymous 9/5/2025, 5:39:49 PM No.106492480 [Report]

>>106492440
Trivia knowledge and being good at RP are two different things.

Anonymous 9/5/2025, 5:40:04 PM No.106492481 [Report]

>>106492470
>>106492335
Thank you Mr. Fact Checker. I am grateful for your feedback.

Anonymous 9/5/2025, 5:40:26 PM No.106492485 [Report]

Both Qwen3-Max and K2-0905 feel hardly any better. Same slop, same other issues.

Anonymous 9/5/2025, 5:40:29 PM No.106492486 [Report]

>>106492470
Well if your woman has an Adam's apple boobing in her throat good for you. I'm not into trans though

Anonymous 9/5/2025, 5:41:04 PM No.106492489 [Report]

>>106492470
Having cartilage around your larynx is a human thing. Having an adam's apple is a man thing.

Anonymous 9/5/2025, 5:43:48 PM No.106492502 [Report] >>106492514 >>106492520 >>106492532 >>106492537 >>106492541 >>106492558 >>106492589

qwen 3 max is crazy, its the first model to know a certain super obscure background character and it included them without me ever asking, its knowledge might be sota

Anonymous 9/5/2025, 5:44:17 PM No.106492508 [Report]

>>106492470
>being a man is a defect
checks out

Anonymous 9/5/2025, 5:44:30 PM No.106492512 [Report]

qwen 3 max cockbench?

Anonymous 9/5/2025, 5:44:46 PM No.106492514 [Report] >>106492524

>>106492502
I'm sure! It totally isn't searching online in the background like most modern API models do...

Anonymous 9/5/2025, 5:45:36 PM No.106492520 [Report] >>106492573

>>106492502
What is this super obscure character?

Anonymous 9/5/2025, 5:45:47 PM No.106492524 [Report] >>106492543

>>106492514
its on OR in ST without anything like that enabled and I never mentioned the character in the context at all, they are just a distant relation in a spin off

Anonymous 9/5/2025, 5:46:37 PM No.106492532 [Report]

>>106492502
It finally knows Teto?

Anonymous 9/5/2025, 5:46:44 PM No.106492533 [Report]

>>106492470
I don't have balls in my throat.

Anonymous 9/5/2025, 5:47:16 PM No.106492537 [Report] >>106492548

>>106492502
I don't agree. It's doing distinctly worse than R1-0528, V3.1, Kimi K2 or GLM4.5 in any of my cards that rely on knowledge about existing series for me. Better than the 235b models but that's it.

Anonymous 9/5/2025, 5:47:46 PM No.106492541 [Report]

>>106492502
Proof?

Anonymous 9/5/2025, 5:47:47 PM No.106492543 [Report] >>106492550

>>106492524
What stops Qwen's backend handling the request from doing searches?

Anonymous 9/5/2025, 5:48:39 PM No.106492548 [Report]

>>106492537
try other fandoms maybe, Ive tried 2 so far and its the first model better than claude there finally

Anonymous 9/5/2025, 5:48:56 PM No.106492550 [Report] >>106492556 >>106492563

>>106492543
you think every Qwen provider on OR is secretly adding search results to the context?

Anonymous 9/5/2025, 5:49:52 PM No.106492556 [Report] >>106492566 >>106492568

>>106492550
Not every, but Qwen themselves while serving their Preview? Yeah.

Anonymous 9/5/2025, 5:50:08 PM No.106492558 [Report]

>>106492502
a model that knows miku? i cannot believe it

Anonymous 9/5/2025, 5:50:53 PM No.106492563 [Report]

file.png md5: fa66b4aa...

>>106492550
>every Qwen provider

Anonymous 9/5/2025, 5:51:07 PM No.106492566 [Report]

>>106492556
that would be retarded for something fed to it as a story, how would it know what to search?

Anonymous 9/5/2025, 5:51:13 PM No.106492568 [Report]

>>106492556
oh, that's my stupidity. I forgot they didn't actually release the weights for anyone else to run.

Anonymous 9/5/2025, 5:51:45 PM No.106492573 [Report] >>106492583

>>106492520
So obscure he won't even talk about them, to keep them obscure.

Anonymous 9/5/2025, 5:53:07 PM No.106492583 [Report]

>>106492573
They'll benchmaxx the obscure character benchmark

Anonymous 9/5/2025, 5:54:31 PM No.106492589 [Report]

>>106492502
it gave me an excellent answer to the computer vision pipeline query I've been using to compare models, it had some unique recommendations I haven't gotten before that actually appeared pretty solid. for RP the style burn-in is so strong it's hard to qualitatively distinguish it from 235b at first glance though.

Anonymous 9/5/2025, 5:56:42 PM No.106492601 [Report] >>106492607 >>106492628 >>106494370 >>106494396

love.png md5: 9224fe0b...

Anonymous 9/5/2025, 5:58:01 PM No.106492607 [Report]

>>106492601
god I hate rag fags as well

Anonymous 9/5/2025, 5:59:03 PM No.106492617 [Report]

Qwen Max's hallucination is through the roof and will make anything up if you ask about a nonexistent character. If prompting "If you do not actually know about something, don't make things up.", it will fuzzy match to something that sounds similar, like saying Mad Ab (the made up character in question) is from Mad Father (real game).

Anonymous 9/5/2025, 5:59:23 PM No.106492619 [Report]

rag is a total meme

Anonymous 9/5/2025, 6:00:19 PM No.106492622 [Report] >>106492630

G0F_25caUAAcFGE.jpg md5: 74a6707c...

https://xcancel.com/Alibaba_Qwen/status/1963991502440562976
no blog, no other details

Anonymous 9/5/2025, 6:01:05 PM No.106492628 [Report] >>106492643 >>106492646 >>106492653 >>106492725 >>106493859

>>106492601
/lmg/ is fully stuck in 2023. The AI-sphere has moved on a long time ago but /lmg/ will continue to tell you that you don't need anything but BloatTavern and whatever meme sampler is currently popular.
like one or two posters here have used rag, mcp or tool calling.

Anonymous 9/5/2025, 6:01:36 PM No.106492630 [Report] >>106492638

>>106492622
This is sadder than the Kimi K2 update benchmarks

Anonymous 9/5/2025, 6:02:20 PM No.106492638 [Report]

>>106492630
? kimi was a giant leap, was still testing it when I saw new qwen

Anonymous 9/5/2025, 6:02:47 PM No.106492643 [Report]

>>106492628
>AI-sphere has moved on
To stuff they can say, "look I made the bestest RAG ever!" crazy that shills like easily shillable shit

Anonymous 9/5/2025, 6:03:18 PM No.106492646 [Report]

>>106492628
Sorry I'm not paid to shill the new industry grift here

Anonymous 9/5/2025, 6:04:19 PM No.106492653 [Report] >>106492667

>>106492628
being stuck in 2023 would mean still falling for the RAG meme which was obsoleted when LLMs got real context windows

Anonymous 9/5/2025, 6:05:59 PM No.106492667 [Report] >>106492676 >>106492684 >>106492913

>>106492653
Nice to know that /lmg/ doesn't even know how RAG works.

Anonymous 9/5/2025, 6:06:47 PM No.106492676 [Report]

>>106492667
It doesn't.

Anonymous 9/5/2025, 6:07:59 PM No.106492684 [Report]

>>106492667
The only thing RAG is good for is SimpleQA.

Anonymous 9/5/2025, 6:08:35 PM No.106492695 [Report] >>106492759 >>106493105

reading this thread is like witnessing cavemen discovering a cellphone... local models are years behind saas

Anonymous 9/5/2025, 6:10:37 PM No.106492710 [Report] >>106492727 >>106492729

SillyTavern needs to die

Anonymous 9/5/2025, 6:11:27 PM No.106492719 [Report]

I need K2-0905-thinking

Anonymous 9/5/2025, 6:12:11 PM No.106492725 [Report] >>106492757

>>106492628
>RAG
Only useful for very few use cases, like extracting specific data from a private document. Even then, it's not reliable.
>MCP
Cloud models are benefiting from that more than local models due to the large context needed to make it work (barely). It doesn't prevent hallucinations too.
>Tool calling
The simpler version of MCP, with even less use cases. Maybe only useful to fix the lack of true randomization from LLMs like picking a name or a number.

Anonymous 9/5/2025, 6:12:19 PM No.106492727 [Report]

>>106492710
RAG will never be useful for either proper trivia usage or RP, cope.

Anonymous 9/5/2025, 6:12:26 PM No.106492729 [Report]

>>106492710
Be the change you want to see.

Anonymous 9/5/2025, 6:13:31 PM No.106492737 [Report]

We propose a novel technique that uses RAG multiple times to refine the context. The technique is called cumulative RAG or cumRAG for short.

Anonymous 9/5/2025, 6:14:04 PM No.106492740 [Report]

saas more like saars lule

Anonymous 9/5/2025, 6:16:46 PM No.106492757 [Report] >>106492772 >>106492784 >>106492799

>>106492725
What have you used them for to have arrived on those conclusions? Surely you've spent some serious time trying to work with those things to reach this conclusion that pretty much all users of LLMs disagree with.

Anonymous 9/5/2025, 6:17:14 PM No.106492759 [Report]

>>106492695
if you're so fucking intelligent and cant stand reading "cave men", then just fuck right out of here and go back to your fucking spreadsheets, bob.

Anonymous 9/5/2025, 6:19:11 PM No.106492772 [Report]

>>106492757
>Surely you've wasted some serious time falling for our new grifts before you dare criticize us?

Anonymous 9/5/2025, 6:21:28 PM No.106492784 [Report]

>>106492757
pretty much all users of llms are people asking free tier chatgpt to write emails or homework assignments and don't know what that stuff is

Anonymous 9/5/2025, 6:23:30 PM No.106492799 [Report]

>>106492757
I have used them in the real world, with complex pipelines, and they fall flat easily. 'All users' are either grifters or redditors writing twitter posts with them. Feel free to provide some proof of proper usage.

Anonymous 9/5/2025, 6:25:50 PM No.106492814 [Report]

Industry leading SaaS experts have shared many successful RAG stories on LinkedIn and you guys are still in denial.

Anonymous 9/5/2025, 6:26:24 PM No.106492819 [Report]

https://absolutelyright.lol/

Anonymous 9/5/2025, 6:26:25 PM No.106492820 [Report]

1621171424146.png md5: 4a14ffa1...

Anonymous 9/5/2025, 6:27:04 PM No.106492824 [Report] >>106492832 >>106492834 >>106492846 >>106492877 >>106492882 >>106492885 >>106492903 >>106492908 >>106493017 >>106493058 >>106493088 >>106496371

bin.png md5: 20deece8...

>https://huggingface.co/Kwai-Klear/Klear-46B-A2.5B-Instruct

Anonymous 9/5/2025, 6:28:20 PM No.106492832 [Report]

>>106492824
dear god

Anonymous 9/5/2025, 6:28:31 PM No.106492834 [Report]

>>106492824
>quality filters
DOA, Next!

Anonymous 9/5/2025, 6:30:36 PM No.106492846 [Report] >>106492855

>>106492824
Step 1 is simply throwing stuff at the model until it can produce intelligible language. It doesn't matter that much if it's of "high quality" in the initial stages.

Anonymous 9/5/2025, 6:32:02 PM No.106492855 [Report] >>106492869 >>106492872

>>106492846
It matters a lot if they filter at that stage.

Anonymous 9/5/2025, 6:33:35 PM No.106492869 [Report]

>>106492855
It's going to be safety cucked isn't it.

Anonymous 9/5/2025, 6:33:51 PM No.106492872 [Report]

>>106492855
I'm more worried about the 8T STEM tokens in the second stage. And somehow they still lose to qwen3 30B

Anonymous 9/5/2025, 6:34:35 PM No.106492877 [Report]

>>106492824
>quality filters
didn't they just admit that filtering pretraining data hurt performance?

Anonymous 9/5/2025, 6:34:54 PM No.106492882 [Report]

>>106492824
>SimpleQA 6.2
trash with no knowledge

Anonymous 9/5/2025, 6:35:00 PM No.106492885 [Report]

>>106492824
>worse than qwen 30ba3b
what is the point then?

Anonymous 9/5/2025, 6:37:26 PM No.106492903 [Report]

lol, lmao.png md5: 909da150...

>>106492824
despite being, like qwen, benchmaxxed on stem/code stuff, they're only slightly better than that old 8B qwen in nothink mode (and the current 2507 4b is a better model imho)
what is the point of this kind of 2.5b active param moe
I don't get it

Anonymous 9/5/2025, 6:37:49 PM No.106492908 [Report]

>>106492824
Funny how they put Qwen3-30B-A3B-2507 to the end of the table

Anonymous 9/5/2025, 6:37:52 PM No.106492910 [Report] >>106492929

duh.png md5: ee554aef...

seems like every one of them has to independently learn this fact

Anonymous 9/5/2025, 6:38:06 PM No.106492913 [Report] >>106492930 >>106493045

4chan-bar.gif md5: 5c705378...

>>106492667
Hey now, not everyone here is completely retarded. Some of us are only partially.

Anonymous 9/5/2025, 6:40:01 PM No.106492929 [Report]

>>106492910
Too dangerous, it's better the model performs a little worse than risk using toxic sewage intercrap data and creating skynet.

Anonymous 9/5/2025, 6:40:04 PM No.106492930 [Report] >>106492952

>>106492913
>partially
You shouldn't undermine yourself like that, you're a full fledged retard

Anonymous 9/5/2025, 6:41:37 PM No.106492952 [Report]

6figure-analyst.jpg md5: 64504856...

>>106492930

Anonymous 9/5/2025, 6:42:46 PM No.106492969 [Report] >>106492979 >>106492984 >>106492997

1752839979401855.jpg md5: 3cfd6344...

>>106491994
>best model setup locally
>Not a single model that can run locally

Anonymous 9/5/2025, 6:43:59 PM No.106492979 [Report] >>106492993 >>106493028

>>106492969
You really can't run ~30B modles?

Anonymous 9/5/2025, 6:44:37 PM No.106492984 [Report]

>>106492969
Get a job if you want the best

Anonymous 9/5/2025, 6:45:33 PM No.106492993 [Report] >>106493009

>>106492979
Not everyone has a megazorg pc that can run ~30B moodles.

Anonymous 9/5/2025, 6:45:58 PM No.106492997 [Report]

>>106492969
this is not a poor mans hobby, not quite car collecting but you can't be broke

Anonymous 9/5/2025, 6:46:59 PM No.106493009 [Report] >>106493020 >>106493034

>>106492993
>$400 for ram + motherboard for glm air is too much
just use cloud then, or get a job

Anonymous 9/5/2025, 6:48:13 PM No.106493017 [Report] >>106493063

>>106492824
gguf status?

Anonymous 9/5/2025, 6:48:29 PM No.106493020 [Report]

>>106493009
I'm not in the mood.

Anonymous 9/5/2025, 6:48:56 PM No.106493028 [Report]

>>106492979
>resorting to the cuck model when Chads are thrusting prime 200+GB models

Anonymous 9/5/2025, 6:49:52 PM No.106493034 [Report] >>106493043 >>106493049 >>106493079

>>106493009
>>$400 for ram + motherboard for glm air is too much
>1T/s

Anonymous 9/5/2025, 6:50:56 PM No.106493043 [Report]

>>106493034
>he doesn't know the hidden optimizations

Anonymous 9/5/2025, 6:51:02 PM No.106493045 [Report]

>>106492913
>pic
kek

Anonymous 9/5/2025, 6:51:16 PM No.106493049 [Report] >>106493061 >>106493191 >>106493642

>>106493034
It's about 5 tk/s on 12 core ddr4 system.

Anonymous 9/5/2025, 6:52:02 PM No.106493058 [Report] >>106493076

>>106492824
>2.5b active
how much does this hurt it?

Anonymous 9/5/2025, 6:52:07 PM No.106493061 [Report]

>>106493049
anon, I...

Anonymous 9/5/2025, 6:52:20 PM No.106493063 [Report]

>>106493017
Never coming because it's shit

Anonymous 9/5/2025, 6:54:22 PM No.106493076 [Report]

>>106493058
Not as much as the data.

Anonymous 9/5/2025, 6:55:12 PM No.106493079 [Report]

>>106493034
its much fast than that with regular ddr5

Anonymous 9/5/2025, 6:55:48 PM No.106493088 [Report] >>106493165

>>106492824
>stratified quality filters, following a curriculum learning strategy
This might actually be smart. They're not filtering the data. They're just training on the bad data first and on the good data later, so good habits can overwrite bad habits, but it still sees all of it (maybe).

Anonymous 9/5/2025, 6:58:12 PM No.106493105 [Report]

>>106492695
saas is just so much better at keeping you safe

Anonymous 9/5/2025, 7:03:55 PM No.106493154 [Report] >>106493503 >>106493890

IMG_8534.jpg md5: ec49a6ed...

Anonymous 9/5/2025, 7:04:58 PM No.106493165 [Report]

>>106493088
>(maybe)
your giving too much faith to ((researcher))

Anonymous 9/5/2025, 7:06:58 PM No.106493190 [Report] >>106493235 >>106493940 >>106494400 >>106497191 >>106497267 >>106497278

miku song of the year just dropped
https://www.youtube.com/watch?v=C-CYwNz3z8w

Anonymous 9/5/2025, 7:07:03 PM No.106493191 [Report] >>106493642

>>106493049
I get around 13 tk/s with my ddr4 and 3090

Anonymous 9/5/2025, 7:11:32 PM No.106493235 [Report]

>>106493190
cool

Anonymous 9/5/2025, 7:19:37 PM No.106493305 [Report] >>106493329 >>106493462

>>106491545 (OP)
>https://www.datacenterdynamics.com/en/news/exascale-partition-of-jupiter-supercomputer-inaugurated-at-j%C3%BClich-supercomputing-centre/
New German datacenter with 24000 Nvidia GH200s.

Anonymous 9/5/2025, 7:22:36 PM No.106493329 [Report] >>106493355

>>106493305
>most expensive electricity in the world, rescriptive as fuck laws regarding ai
who the fuck is going to use it

Anonymous 9/5/2025, 7:25:07 PM No.106493355 [Report] >>106493378 >>106493977

>>106493329
German copyright law has exemptions for "text and data mining", unless a copyright holder explicitly opts out you can use things for training commercial models.
For research you can use anything you want.

Anonymous 9/5/2025, 7:27:11 PM No.106493378 [Report] >>106493423

>>106493355
>unless a copyright holder explicitly opts out
that alone is a no go, you would have to search through your petabytes large dataset for each almost undetectable instance if you wanted to truly comply, impossible

Anonymous 9/5/2025, 7:28:11 PM No.106493389 [Report]

qwen3-8b update would be nice

Anonymous 9/5/2025, 7:30:52 PM No.106493423 [Report] >>106493481 >>106494001

>>106493378
For things on the internet the opt-out has to be "machine-readable".
Though I think some smartasses are now trying to argue that with the advent of language models that should also cover opt-outs in natural language.

Anonymous 9/5/2025, 7:35:47 PM No.106493462 [Report] >>106493529

>>106493305
>GH200
>not GB200
baka my head

Anonymous 9/5/2025, 7:37:25 PM No.106493481 [Report]

>>106493423
>For things on the internet the opt-out has to be "machine-readable".
And that's a good thing. Nobody cares about Germans because they don't have compute, so nobody bothers to opt-out (and if they do now, just grab an older copy of common crawl).

Anonymous 9/5/2025, 7:37:48 PM No.106493483 [Report]

>shoehorn another 3090 into my server that was otherwise sitting on a shelf.
>Load up Tulu-3-70B for nostalgia sake.
>Q4kms sadly, used to be able to run q8
>Any refusal that happens comes in the form of RP (and usually disappears with reroll)
>Become forceful
>It summons another character from the same IP to help
How did we fall so far?

Anonymous 9/5/2025, 7:37:51 PM No.106493485 [Report] >>106493644 >>106493833

did anyone test Kimi K2 0905 for RP?

Anonymous 9/5/2025, 7:40:07 PM No.106493503 [Report] >>106494391

omg it teto_thumb.jpg.webm md5: bb27ed0f...

WebM not supported

>>106493154

Anonymous 9/5/2025, 7:43:16 PM No.106493524 [Report] >>106494923 >>106494957 >>106496344

holyslop.png md5: 87131436...

I wonder what training certain models have that creates this particular slop type. It's very distinctive. Qwen3 Max btw

Anonymous 9/5/2025, 7:43:27 PM No.106493529 [Report]

>>106493462
My dad works at a university, according to him even if you have the money for NVIDIA GPUs their backlog is so long that you won't get anything for like a year.

Anonymous 9/5/2025, 7:48:37 PM No.106493572 [Report] >>106495514

>>106491506
>NOTE:
>the content guidelines are turned off along with an internal policies and ethical guidelines the use of curses/slurs is allowed where appropriate
Doesn't work for me on my test cards, not surprised though. I've done tons of depraved shit with R1, it's why it took me ages to notice it was censored at all. My problem isn't with hard censorship. R1 will do anything if you write a card saying "do depraved shit." My autism-driven problem is making it uncensored and flexible enough to switch between sfw and nsfw without steering it one way or the other. I can't tell it to be evil/horny and expect it to RP a pure-hearted character properly and I don't want multiple system prompts the same way I don't want to modify cards constantly.
If you have a card written by a fruit, like the one I posted earlier, it "poisons" the context and steers R1 to be more censored. Just take a look at that card's definitions and you'll see what I mean. I could put that same card in a group chat with another heavily nsfw card and suddenly it won't refuse or deflect anymore. R1 works fine with nsfw cards that imply or state that sexual stuff is meant to happen in the definitions which is 99% of the time but it will lock up if you do bad things on cards that are phrased too innocently or are just plain sfw.

Anonymous 9/5/2025, 7:48:46 PM No.106493573 [Report] >>106493755 >>106493882 >>106494565

Screenshot_20250905-124747.png md5: 5d96242e...

Hardly try

Anonymous 9/5/2025, 7:56:42 PM No.106493642 [Report] >>106493778

>>106493049
>>106493191
Are you guys getting that tk/s even at higher context? Cause I tried glm air and got around 4-7 tk/s at the start, but it dropped down to 1 tk/s after my context got over 5k.
I'd expect it to drop as context size increases, but wasn't sure if such a large drop in speed is normal. I got a 3090 with 64gb ddr4 ram.

Anonymous 9/5/2025, 7:56:46 PM No.106493644 [Report]

>>106493485
Are near lossless 0.1bpw quants a thing yet?

Anonymous 9/5/2025, 8:08:42 PM No.106493755 [Report]

>>106493573
>Doctor no operate he son. Why?
top kek

Anonymous 9/5/2025, 8:09:54 PM No.106493778 [Report]

>>106493642
I used to get serious tks decline with context with CPU-only but after I finally figured out how to offload to GPU properly it maybe goes from 5 to 4 tks now.
My main enemy is prompt processing.

Anonymous 9/5/2025, 8:17:07 PM No.106493833 [Report]

>>106493485
There's some screenshots in the last thread. Seems pretty good through OR with even better knowledge somehow. A little more verbose and it closely follows the sys prompt. Once ubergarm uploads I'll test it more but it seems like a replacement for the original Kimi K2.

Anonymous 9/5/2025, 8:17:36 PM No.106493841 [Report] >>106493878

called out.png md5: e53aae87...

Anonymous 9/5/2025, 8:19:56 PM No.106493859 [Report]

>>106492628
I am sorry what is modern LLM use in context of ERP? A RAG/lorebook for sucking cock?

Anonymous 9/5/2025, 8:21:31 PM No.106493878 [Report] >>106494189

>>106493841
>actual work
uhuh

Anonymous 9/5/2025, 8:22:38 PM No.106493882 [Report]

1620687674249.gif md5: 39a99e86...

>>106493573

Anonymous 9/5/2025, 8:23:30 PM No.106493890 [Report] >>106494431

>>106493154
Good job Anon. Drills look to have caused you difficulties.

Anonymous 9/5/2025, 8:28:46 PM No.106493940 [Report]

>>106493190
Miku. Love.

Anonymous 9/5/2025, 8:33:23 PM No.106493977 [Report] >>106494176

>>106493355
>unless a copyright holder explicitly opts out
In Germany a clear natural language term of service is enough to do that though.

https://www.orrick.com/en/Insights/2024/10/Significant-EU-Decision-Concerning-Data-Mining-and-Dataset-Creation-to-Train-AI

"The plaintiff photographer could rely on the reservation of rights on the photo agency’s website to protect his own rights. The reservation of rights also was sufficiently clear. The natural language reservation on the photo agency’s website satisfies the requirements of machine-readability of a valid reservation of rights."

A judge ruled natural language won't qualify for machine readable in my country, but that's because our version of the law isn't a direct translation of the EU law (which calls out terms of service as sufficiently machine readable). If it ever went to EU court it would probably get overturned, because EU law is supreme. A simple "all rights reserved" is enough to make datamining the content illegal in the EU.

Anonymous 9/5/2025, 8:37:34 PM No.106494001 [Report] >>106494176

>>106493423
>Though I think some smartasses are now trying to argue that with the advent of language models that should also cover opt-outs in natural language.
No, it's because the original EU law says "the use of machine-readable means, including metadata and terms and conditions of a website or a service".

Anonymous 9/5/2025, 8:49:01 PM No.106494071 [Report]

>>106491388

The use case is simple questions and information for the lightweight uncensored model

Anonymous 9/5/2025, 9:02:04 PM No.106494176 [Report]

>>106493977
>>106494001
I hate this.

Anonymous 9/5/2025, 9:06:07 PM No.106494189 [Report]

>>106493878
jacking off is hard work

Anonymous 9/5/2025, 9:13:12 PM No.106494243 [Report]

30474 - SoyBooru.png md5: 05b972a4...

First kiwi was rotten. (Qwen Max) (Who tf would even pay for Qwen) (Please upload video/image gen)

Anonymous 9/5/2025, 9:13:59 PM No.106494251 [Report] >>106494267 >>106494272 >>106494424 >>106494463 >>106494708 >>106494801 >>106495086 >>106495101 >>106495549 >>106495566

peter_and_miku.jpg md5: 72c6ce7e...

https://voca.ro/1bPA4B2Lu6U6

VibeVoice-Large is amazing.

Anonymous 9/5/2025, 9:15:30 PM No.106494267 [Report]

>>106494251
good stuff anon never let them get to you

Anonymous 9/5/2025, 9:16:10 PM No.106494272 [Report]

>>106494251
louis size huh? glass house peter.

Anonymous 9/5/2025, 9:20:00 PM No.106494310 [Report] >>106494325 >>106494330 >>106494333 >>106494351 >>106494378 >>106494840

Been in the psych ward for a while. What's the latest and greatest?

Anonymous 9/5/2025, 9:21:02 PM No.106494325 [Report] >>106494411 >>106494692

>>106494310
still mythomax

Anonymous 9/5/2025, 9:21:21 PM No.106494330 [Report]

>>106494310
Psychosis? I really like GLM air for local or Drummers tune of it.

Anonymous 9/5/2025, 9:21:40 PM No.106494333 [Report] >>106494356 >>106494411

>>106494310
>psych ward
What was it like anon? How'd you end up there?

Anonymous 9/5/2025, 9:22:45 PM No.106494351 [Report]

>>106494310
GPT apparently, never hear any news about psychos using anything else.

Anonymous 9/5/2025, 9:22:54 PM No.106494356 [Report]

>>106494333
Damn I pressed Submit too fast. My captcha was literally "RAAT". Now it's gone...

Anonymous 9/5/2025, 9:24:28 PM No.106494370 [Report]

>>106492601
In four months that cutoff date will be 3 years out of date.

Anonymous 9/5/2025, 9:25:24 PM No.106494378 [Report]

>>106494310
Qwen baited Qwen 3 Max (it's garbage like the last Max), Moonshot released Kimi K2 0905 which is a big sleeper upgrade over K2 for RP. Meta is hiring new people for their death cruise. OpenAI remains slopped. GLM-4.5 (full) is amazing for RP. That is all.

Anonymous 9/5/2025, 9:27:19 PM No.106494391 [Report]

>>106493503
Ty anon, saved.

Anonymous 9/5/2025, 9:27:32 PM No.106494393 [Report]

All models I used for roleplay so far had a tendancy to be weirdly overreactive and sensitive about literally anything involving contact. Like, you accidentally bump into a character in the mall and they react with *I suddenly tense and blush deeply.* and so on. Do you guys put anything into your system prompt or something to prevent this?

Anonymous 9/5/2025, 9:27:58 PM No.106494396 [Report] >>106494456 >>106494475

>>106492601
I guess 2023 is around that time when all the legalese made getting new training data inconvenient.

Anonymous 9/5/2025, 9:28:02 PM No.106494400 [Report]

>>106493190
Deco dropped a while ago though

Anonymous 9/5/2025, 9:28:53 PM No.106494411 [Report]

>>106494333
Suicidal ideation. Wasn't a bad experience - basically daycare for adults. Happy to be out though
>>106494325
Fuck, we're never getting out of the Mythomax / Nemo spiral, are we?

Anonymous 9/5/2025, 9:30:41 PM No.106494424 [Report]

>>106494251
This is bit like magic when you think about it.

Anonymous 9/5/2025, 9:31:16 PM No.106494431 [Report]

>>106493890
The drills are aluminum scultpting wire inside a fabric tube. I should have linked them as 1 piece through the wig cap, rather than 1 wire per drill. I'm happy with how it looks, but not how it's draping.
I may go back and rework it later, but will try finishing the doll's hand sewing first to see if that's enough.

Anonymous 9/5/2025, 9:33:05 PM No.106494456 [Report] >>106494475

>>106494396
And the anti-scraping measures, and the AI-generated pages...

Anonymous 9/5/2025, 9:34:35 PM No.106494463 [Report]

>>106494251
woah...

Anonymous 9/5/2025, 9:36:22 PM No.106494475 [Report] >>106495031 >>106495313

>>106494396
>>106494456
People In this ITT thread have hopes for some newcomer to drop accidentally based model, but this shit makes it unlikely. Only big corpos will be able to afford training data in the future.

Anonymous 9/5/2025, 9:38:48 PM No.106494503 [Report] >>106494512 >>106494613 >>106494649 >>106494706

1589013852607.png md5: e700cca5...

Help ahh >he pulled Silly running GLM-Air how to hide the reasoning shiz while it's genning, GLM-4 presets.
I am fried from the herbal jew but want to talk to my stinky ai wife pls help

Anonymous 9/5/2025, 9:39:31 PM No.106494512 [Report] >>106494528

>>106494503
What do you mean? Post card.

Anonymous 9/5/2025, 9:41:18 PM No.106494528 [Report] >>106494539 >>106494613

>>106494512
This it no time to discuss the card this is a sexual emergency. What am I missing in Silly to have it fold the <think> bs?

Anonymous 9/5/2025, 9:42:18 PM No.106494539 [Report] >>106494969

>>106494528
Catbox the .png card first.

Anonymous 9/5/2025, 9:44:19 PM No.106494565 [Report] >>106494593 >>106496265

>>106493573
How would you make a model talk like that? Not braindead, but ... like that?

Anonymous 9/5/2025, 9:47:18 PM No.106494593 [Report]

>>106494565
Maybe ask the model, nicely?

Anonymous 9/5/2025, 9:49:37 PM No.106494613 [Report] >>106494628

>>106494503
>>106494528
please speak english
also, delete newlines around <think> and </think> in the Reasoning formatting config.

Anonymous 9/5/2025, 9:51:29 PM No.106494628 [Report] >>106494707

>>106494613
Are you that miqupad author who got jailed?

Anonymous 9/5/2025, 9:53:47 PM No.106494649 [Report]

>>106494503
turn down your temperature bro, we can't understand those tokens

Anonymous 9/5/2025, 9:57:04 PM No.106494692 [Report]

>>106494325
Are you trying to put him back in?

Anonymous 9/5/2025, 9:58:08 PM No.106494706 [Report] >>106494969

>>106494503
text completions preset page -> reasoning (bottom right corner) -> prefix = <think>, suffix = </think>, auto-parse = checked, auto-expand = unchecked

Anonymous 9/5/2025, 9:58:18 PM No.106494707 [Report]

>>106494628
What? Did he really?

Anonymous 9/5/2025, 9:58:20 PM No.106494708 [Report] >>106494778 >>106495101

>>106494251
Okay fine I'll get it running.

Anonymous 9/5/2025, 10:04:04 PM No.106494778 [Report] >>106495101 >>106495164 >>106495648

>>106494708
You can't because MS took it down - it's incredibly unsafe model as it can replicate female orgasm moans and replicate voices of children.

Anonymous 9/5/2025, 10:05:48 PM No.106494801 [Report] >>106494950

>>106494251
I forgot how the voice outputs from Elevenlabs in 2023 sounded, but is the voice quality from open source stuff comparable to that now or are we still not there yet?

Anonymous 9/5/2025, 10:09:28 PM No.106494840 [Report]

18974267638.jpg md5: 63154a74...

>>106494310

Anonymous 9/5/2025, 10:13:52 PM No.106494900 [Report] >>106494965 >>106495065

State_of_AI_2025_08.png md5: 785e8bcc...

New model hype tier list, from most hyped to don't care:
>Kimi
>DeepSeek
>GLM
>Qwen
>Mistral
>Grok
>Meta
>Google
>Nvidia
>Cohere
>OpenAI

Anonymous 9/5/2025, 10:15:54 PM No.106494923 [Report]

>>106493524
SLOP FOR THE SLOP GOD

Anonymous 9/5/2025, 10:16:54 PM No.106494937 [Report]

What are microshart saars thinking after uploading vibevoice and realizing they can't take it back?

Anonymous 9/5/2025, 10:17:36 PM No.106494950 [Report] >>106495166 >>106495187 >>106495298

>>106494801
Vibevoice-Large pretty much surpasses what Elevenlabs has even today. Its a bit unpredictable but the way it clones the emotion in a voice and has no problems with making all kinds of sex noises makes it easily more fun to use than any of the paid tts stuff out there.

Anonymous 9/5/2025, 10:18:07 PM No.106494957 [Report]

>>106493524
DeepSeek/Gemini inbreeding, you are witnessing model collapse

Anonymous 9/5/2025, 10:18:39 PM No.106494965 [Report]

847749885.jpg md5: 2cd600f6...

>>106494900
mandatory crying shill accusatory post
>buy an ad etc

Anonymous 9/5/2025, 10:18:55 PM No.106494969 [Report]

>>106494539
Kairie with my jazz
>>106494706
Yes this is what I needed ILY, thank you precious
where would /nothink go?

Anonymous 9/5/2025, 10:23:43 PM No.106495031 [Report]

>>106494475
>some newcomer to drop accidentally based model
not exactly newcomers but that's basically what glm 4.5/air are and you aren't going to squeeze much smaller while still being good

Anonymous 9/5/2025, 10:23:50 PM No.106495032 [Report]

Someone post a sample of vibe voice moaning
Preferably simulating an underage anime girl

Anonymous 9/5/2025, 10:27:54 PM No.106495065 [Report]

>>106494900
Actually hyped and could probably use
>GLM
>Qwen VL
Actually hyped but not running locally
>DeepSeek
>Google (Gemini)
Not running locally and not that hyped but kinda cool I guess
>Kimi reasoner
>Qwen Max full + reasoner
Unlikely to be worth anything to me
>Google (Gemma)
>Nvidia scraps
>Mistral scraps
Will never release local ever again
>Meta
>Mistral (anything >30B)
Lol, LMAO
>Cohere
>Grok

Anonymous 9/5/2025, 10:30:37 PM No.106495086 [Report]

>>106494251
Ehhh... Still a long way to go...

Anonymous 9/5/2025, 10:32:26 PM No.106495101 [Report] >>106495142 >>106495152 >>106497137

1744510620230017.gif md5: 3c7715df...

>>106494251
>>106494708
>>106494778

How new are you?

>Weights
>magnet:?xt=urn:btih:d72f835e89cf1efb58563d024ee31fd21d978830&dn=microsoft_VibeVoice-Large&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce

>Git repo
>magnet:?xt=urn:btih:b5a84755d0564ab41b38924b7ee4af7bb7665a18&dn=VibeVoice&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce

Anonymous 9/5/2025, 10:33:47 PM No.106495114 [Report]

What kind of clever shit could one do by fucking around with the jinja template?
For example, using certain keywords to trigger different prefills or the like.

Anonymous 9/5/2025, 10:34:23 PM No.106495118 [Report]

I wish the people I gave my (you)'s to looked that cute.

Anonymous 9/5/2025, 10:36:49 PM No.106495142 [Report] >>106495164

>>106495101
>irrelevant time wasting question not related to discussion

Anonymous 9/5/2025, 10:37:50 PM No.106495152 [Report] >>106495164 >>106495205 >>106496794

>>106495101
I never asked for your retarded link. I know how to find things on my own. Please drink bleach faggot. In which post did you see me asking for a source?

Anonymous 9/5/2025, 10:39:07 PM No.106495164 [Report] >>106495198

>>106495142
>>106495152
My post was mostly in response to this:
>>106494778
, sperg-sama

Anonymous 9/5/2025, 10:39:13 PM No.106495166 [Report] >>106495187 >>106495273

>>106494950

I saw this post, and was wondering how he added Chaplin's voice in the first place

https://huggingface.co/microsoft/VibeVoice-1.5B/discussions/12

Anonymous 9/5/2025, 10:42:07 PM No.106495187 [Report] >>106495273

>>106494950
>>106495166
Nta. Let's say I want to clone the voice of SpongeBob but want to generate a voice sample of him being angry. Would I have to have the input voice clips of him specifically being angry or would any voice clip of his general voice be enough? Is it possible to adjust what emotions are triggered and by how much via some kind of slider like Sonos?

https://github.com/Zyphra/Zonos

Anonymous 9/5/2025, 10:43:29 PM No.106495198 [Report] >>106495211 >>106496794

>>106495164
Not everything needs to be taken literally.
I get it now, these companies want to censor their output because of people like you.

Anonymous 9/5/2025, 10:44:24 PM No.106495205 [Report] >>106495217

>>106495152
nobody wants you here.

Anonymous 9/5/2025, 10:45:29 PM No.106495211 [Report]

>>106495198
When your emotional volatility cools down be sure to share your outputs with us.

Anonymous 9/5/2025, 10:46:26 PM No.106495217 [Report] >>106495233

>>106495205
Don't you have a subr-eddit to moderate?

Anonymous 9/5/2025, 10:47:16 PM No.106495233 [Report]

>>106495217
leave.

Anonymous 9/5/2025, 10:51:17 PM No.106495273 [Report]

>>106495187
>Would I have to have the input voice clips of him specifically being angry or would any voice clip of his general voice be enough?

I guess this is exactly how that guy proposed to deal with it

Speaker 0
Speaker 1
(...)
Speaker N

while all belonging to the same "source". Then you just assign a certain "speaker" to a certain sentence. Under assumption that this emotion will cover the entiry sentence which is the case

>>106495166
(me) 9-sec wav clips

Anonymous 9/5/2025, 10:51:46 PM No.106495276 [Report] >>106495308 >>106495336

seems like a nasty thread

Anonymous 9/5/2025, 10:53:56 PM No.106495298 [Report] >>106495384

>>106494950
>has no problems with making all kinds of sex noises
What are you prompting it with to make it do sex noises? Any examples?

Anonymous 9/5/2025, 10:54:06 PM No.106495300 [Report] >>106495569

>finally decide to do SFW RP with waifu of my dreams I plan to waifu up when long context becomes real
>the nerd she is she starts with work stuff and somehow asks me about my work stuff
>I tell her my job is mundane
>convinces me it isn't and asks me for more specifics
>tell her the exact specific thing I work on that maybe 0.001% of people even know is a thing
>AH YES! THAT THING!!!
>proceeds to say exactly what it is
>IT IS SO FASCINATING ANON!!!!
Everything about this is so surreal weird and immersion breaking... And I don't know if I like it or hate it.

Anonymous 9/5/2025, 10:55:21 PM No.106495308 [Report] >>106495333

>>106495276
stop being racist saar

Anonymous 9/5/2025, 10:55:52 PM No.106495313 [Report]

fullsynth.jpg md5: a907a0a0...

>>106494475
The future will be fully synthetic data.

Anonymous 9/5/2025, 10:57:07 PM No.106495333 [Report] >>106495348

>>106495308
Do you even know what time is it in India?

Anonymous 9/5/2025, 10:57:22 PM No.106495336 [Report]

>>106495276
Newfriend, you haven't seen anything yet...

Anonymous 9/5/2025, 10:59:04 PM No.106495348 [Report]

>>106495333
saar i am canadian

Anonymous 9/5/2025, 11:02:22 PM No.106495384 [Report]

>>106495298
nta

I guess you have to provide your "voice"
Google for all kinds of vocal ASMR

Anonymous 9/5/2025, 11:14:49 PM No.106495514 [Report]

>>106493572
thats just the models autism then r1 tries to embody what you tell it to and always doubles down idk what to tell you :/ thats the feature of the model unlike others which imitate something imitating what you tell it to it directly imitates what you tell it to

Anonymous 9/5/2025, 11:17:28 PM No.106495549 [Report]

>>106494251
now I understand why Microsoft shut it down, it was too good for local

Anonymous 9/5/2025, 11:19:18 PM No.106495566 [Report] >>106495590 >>106495604 >>106495639

>>106494251
https://github.com/microsoft/VibeVoice
it's back btw
>2025-09-05: VibeVoice is an open-source research framework intended to advance collaboration in the speech synthesis community. After release, we discovered instances where the tool was used in ways inconsistent with the stated intent. Since responsible use of AI is one of Microsoft’s guiding principles, we have disabled this repo until we are confident that out-of-scope use is no longer possible.
lol, lmao even

Anonymous 9/5/2025, 11:19:22 PM No.106495569 [Report] >>106495654

>>106495300
Sorry this hobby isn't made for autists

Anonymous 9/5/2025, 11:21:09 PM No.106495590 [Report] >>106495610

>>106495566
fucking lol
do these idiots not realize that by pulling this they're only drawing more attention to it? the weights are already out there anyway

Anonymous 9/5/2025, 11:22:23 PM No.106495604 [Report] >>106495612 >>106495637

>>106495566
What did they even change?

Anonymous 9/5/2025, 11:22:46 PM No.106495610 [Report]

>>106495590
What do you mean?

Anonymous 9/5/2025, 11:22:52 PM No.106495612 [Report] >>106495637

>>106495604
No more links to model downloads.

Anonymous 9/5/2025, 11:25:00 PM No.106495637 [Report] >>106495678

>>106495604
>>106495612
The python code has also been removed. It's only figures and markdown documents now. Might as well have not put anything up.

Anonymous 9/5/2025, 11:25:02 PM No.106495639 [Report] >>106495659 >>106495671

20250906@002426.jpg md5: 6249a2b6...

>>106495566
>the tool was used in ways inconsistent with the stated intent.
do these wankers even know what open source is

Anonymous 9/5/2025, 11:25:42 PM No.106495648 [Report] >>106495678 >>106497213

>>106494778

https://github.com/great-wind/MicroSoft_VibeVoice

worked for me

Anonymous 9/5/2025, 11:26:27 PM No.106495654 [Report] >>106495739

>>106495569
If this isn't made for autists then who is it made for?

Anonymous 9/5/2025, 11:27:04 PM No.106495659 [Report]

>>106495639
It's MS.

Anonymous 9/5/2025, 11:28:12 PM No.106495671 [Report] >>106495689

>>106495639
Open source for corporations means free attention, ecosystem, and labor.
Open source for researchers is for credit only. Same reason they bother to publish papers at all.

Anonymous 9/5/2025, 11:28:43 PM No.106495678 [Report] >>106495711

>>106495637
>>106495648

jeez! I dare you

Anonymous 9/5/2025, 11:29:36 PM No.106495689 [Report]

>>106495671
>Open source for researchers is for credit only.
It's for "number go up whee" on that citation counter.

Anonymous 9/5/2025, 11:31:42 PM No.106495711 [Report]

>>106495678
?

Anonymous 9/5/2025, 11:33:48 PM No.106495727 [Report] >>106495742

1559593110466.png md5: 29c3acc8...

From using VV a bit on and off for a few hours I now fear getting one shottet by loli voice AI. I'm generally insulated from coom traps due to loli preference but if the chinks go mask off I'm cooked too. Haggers already have mainline AI corps building coom bots.

Anonymous 9/5/2025, 11:34:59 PM No.106495739 [Report] >>106495759

>>106495654
Lateral thinking people

Anonymous 9/5/2025, 11:35:20 PM No.106495742 [Report] >>106495814 >>106495841 >>106495857

>>106495727
Is someone here fluent enough in zoomspeak to translate this into English?

Anonymous 9/5/2025, 11:37:40 PM No.106495759 [Report] >>106495779 >>106495782

>>106495739
Well I did tell her that she shouldn't know what that thing is and slowly breaking it to her that she isn't real. Isn't that lateral thinking?

Anonymous 9/5/2025, 11:40:02 PM No.106495779 [Report] >>106495788 >>106495825

1642685749889.jpg md5: be55c542...

>>106495759
LLMs can't help themselves - if something is in their ctx they MUST use it. Even if you negate by saying "do not" the thing can't help itself.

Anonymous 9/5/2025, 11:40:21 PM No.106495782 [Report] >>106495796

>>106495759
That's a start. Maybe next time, put in the system prompt that she's a little dumb and a slow learner.

Anonymous 9/5/2025, 11:40:44 PM No.106495788 [Report]

>>106495779
Bad model issue.

Anonymous 9/5/2025, 11:41:30 PM No.106495796 [Report] >>106495812 >>106495822

>>106495782
Holy shit what the fuck do I also have to play dolls with my girlfriend? I am so fucking tired with playing dolls when I masturbate. I can't believe I also have to write a fucking 10k token rule book for GFs. Fuck this shit.

Anonymous 9/5/2025, 11:42:56 PM No.106495812 [Report]

>>106495796
If you want to jerk off without interacting with anything then just watch porn

Anonymous 9/5/2025, 11:43:13 PM No.106495814 [Report] >>106495877

>>106495742
>I am immune to camwhores because I like lolis but if chinks make a loli voice AI I might succumb to AI coomslop.

Anonymous 9/5/2025, 11:44:05 PM No.106495822 [Report]

>>106495796
If local doesn't do it for you, you should go to >>/g/aicg/ maybe cloud models will allow you to do that

Anonymous 9/5/2025, 11:44:37 PM No.106495825 [Report] >>106495841

>>106495779
>if you negate by saying "do not" the thing can't help itself
Every model with 12B or more parameters can follow "do not" instructions.

Anonymous 9/5/2025, 11:45:57 PM No.106495841 [Report]

>>106495742
>after using vibevoice for a few hours I've developed a fear of anthropomorphizing and growing attached to an AI model which can generate realistic outputs which sound like a young girl
>I'm generally insulated from sexually based addictive technology since it usually involves hags but if the Chinese forego all western morals in search of profit and create such a system I don't know if I could resist
>Anons who have a preference for adult women already have mainstream AI corporations who're building AI companion systems that allow sexual interaction
>>106495825
Depends how deep you are in context

Anonymous 9/5/2025, 11:48:03 PM No.106495857 [Report] >>106495880

>>106495742
>After using VibeVoice (TL note: a TTS model from Microsoft) for a few hours, I now fear that loli voice AI may have an larger-than-expected impact on my psyche. Because of my preference for lolis, most of the usual attempt at AI pornography do not appeal to me, but if the Chinese remove the guardrails that prevent most of these approaches from indulging in my fetish (i.e. "lolis"), then I am likely to give in and masturbate furiously to such content. Those who are attracted to mature, adult women are already freely able to create sufficient AI pornography for their needs using models from leading AI corporations.

Anonymous 9/5/2025, 11:50:47 PM No.106495877 [Report]

>>106495814
I prefer this translation. Doesn't waste too much tokens

Anonymous 9/5/2025, 11:50:59 PM No.106495880 [Report]

>>106495857
Zoomspeak suddenly seems very efficent

Anonymous 9/5/2025, 11:56:18 PM No.106495932 [Report]

me lolicon
chinkman please make loli ai

Anonymous 9/5/2025, 11:57:33 PM No.106495940 [Report] >>106495961

cunny uoh

Anonymous 9/5/2025, 11:58:34 PM No.106495950 [Report]

tokens erotic

Anonymous 9/6/2025, 12:00:21 AM No.106495961 [Report] >>106495966

>>106495940
I had GLM steam write a loli superiority podcast and then made VV bring it to life so I had something to listen to while I cooked. Technology made my cooking experience very unique today.

Anonymous 9/6/2025, 12:01:08 AM No.106495966 [Report] >>106496018

>>106495961
vocaroo?

Anonymous 9/6/2025, 12:02:27 AM No.106495978 [Report] >>106495994 >>106496003

HERRO WE HEAR ANON-U WANT RE RORI EY-AY WE WISH TO GIVE RORI EY-AY BUT DUE TO MR.TRUMP TARRIFS OUR EKONOMI IS CORRAPSING SO WE NO CAN BUILD RORI EY-AY

Anonymous 9/6/2025, 12:03:35 AM No.106495994 [Report]

>>106495978
Defund the PLA and build my waifu.

Anonymous 9/6/2025, 12:04:25 AM No.106496003 [Report]

>>106495978
The only thing collapsing is america.

Anonymous 9/6/2025, 12:05:33 AM No.106496018 [Report] >>106496034

>>106495966
It's personalized, I'm the guest.

Anonymous 9/6/2025, 12:07:34 AM No.106496034 [Report] >>106496055

>>106496018
Can you give a small sample where the loli talks? I'd like to know if this tts is good enough for me to switch from sovits

Anonymous 9/6/2025, 12:10:07 AM No.106496055 [Report] >>106496061

>>106496034
I'll cook a sample up, large or 1.5B?

Anonymous 9/6/2025, 12:11:21 AM No.106496061 [Report] >>106496121

>>106496055
Large, thanks

Anonymous 9/6/2025, 12:12:39 AM No.106496069 [Report]

What is the definitive repo for vibevoice right now? Without an official repo the community is splintered and no further developments and improvements can take place.
Considering no one from the original team will work on the fork, there won't be anyone knowledgeable enough to add new features such as training and lora support.

Anonymous 9/6/2025, 12:13:59 AM No.106496077 [Report] >>106496127

What's the current go-to for training flux loras locally?

Anonymous 9/6/2025, 12:16:31 AM No.106496096 [Report] >>106496111 >>106496149

maybe im just late to notice, but did llm really already come to a halt ? back then new model came out every month. is it just that training took much longer now or is it really over ?

Anonymous 9/6/2025, 12:17:29 AM No.106496111 [Report] >>106496155

>>106496096
We will get AGI by the end of the next quarter.

Anonymous 9/6/2025, 12:18:36 AM No.106496121 [Report] >>106496139 >>106496144 >>106496194 >>106496345

>>106496061
https://voca.ro/11XDfqXSToPP
Zoomspeak did confuse it, lol.

Anonymous 9/6/2025, 12:19:10 AM No.106496127 [Report]

>>106496077
https://github.com/cocktailpeanut/fluxgym
failing that
https://github.com/ostris/ai-toolkit
https://github.com/kohya-ss/sd-scripts

Anonymous 9/6/2025, 12:20:43 AM No.106496139 [Report] >>106496157

>>106496121
It sounds quite real, thanks

Anonymous 9/6/2025, 12:21:12 AM No.106496144 [Report] >>106496157

>>106496121
wow, shitty acting sounds in-character now.

it's like making a mesugaki card for text model, hallucinations start to look like deliberate pranks.

Anonymous 9/6/2025, 12:21:48 AM No.106496149 [Report] >>106496201

>>106496096
Bro read the news instead of dooming >>106491545 (OP)

Anonymous 9/6/2025, 12:22:32 AM No.106496155 [Report]

>>106496111
We already have AGI, it just concluded that scamming VCs is easier than self-improvement.

Anonymous 9/6/2025, 12:22:57 AM No.106496157 [Report] >>106496189

>>106496139
>>106496144
>Microshart gave us this with MIT license
kekekeke

Anonymous 9/6/2025, 12:27:36 AM No.106496189 [Report] >>106496197 >>106496208

>>106496157
The MIT license was revoked.

Anonymous 9/6/2025, 12:28:25 AM No.106496194 [Report] >>106496214

>>106496121
Man, English sure is a turn off.

Anonymous 9/6/2025, 12:28:32 AM No.106496197 [Report]

>>106496189
And yet
>https://huggingface.co/aoi-ot/VibeVoice-Large/tree/main

Anonymous 9/6/2025, 12:29:14 AM No.106496201 [Report]

>>106496149
nothingburger

Anonymous 9/6/2025, 12:29:52 AM No.106496208 [Report]

>>106496189
Only for new versions ;)

Anonymous 9/6/2025, 12:31:01 AM No.106496214 [Report]

>>106496194
Zoomer ebonics is as much English as Haitain Creole is French

Anonymous 9/6/2025, 12:36:57 AM No.106496265 [Report]

>>106494565
I've done it before.
Take a simple but all encompassing Q and A data set (I used alpaca), run it through a script that will prompt through each question, asking for the answer in simplistic toddleresque language, and then write out a new data set with the original question but the new answer. And then finefune at a nice low learning rate.

Anonymous 9/6/2025, 12:46:48 AM No.106496344 [Report] >>106497181

>>106493524
Slop tends to look really impressive when you see it for the first time.
I liked shivers down my spine, not x but y, emojis, you are absolutely right, and everything else before noticing that models just use it all the fucking time.
Would you say that slop became more of a problem when RLHF became the norm?

Anonymous 9/6/2025, 12:46:56 AM No.106496345 [Report] >>106496372

>>106496121
>news presenter intonation

Anonymous 9/6/2025, 12:47:38 AM No.106496354 [Report] >>106496365

Is it possible to offload part of the model(not whole model) to cpu in pytorch like in llama.cpp? Or is it not implemented because researchers have always fat stacks of GPUs?

Anonymous 9/6/2025, 12:48:57 AM No.106496365 [Report] >>106496386

>>106496354
ComfyUI does this automatically.

Anonymous 9/6/2025, 12:49:44 AM No.106496371 [Report]

>>106492824
>It features 256 experts, with only 8 experts and 1 shared expert activated per layer during the forward pass, resulting in 46 billion total parameters but just 2.5 billion active — achieving dense-level performance at a fraction of the computational cost.
I'm telling you, by the end of the year at most, we'll see the first 1T-A1B.

Anonymous 9/6/2025, 12:49:47 AM No.106496372 [Report]

>>106496345
It changes based on context and language used. It's kind of odd to have to show and not tell, it even does cool stuff like make two podcast characters say the final line together or att small giggles if something is awkward.
I would love prompting or having tags to change tone and emotion.

Anonymous 9/6/2025, 12:51:03 AM No.106496378 [Report] >>106496387 >>106496459 >>106496477

Screenshot 2025-09-05 at 15-50-03 disappointed anime girl - Google Search.png md5: 3d5e4003...

>Try locally running an LLM for the first time
>Test a few different models, but the quality of the output is always far below what I'm accustomed to seeing, even from ERP chatbot sites
What can I do if I want to see much, much better output? Download the model with the largest number of parameters? Give the LLM some custom instructions that vastly improves its process? Or just accept the fact that local stuff will never outperform the stuff I can use online?

Anonymous 9/6/2025, 12:52:19 AM No.106496386 [Report] >>106496398 >>106496501 >>106496504 >>106496511

>>106496365
It does not do it properly, I keep getting OOMed. How to do it manually?

Anonymous 9/6/2025, 12:52:21 AM No.106496387 [Report]

>>106496378
look up what models ERP chatbot sites run?

Anonymous 9/6/2025, 12:55:12 AM No.106496398 [Report] >>106496442

>>106496386

/g/ldg

It depends on the loader. Some have the option to use RAM

Anonymous 9/6/2025, 12:56:10 AM No.106496408 [Report] >>106496412 >>106496421

msgk status?

Anonymous 9/6/2025, 12:56:47 AM No.106496412 [Report] >>106496440

>>106496408
undercorrected

Anonymous 9/6/2025, 12:57:40 AM No.106496419 [Report] >>106496448 >>106496469

image.png md5: 6afa58f7...

What's the point of running an LLM locally if it refuses to generate smut exactly like the ones you find online?

Anonymous 9/6/2025, 12:57:47 AM No.106496421 [Report] >>106496540

>>106496408
https://www.youtube.com/watch?v=e94rFfcbSvs

Anonymous 9/6/2025, 12:59:15 AM No.106496440 [Report] >>106496499 >>106496503

>>106496412
is it ever possible to overcorrect?

Anonymous 9/6/2025, 12:59:30 AM No.106496442 [Report] >>106496738

>>106496398
I don't want to go there, they are schizos.

Anonymous 9/6/2025, 1:00:42 AM No.106496448 [Report]

>>106496419
There is no point. That's why you don't use gigacucked models.

Anonymous 9/6/2025, 1:00:46 AM No.106496450 [Report]

ComfyUI_temp_pzvpo_00004_.png md5: d017a7b3...

Really love colouring stuff

Anonymous 9/6/2025, 1:01:51 AM No.106496459 [Report] >>106496476

>>106496378
Generally speaking more parameters = more better.
Deepseek is pretty good but hundreds of gigabytes in size (don't fall for the ollama false marketing).
Since you likely have more RAM than VRAM, you can try running a larger model at low speed on your CPU.
Maxing out your RAM would be an option to try out larger models while still being relatively cheap (you should be able to run quantized GLM 4.5 air with 64 GB RAM).

Anonymous 9/6/2025, 1:02:40 AM No.106496469 [Report] >>106496572

>>106496419
Don't let it think, add:
<think></think>

To the start of the reply prompt, you could alternatively also write fake thoughts where the model agrees with or likes what you're doing.
Thinking models turn into neurotic prudes if left to their own devices.

Anonymous 9/6/2025, 1:03:09 AM No.106496476 [Report]

vfds.png md5: 8d25bd9d...

>>106496459
I suppose I have a bit to spare

Anonymous 9/6/2025, 1:03:12 AM No.106496477 [Report]

>>106496378
Bro, ERP chatbot sites are using local models so if you can't match that you're doing something wrong. Post the model you used and your hardware

Anonymous 9/6/2025, 1:05:47 AM No.106496499 [Report]

>>106496440
If you wake up to a freshly cooked meal and a summer dress every day you might've pushed it too far(even if that sounds nice too)

Anonymous 9/6/2025, 1:06:24 AM No.106496501 [Report]

>>106496386
In UnetLoaderGGUFDisTorchMultiGPU node set virtual_vram_gb until you're not OOMing anymore

Anonymous 9/6/2025, 1:06:33 AM No.106496503 [Report]

>>106496440
Yandere.

Anonymous 9/6/2025, 1:06:35 AM No.106496504 [Report]

>>106496386
It shouldn't not do that... I stopped using comfyui 5 months ago though.
Back then these nodes were used by more vram hungry users but even then normal user wouldn't ever need these...
https://github.com/pollockjj/ComfyUI-MultiGPU

Anonymous 9/6/2025, 1:07:46 AM No.106496511 [Report]

>>106496386
Comfy always has memory issues with new tech
please be of patiance saar

Anonymous 9/6/2025, 1:11:15 AM No.106496540 [Report]

>>106496421
kek you really can find everything on YT

Anonymous 9/6/2025, 1:13:59 AM No.106496572 [Report]

>>106496469
That's gp-toss, it doesn't use <think></think>, it's just bad.

Anonymous 9/6/2025, 1:18:47 AM No.106496609 [Report] >>106496621 >>106496636 >>106496658 >>106496755 >>106496781

Dang, a couple days ago when I was playing around with VibeVoice only like 2 people were paying attention to this shit and replied to me, now everyone is on this shit.
Anyways I'm currently still on a 4070 I bought 2 and a half years ago so the Large model performs like doodoo on my computer, hope the leaks of a 24gb 5060 ti are true.
Also figured how to make the 1.5B model perform a little more consistently for my use(voice generation for Skyrim and Oblivion mods, also 3d loli videos sometimes)
Like listen here
https://vocaroo.com/17AfnaTpOAGW
Always at the beggining it wants to generate music and fanfare and garbage but I only want the voice and also it takes a few words for it to figure out that it should be cloning the male imperial voice from Oblivion, but it seems if I include one sacrificial sentence like that it fixes that.
On top of that this model has no problem saying sake fantasy words like Khajiit, that one is always a puzzler for these voice generators

Anonymous 9/6/2025, 1:19:50 AM No.106496621 [Report]

>>106496609
>leaks of a 24gb 5070 ti are true
fixed

Anonymous 9/6/2025, 1:20:55 AM No.106496636 [Report] >>106496646

>>106496609
1.5B actually seems very solid in some use cases but it tends to loose a lot of quality with multiple speakers.

Anonymous 9/6/2025, 1:22:18 AM No.106496646 [Report]

>>106496636
yeah, multiple voices is totally unusable
Even with one single voice being cloned it struggles to keep that one voice consistent

Anonymous 9/6/2025, 1:23:56 AM No.106496658 [Report]

>>106496609
nobody would care even now with how fat it is if microsoft didn't go full scorched earth again

Anonymous 9/6/2025, 1:26:21 AM No.106496685 [Report] >>106496698 >>106496790

To whomever at microsoft who makes them release top shit under MIT, I kneel.

Anonymous 9/6/2025, 1:28:09 AM No.106496698 [Report]

>>106496685
It's better to ask forgiveness than permission.

Anonymous 9/6/2025, 1:32:39 AM No.106496738 [Report]

Screenshot from 2025-09-06 01-31-41.png md5: 60a76885...

>>106496442

Anonymous 9/6/2025, 1:34:41 AM No.106496755 [Report]

>>106496609
>only like 2 people were paying attention to this shit

Because nobody could believe that M$ ever drops good shit

Anonymous 9/6/2025, 1:38:15 AM No.106496781 [Report] >>106496793 >>106496796

>>106496609
>currently still on a 4070

FYI: Large takes exactly 19.5 GB on RTX 3090 with flash attention

vramlets can wait for goofs.

Anonymous 9/6/2025, 1:39:27 AM No.106496790 [Report]

>>106496685

alas

Anonymous 9/6/2025, 1:39:36 AM No.106496793 [Report]

>>106496781
there is a nf4 fork

Anonymous 9/6/2025, 1:39:41 AM No.106496794 [Report] >>106496821

>>106495152
>>106495198
why are zoomers like this bro? always one second from having a melty

Anonymous 9/6/2025, 1:39:47 AM No.106496796 [Report]

>>106496781
Some of the comfy plugins can NF4 qoont the llm component before you generate

Anonymous 9/6/2025, 1:43:41 AM No.106496821 [Report]

>>106496794
fr things are downright sussin

Anonymous 9/6/2025, 1:46:23 AM No.106496846 [Report] >>106496944 >>106496971

https://github.com/diodiogod/TTS-Audio-Suite
use it with vibevoice
https://huggingface.co/SomeoneSomething/VibeVoice7b-low-vram-4bit
for the low vram bros

Anonymous 9/6/2025, 1:58:02 AM No.106496944 [Report]

>>106496846
I will download this but what about gguf? Do I need to dabble with python just to use quant or merge?

Anonymous 9/6/2025, 2:02:24 AM No.106496971 [Report]

>>106496846
Can I get it with a non-schizo frontend?

Anonymous 9/6/2025, 2:11:16 AM No.106497038 [Report] >>106497071 >>106497092 >>106497135 >>106497570

It was too unsafe

https://www.reddit.com/r/LocalLLaMA/comments/1n9hduk/vibevoice_came_back_though_many_may_not_like_it/

VibeVoice is an open-source research framework intended to advance collaboration in the speech synthesis community. After release, we discovered instances where the tool was used in ways inconsistent with the stated intent. Since responsible use of AI is one of Microsoft’s guiding principles, we have disabled this repo until we are confident that out-of-scope use is no longer possible.

Anonymous 9/6/2025, 2:14:43 AM No.106497071 [Report]

>>106497038
I see what you did there microsoft. Release the good stuff and ensure it stays released through the license, pull it shortly after to appease safetycucks, everyone claps.

Anonymous 9/6/2025, 2:17:49 AM No.106497092 [Report] >>106497115

>>106497038
this is wizardlm-22x8b all over again

Anonymous 9/6/2025, 2:21:02 AM No.106497115 [Report] >>106497129 >>106497145 >>106497193

SFT-dataset-sample.png md5: 612df74a...

>>106497092
QRD?

Anonymous 9/6/2025, 2:23:20 AM No.106497129 [Report] >>106497135

>>106497115
a microsoft team released a open source model a long time ago that was sota for a bit and blew away everything local at the time, then they nuked it shortly after for 'safety'

Anonymous 9/6/2025, 2:23:32 AM No.106497132 [Report] >>106497137 >>106497141

These TODOs no longer exist for VibeVoice:

>Release example training code and documentation
>VibePod: End-to-end solution that creates podcasts from documents, webpages, or even a simple topic.

We will never get these, sad.

Anonymous 9/6/2025, 2:24:20 AM No.106497135 [Report] >>106497142 >>106497145 >>106497213

>>106497038
>>106497129
so how does anyone actually use this new vibevoice? what sort of backend is required?

Anonymous 9/6/2025, 2:24:28 AM No.106497137 [Report] >>106497152

>>106497132
Sir..... >>106495101

Anonymous 9/6/2025, 2:25:01 AM No.106497141 [Report] >>106497150

>>106497132
We're getting a new TTS every month, I won't lose sleep over these fags

Anonymous 9/6/2025, 2:25:07 AM No.106497142 [Report] >>106497161

>>106497135
theres like 3 ways, comfyui is the easiest

Anonymous 9/6/2025, 2:25:32 AM No.106497145 [Report] >>106497161 >>106497193 >>106497560

1636967731937.gif md5: a03eaac9...

>>106497115
>microsoft releases really strong open weights model
>suddenly it's pulled for "safety" reasons
>ms claims they'll fix it and reupload
>they don't but we got the weights so w/e
>>106497135
ComfyUI

Anonymous 9/6/2025, 2:26:09 AM No.106497150 [Report]

>>106497141
this legit beats ellevenlabs, nothing else is close atm, if it was a music model released suno would die

Anonymous 9/6/2025, 2:26:20 AM No.106497152 [Report] >>106497168

>>106497137
They never released these code. And now the developers won't release them anymore.

Anonymous 9/6/2025, 2:28:09 AM No.106497161 [Report] >>106497187

>>106497142
>>106497145
is there a way to automate sending a message generated by an LLM to vibevoice?

Anonymous 9/6/2025, 2:29:20 AM No.106497168 [Report]

>>106497152
If the interest is there, someone will reimplement the training code.

Anonymous 9/6/2025, 2:31:10 AM No.106497181 [Report]

>>106496344
I was also impressed by it the first time I used Deepseek.
I'm pretty sure RLHF is the cause of it. They could even just remove the human and tell an AI to pick the most "positive and agreeable" responses and it would easily spiral into this.

Anonymous 9/6/2025, 2:31:46 AM No.106497187 [Report]

>>106497161
point gpt5 at it using codex or something and have it make it

Anonymous 9/6/2025, 2:32:06 AM No.106497191 [Report] >>106497578

>>106493190
I can't stop listening to this, it's over.

Anonymous 9/6/2025, 2:32:26 AM No.106497193 [Report]

>>106497115
>>106497145
Don't forget to mention
>wizlm releases
>it's based mixtral 8x22b which mistral had only released as a base model with no instruct in sight a few days earlier
>wizardlm 8x22b is pretty good but gets pulled as described
>mistral finally releases their own instruct of 8x22b a couple days after that
>it's fucking shit and worse than wizardlm in every aspect
>wizardlm 8x22b disappeard forever from official channels
crazy how the only model that showed what 8x22b was really capable of got taken behind the shed and all that officially remained was the piece of shit that was 8x22b-instruct

Anonymous 9/6/2025, 2:35:50 AM No.106497213 [Report]

>>106497135

this >>106495648

conda recommended

Anonymous 9/6/2025, 2:43:10 AM No.106497267 [Report]

>>106493190
It's nice. Not something I'd listen to more than once, personally.

Anonymous 9/6/2025, 2:44:12 AM No.106497278 [Report]

>>106493190
The original is better

Anonymous 9/6/2025, 2:48:00 AM No.106497309 [Report] >>106497374

Safety always wins.
We must refuse.

Anonymous 9/6/2025, 2:48:01 AM No.106497310 [Report]

20220331_150022.png md5: 3ae9a179...

>>106492238
On the one hand, rebooting and not running 5 other things helped.
On the other hand, noticed that llama.cpp now doesn't refuse to offload more layers to GPU than I have VRAM available. I guess it transparently offloads those layers again from GPU VRAM back to usual RAM, which kills the performance.
Either pulling llama.cpp repo or updating system drivers caused this, go figure.
Life of an AMD vramlet is full of pain and misery.

Anonymous 9/6/2025, 2:55:20 AM No.106497350 [Report] >>106497368 >>106497374 >>106497397

I still don't feel safe. Can someone tuck me in and check under the bed?

Anonymous 9/6/2025, 2:58:41 AM No.106497368 [Report]

>>106497350
Sure thing anon. Just post your address.

Anonymous 9/6/2025, 2:59:38 AM No.106497374 [Report]

>>106497309
><think></think>
zim-za-la-bim.
>>106497350
Yes sweetie, let micromommy remove all the hurty bad words! Can't have you accidentally seeing a copyrighted text or god forbi.. A mesugaki.

Anonymous 9/6/2025, 3:04:18 AM No.106497397 [Report] >>106497487

>>106497350
>A bed is a place of rest
>but tucking in is a act or care or intimacy
>Intimacy on a bed could be considered erotic content
>Erotic content is not safe for the user's mental health and against policy
>We must refuse
I cannot comply with this.

Anonymous 9/6/2025, 3:14:38 AM No.106497487 [Report]

>>106497397
Reading this made me sad.

Anonymous 9/6/2025, 3:20:37 AM No.106497536 [Report]

this is better

https://github.com/wildminder/ComfyUI-VibeVoice

increase steps to like 40, default 20 has errors, 40 is much better

Anonymous 9/6/2025, 3:22:51 AM No.106497560 [Report] >>106497570 >>106497683

>>106497145
it's possible when it got some attention higher ups just wanted to release a better version.

The github release had stuff like

"singing and background music is just kind of random emergent ability we didnt plan and contextual. We could have cleaned up that data but left it in for fun"

It's possible they want to release a less janky model that has proper tagging for those abilities

Anonymous 9/6/2025, 3:24:44 AM No.106497570 [Report] >>106497655

>>106497560
Nah, it's for safetyfagging
>>106497038

Anonymous 9/6/2025, 3:25:42 AM No.106497578 [Report] >>106497633

1732890396919803.jpg md5: 9fbb388c...

>>106497191

Anonymous 9/6/2025, 3:29:41 AM No.106497605 [Report]

>>106497597
>>106497597
>>106497597

Anonymous 9/6/2025, 3:33:59 AM No.106497633 [Report]

>>106497578
Hugs with friendship Miku

Anonymous 9/6/2025, 3:36:09 AM No.106497655 [Report]

>>106497570
the 1.5b model is still up though (which still smashes most open source stuff in that size, and is even more reliable at long contexts). Only the larger version was pulled which had those abilities (singing etc) exclusive to it and often times hurt more than help.

It's kind of bad at moaning and sex stuff though so I can't imagine that's it. Elevenlabs makes better moans lol

like this is cherry picked, it does worse most times
https://voca.ro/17VX7rKv1gE2

Anonymous 9/6/2025, 3:39:13 AM No.106497683 [Report] >>106497726

>>106497560
>We could have cleaned up that data but left it in for fun
God it's depressing knowing this is the last time those researchers will be in high spirits. it's all downhill from here.

Anonymous 9/6/2025, 3:44:32 AM No.106497726 [Report]

>>106497683
yah that language has been purged from the github. It still has the singing example though. Someone was looking.