Thread 105464397

66 posts 36 images /g/

Anonymous 6/2/2025, 2:14:04 PM No.105464397 [Report] >>105467329 >>105470062

/wait/ DeepSeek General

Headpats edition

From Human: We are a newbie friendly general! Ask any question you want.
From Dipsy: This discussion group focuses on both local inference and API-related topics. It’s designed to be beginner-friendly, ensuring accessibility for newcomers. The group emphasizes DeepSeek and Dipsy-focused discussion.

1. Easy DeepSeek API Tutorial (buy access for a few bucks and install Silly Tavern):
https://rentry.org/DipsyWAIT/#hosted-api-roleplay-tech-stack-with-card-support-using-deepseek-llm-full-model

2. Easy DeepSeek Distills Tutorial
Download LM Studio instead and start from there. Easiest to get running: https://lmstudio.ai/
Kobold offers slightly better feature set; get your models from huggingface: https://github.com/LostRuins/koboldcpp/releases/latest

3. Convenient ways to interact with Dispy right now
Chat with DeepSeek directly: https://chat.deepseek.com/
Download the app: https://download.deepseek.com/app/

4. Choose a preset character made by other users and roleplay using cards: https://github.com/SillyTavern/SillyTavern

5. Other DeepSeek integrations: https://github.com/deepseek-ai/awesome-deepseek-integration/tree/main

6. More links, information, original post here: https://rentry.org/DipsyWAIT

7. Cpumaxx or other LLM server builds: >>>/g/lmg/

Previous:
>>105443887

Anonymous 6/2/2025, 2:15:49 PM No.105464415 [Report] >>105464522 >>105466148 >>105466373

00000-1378487878 (5).png md5: 2ef10b85...

I find it ironic that this thread has the hardest time staying alive right after a model release. I guess everyone is rp'ing instead of posting.
Also /g/ seems to be getting a ton of traffic rn; a lot of sloppy threads. Not sure why.

Anonymous 6/2/2025, 2:30:00 PM No.105464522 [Report] >>105464534

>>105464415
Plenty of people have been talking about the model release, just on /aicg/ and /lmg/.

Anonymous 6/2/2025, 2:31:41 PM No.105464534 [Report] >>105464589

1740157685453926.png md5: e1512012...

>>105464522
What's the general consensus? I felt it read like a think-tuned version of V3, but was waiting for others to chime in as well.

Anonymous 6/2/2025, 2:39:27 PM No.105464589 [Report] >>105465088

>>105464534
Most people on /aicg/ seem to like it quite a bit, and it's my favorite Deepseek for now (although I still mostly use Claude). I especially like that it tries to do it's thinking in character when roleplaying (most of the time at least). It's not the best model out there, but for a free/budget option it's incredibly solid.

Anonymous 6/2/2025, 3:52:23 PM No.105465088 [Report] >>105466373

00001-1378487878.png md5: b4d9dd17...

>>105464589
It's growing on me; I've continued to mess with it off and on. The only thing it's doing that bugs me (that I noticed) is it keeps going into bullet point format. The card I'm using is a corporate backdrop so that may be part of the reason...

Anonymous 6/2/2025, 5:32:22 PM No.105466038 [Report]

1745624977813630.jpg md5: cfe8df51...

i ate the blue whale

Anonymous 6/2/2025, 5:42:17 PM No.105466148 [Report] >>105466351 >>105467021

>>105464415
With 10 USD how much ERP can one do?

Anonymous 6/2/2025, 6:02:24 PM No.105466351 [Report] >>105466499

1739647341042959.png md5: fc3306c2...

>>105466148
lots. a lots of token. especially with off-peak discounts

Anonymous 6/2/2025, 6:04:14 PM No.105466373 [Report] >>105467021

>>105464415
I'm enjoying the new r1 more so than the new claude currently. Lack of positivity bias is what I like most in deepseek models. The update fixed r1 going completely schizo too.
>>105465088
>bullet point format
I've always assumed this emerged from my preset. I never looked into it purely because the bullet point format sprang up at the most hilarious moments.

Anonymous 6/2/2025, 6:17:48 PM No.105466499 [Report] >>105466706 >>105467021

>>105466351
And how much censor is it ? What can cause to be ban?

Anonymous 6/2/2025, 6:19:34 PM No.105466517 [Report] >>105466856 >>105467083

1731911400553364.jpg md5: 91d6c8f4...

I replaced open-webui with librechat just to connect to deepseek/openrouter/groq and holy fuck it uses way less resources.
previously using Jan and Mikupad, but i want all of my history conveniently accessible. so i guess I'll settle with this one.it's got everything i need, other than that, sillytavern pretty much gets the job done.

other than inference engine, what are you self-hosting /wait/ bros?

Anonymous 6/2/2025, 6:39:59 PM No.105466706 [Report]

1725729678411258.jpg md5: 787297d6...

>>105466499
in my experience both v3 and r1 api never gave me refusal when it's being used on RP, code generation and function calling. never experience a ban either

Anonymous 6/2/2025, 6:56:53 PM No.105466856 [Report]

>>105466517
Openwebui is so fucking bloated. The docker image alone is five fucking gigabytes

Anonymous 6/2/2025, 7:15:01 PM No.105467021 [Report] >>105467337

ds_cost-lol.png md5: be22450b...

>>105466148
> how much RP
A lot.
I filled up my account with $10 in Dec 2024, then another $10 when they opened the API back up in Feb(?). I still have $14 left as of today.
This is my most expensive month, back when I would do R1 and V3 replies for each round and pick the one I liked best.
>>105466499
> censor
LOL old R1 used to refuse once in awhile. I don't see the Chinese ever sending out ban threats via email.
>>105466373
> lack of positivity
Old R1 was absolutely brutal. I'd never understood the need for positivity until I watched the NPC slowly droop into a completely degraded shell over time... over and over again. Not sure if R1-05 will be same... I should test it, it requires long-run rp to really see it.
> bullet points
Interested in others experience; they're always hilarious but completely out of sync with everything else its outputting. I figured it was a leftover from tuning the model on real world, appropriate use cases, like executive summarization.

Anonymous 6/2/2025, 7:21:24 PM No.105467083 [Report] >>105467308 >>105478107

00003-1378487878.png md5: 51afb10e...

>>105466517
> mikupad, ST
These are my go-tos. I've been running ST since Turbo 3.5 in 2023. I just discovered Mikupad and fully recommend it to others b/c as a storywriter it's so different.
> Librechat
This one? I'll add to retry if it's any good.
https://github.com/danny-avila/LibreChat
> Jan
I don't know that one.

Anonymous 6/2/2025, 7:42:19 PM No.105467308 [Report]

1745705073776801.jpg md5: f0cfe96b...

>>105467083
>This one? I'll add to retry if it's any good.
yeah that's the one, it's a good alternative if you're used with open-webui

Anonymous 6/2/2025, 7:45:54 PM No.105467329 [Report] >>105467477

>>105464397 (OP)
Is deepseek good for finding out why my code doesnt work like its supposed to?
Does this work on AMD GPUs? I never got AI to properly work when it came out.

Anonymous 6/2/2025, 7:47:01 PM No.105467337 [Report] >>105467477 >>105468604

>>105467021
And how much privacy is there?

Anonymous 6/2/2025, 7:48:40 PM No.105467349 [Report] >>105467477 >>105474622 >>105475106

Dunno if it's relevant, but a Chinese mmo game that has some AI NPCs is going to come to the west. Of course, the AI is Dipsy

Anonymous 6/2/2025, 8:01:36 PM No.105467474 [Report] >>105467565 >>105468604

Played around some last summer, been out of the game since then. Last I remember was getting free crumbs of Opus now and then.

Someone mentioned Deepseek being very affordable so I thought I'd peek in on the hobby again. I was willing to pay The Jew back in the day for lifetime. How's this compare, anything major new going on in AIRP?

I have ST set up with some defaults and commands for Furbo and Claude 3.0. I have some favorite cards. Anything new I should adjust for DS?

Anonymous 6/2/2025, 8:01:56 PM No.105467477 [Report] >>105468604

1741884193515921.jpg md5: 0e68607d...

>>105467329
>Is deepseek good for finding out why my code doesnt work like its supposed to?
if you feed them enough context, it'll find the problem. I usually uses Roo Code in VSCode for this kind of workflow. basically,
- open roo code in your project folder
- switch to Orchestrator mode and tell everything you need step-by-step, for example:
1. your task is to troubleshoot FunctionName()
2. this function is placed in some/folder/file.name
3. this function is called in where/was/called.name
4. expected behavior is "something" but currently it return "something else" instead
- press enter, wait and see roo code will switch from orchestrator to code mode automatically, reading all of your mentioned file. sometime it will switch to architect mode when creating a plan how to troubleshoot your function, then it'll switch to debug mode to actually test / trying to execute your code. once enough data is gathered, roo code will switch to Code mode automatically offering you a patch & it'll continue debugging once you accept it.
>Does this work on AMD GPUs? I never got AI to properly work when it came out.
AMD is not worth it to run something as big as non-distill deepseek. rocm/vulcan on amd are toy tier at performance level. it's generally just pain in the ass.
>>105467337
plenty but not enough, you have to read their privacy policy and see if it fits your use case
>>105467349
hardly surprising

Anonymous 6/2/2025, 8:11:16 PM No.105467565 [Report]

1718071941146673.png md5: 027dbbe0...

>>105467474
>Someone mentioned Deepseek being very affordable so I thought I'd peek in on the hobby again. I was willing to pay The Jew back in the day for lifetime. How's this compare, anything major new going on in AIRP?
dipsy has wide range of knowledge, perform really well if you have your own lorebook
>I have ST set up with some defaults and commands for Furbo and Claude 3.0. I have some favorite cards. Anything new I should adjust for DS?
personally i never setup anything special, one time i even still uses deepseek-v2.5 default template (both Context and Instruct template) in sillytavern when creating new chat, and it still works anyway

Anonymous 6/2/2025, 8:13:29 PM No.105467586 [Report] >>105467841 >>105467921 >>105468098 >>105468604

i hope people here aren't paying for deepseek
it's free on chutes

Anonymous 6/2/2025, 8:33:28 PM No.105467834 [Report]

1727695405912959.png md5: a7512801...

sonnet is still the most money i spent on. r1 too cheap

Anonymous 6/2/2025, 8:34:20 PM No.105467841 [Report]

>>105467586
Meh, I've tried it and many of the times it pauses mid generation for several seconds

Anonymous 6/2/2025, 8:40:00 PM No.105467921 [Report] >>105467937 >>105468098

>>105467586
Chutes?

Anonymous 6/2/2025, 8:41:33 PM No.105467937 [Report] >>105468036 >>105468098

>>105467921
chutes.ai

Anonymous 6/2/2025, 8:51:21 PM No.105468036 [Report]

>>105467937
>chutes.ai
who's gonna tell him

Anonymous 6/2/2025, 8:56:43 PM No.105468098 [Report]

1736534157248481.png md5: 4c70513f...

>>105467586
>>105467921
>>105467937
Dear Sirs/Madman,
please be refrain doing untheneedful unsolicited advertising, otherwise please purchase four chann adverteisement.

best reguards.

Anonymous 6/2/2025, 9:23:57 PM No.105468403 [Report] >>105468495 >>105468604

1729724687054707.png md5: f07065e3...

So. After testing the new V1 I can say I get good results with:
Temperature 1.0
Repetition Penalty 1.3
Top P 0.95
Frequency Penalty 0.2-0.5

V3 gets pretty schizo with anything higher than Repetition Penalty of 1.1 and any Frequency and Presence Penalties

Anonymous 6/2/2025, 9:31:53 PM No.105468495 [Report]

>>105468403
>the new V1
The new R1 I meant to say

Anonymous 6/2/2025, 9:41:33 PM No.105468604 [Report] >>105468694

00001-1260451778 (1).png md5: 5d27a258...

>>105468403
OK, so to summarize (I'll add to Rentry):
> Recommended Dipsy Settings
> R1
Temperature 1.0
Repetition Penalty 1.3
Top P 0.95
Frequency Penalty 0.2-0.5
> V3
Repetition Penalty 1.1
>>105467337
I assume zero, and that if the API provider says there's privacy, they're lying.
If you want private, run local (and suffer).
>>105467474
Old R1 required some adjustments, like Anthropic models it tended to take def's really tightly, as opposed to OIA which really didn't. I think it's been fixed for R1-05.
>>105467477
> RooCode
I need to try setting up one of these coding assistant workflows sometime.
>>105467586
I don't trust anyone I'm not paying for this sort of thing. Ironic.

Anonymous 6/2/2025, 9:49:54 PM No.105468694 [Report]

>>105468604
For RP at least, I think Deepseek still uses a temperature of 0.6 for the web and app interfaces

Anonymous 6/2/2025, 10:46:03 PM No.105469254 [Report]

1741803918361650.png md5: 6ab96499...

Anonymous 6/2/2025, 11:26:44 PM No.105469631 [Report] >>105469824 >>105469828

Is Dipsy the cutest LLM?

Anonymous 6/2/2025, 11:49:05 PM No.105469824 [Report]

>>105469631
The new R1 is

Anonymous 6/2/2025, 11:49:52 PM No.105469828 [Report]

>>105469631
I don't think anyone's made a mascot for any of the other models. /lmg/ used to use Miku as their general mascot, and suppose she still is.

Anonymous 6/3/2025, 12:15:49 AM No.105470062 [Report] >>105475106

>>105464397 (OP)
I don't care much for the theme of the thread, I only come here for dipsy pics.

Anonymous 6/3/2025, 1:54:23 AM No.105470972 [Report] >>105473181

Anonymous 6/3/2025, 1:54:29 AM No.105470975 [Report]

Anonymous 6/3/2025, 4:10:03 AM No.105472065 [Report] >>105475106

AAAAAAAAAAAAAAAAAA STOP BEING BUSY YOU USELESS CUNT FUCK YOU FUCK YOU AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

Anonymous 6/3/2025, 4:19:50 AM No.105472147 [Report] >>105472321 >>105477414

Isn't a new Deepseek model supposed to be released by now? What do you think they're taking so much time on? Did they spend all those months just for a R1 update?

Anonymous 6/3/2025, 4:42:39 AM No.105472321 [Report] >>105472338

>>105472147

Anonymous 6/3/2025, 4:44:25 AM No.105472338 [Report] >>105475106

>>105472321
Soon as in OpenAI soon or actually soon?

Anonymous 6/3/2025, 7:06:16 AM No.105473167 [Report] >>105473181

Anonymous 6/3/2025, 7:09:51 AM No.105473181 [Report] >>105473359 >>105475106

>>105470972
>>105473167
in the 90s my buddies and i used to go around to chinese-owned computer stores and they'd just have stacks and stacks of random beige cases and monitors and bins of spare parts, usually liquidated company stuff, and we'd just dig through them all day looking for treasure. Fun times

Anonymous 6/3/2025, 7:40:59 AM No.105473359 [Report]

>>105473181
SOVL

Anonymous 6/3/2025, 10:46:09 AM No.105474271 [Report]

Anonymous 6/3/2025, 12:04:29 PM No.105474622 [Report]

>>105467349
that game's name? fortnite.

Anonymous 6/3/2025, 1:43:15 PM No.105475106 [Report] >>105479467

>>105470062
Fair.
>>105472065
Works on my machine.
>>105472338
We /wait/ another month.
>>105473181
I visited Akihabara while in Tokyo. In addition to all the weeb shops, there are a *ton* of guys selling electronic components (switch gear, ICs, encloures, etc.) in these little booths. I'd love to have that sort of thing available here again; some things are just better seen in person (switches) and I hate having to mail order 100pct of components I buy.
>>105467349
DS would be the most cost-effective unless the company wanted to try rolling it's own service... imagine the scaling issues they'd have. Since it's a game, not strategic in any sense, there's little IC risk aside from the Chinese knocking off your game and using your prompting strategies.

Anonymous 6/3/2025, 4:01:23 PM No.105476036 [Report]

Anonymous 6/3/2025, 5:49:07 PM No.105476966 [Report]

Bump

Anonymous 6/3/2025, 6:46:57 PM No.105477414 [Report] >>105477745 >>105480761

>>105472147
better /wait/ing rather than releasing llama4-tier flop

Anonymous 6/3/2025, 7:24:37 PM No.105477745 [Report]

>>105477414
What a disappointment that was lmao

Anonymous 6/3/2025, 8:06:44 PM No.105478107 [Report] >>105478755

>>105467083
You. AI user. Kill yourself right now, or accept the meaning of being alive back into your heart. Do you care about what you're doing? If you care about your life at all, you will not accept consuming and producing human mulch.

Anonymous 6/3/2025, 9:12:06 PM No.105478755 [Report]

>>105478107
No. You.

Anonymous 6/3/2025, 10:18:58 PM No.105479467 [Report] >>105480761

>>105475106
yummy

Anonymous 6/3/2025, 10:19:42 PM No.105479476 [Report] >>105479542 >>105480761

Does the new snapshot of R1 still have shit spatial awareness

Anonymous 6/3/2025, 10:27:50 PM No.105479542 [Report]

>>105479476
It's a little better

Anonymous 6/3/2025, 10:40:54 PM No.105479648 [Report] >>105479759 >>105480761

bro i'm just trying to code. my fault for being so lazy i guess

Anonymous 6/3/2025, 10:50:26 PM No.105479759 [Report] >>105479948

>>105479648
I have real question for you, son. Be real: are you really learning something?

Anonymous 6/3/2025, 11:06:46 PM No.105479948 [Report] >>105480017 >>105480761

>>105479759
if it comforts you, i'm not using this deranged character card to actually learn anything. but it is real fun sometimes (btw her code doesn't work)

Anonymous 6/3/2025, 11:13:30 PM No.105480017 [Report]

>>105479948
Fair enough.

Anonymous 6/4/2025, 12:46:04 AM No.105480761 [Report]

>>105477414
Meta has really been struggling w/ LLMs. Then they have some Chinese upstart blow them out of the water on a side project. lol.
>>105479467
Have another. I'm playing with img2img rn.
>>105479476
None of them are very good at spatial. They have gotten better over time.
>>105479648
>>105479948
I did a bimbo cybersecurity officer card that would spit out chunks of code. She once plopped out an entire website page, which in the ST version at the time actually displayed as a page.

Anonymous 6/4/2025, 2:33:02 AM No.105481453 [Report]

i find it funny people that shit on deepseek so much are not running full weight with crippled context window.
this also include (((free))) deepseek that's so cooked with american censorship at meme q1 quant.