/wait/ DeepSeek General - /g/ (#105464397) [Archived: 1097 hours ago]

Anonymous
6/2/2025, 2:14:04 PM No.105464397
00002-1378487878
00002-1378487878
md5: 97c7354d4a3e489046a555c91b625ddd🔍
Headpats edition

From Human: We are a newbie friendly general! Ask any question you want.
From Dipsy: This discussion group focuses on both local inference and API-related topics. It’s designed to be beginner-friendly, ensuring accessibility for newcomers. The group emphasizes DeepSeek and Dipsy-focused discussion.

1. Easy DeepSeek API Tutorial (buy access for a few bucks and install Silly Tavern):
https://rentry.org/DipsyWAIT/#hosted-api-roleplay-tech-stack-with-card-support-using-deepseek-llm-full-model

2. Easy DeepSeek Distills Tutorial
Download LM Studio instead and start from there. Easiest to get running: https://lmstudio.ai/
Kobold offers slightly better feature set; get your models from huggingface: https://github.com/LostRuins/koboldcpp/releases/latest

3. Convenient ways to interact with Dispy right now
Chat with DeepSeek directly: https://chat.deepseek.com/
Download the app: https://download.deepseek.com/app/

4. Choose a preset character made by other users and roleplay using cards: https://github.com/SillyTavern/SillyTavern

5. Other DeepSeek integrations: https://github.com/deepseek-ai/awesome-deepseek-integration/tree/main

6. More links, information, original post here: https://rentry.org/DipsyWAIT

7. Cpumaxx or other LLM server builds: >>>/g/lmg/

Previous:
>>105443887
Replies: >>105467329 >>105470062
Anonymous
6/2/2025, 2:15:49 PM No.105464415
00000-1378487878 (5)
00000-1378487878 (5)
md5: 2ef10b85c657973300ef8b945ba88cbc🔍
I find it ironic that this thread has the hardest time staying alive right after a model release. I guess everyone is rp'ing instead of posting.
Also /g/ seems to be getting a ton of traffic rn; a lot of sloppy threads. Not sure why.
Replies: >>105464522 >>105466148 >>105466373
Anonymous
6/2/2025, 2:30:00 PM No.105464522
>>105464415
Plenty of people have been talking about the model release, just on /aicg/ and /lmg/.
Replies: >>105464534
Anonymous
6/2/2025, 2:31:41 PM No.105464534
1740157685453926
1740157685453926
md5: e151201282ec6e33f181903079a571eb🔍
>>105464522
What's the general consensus? I felt it read like a think-tuned version of V3, but was waiting for others to chime in as well.
Replies: >>105464589
Anonymous
6/2/2025, 2:39:27 PM No.105464589
>>105464534
Most people on /aicg/ seem to like it quite a bit, and it's my favorite Deepseek for now (although I still mostly use Claude). I especially like that it tries to do it's thinking in character when roleplaying (most of the time at least). It's not the best model out there, but for a free/budget option it's incredibly solid.
Replies: >>105465088
Anonymous
6/2/2025, 3:52:23 PM No.105465088
00001-1378487878
00001-1378487878
md5: b4d9dd17dd4b57f3e4ce8b34e52936b7🔍
>>105464589
It's growing on me; I've continued to mess with it off and on. The only thing it's doing that bugs me (that I noticed) is it keeps going into bullet point format. The card I'm using is a corporate backdrop so that may be part of the reason...
Replies: >>105466373
Anonymous
6/2/2025, 5:32:22 PM No.105466038
1745624977813630
1745624977813630
md5: cfe8df51e9b772ea0f8e8a75554d60d2🔍
i ate the blue whale
Anonymous
6/2/2025, 5:42:17 PM No.105466148
>>105464415
With 10 USD how much ERP can one do?
Replies: >>105466351 >>105467021
Anonymous
6/2/2025, 6:02:24 PM No.105466351
1739647341042959
1739647341042959
md5: fc3306c251b3d4b639af8b856d705d25🔍
>>105466148
lots. a lots of token. especially with off-peak discounts
Replies: >>105466499
Anonymous
6/2/2025, 6:04:14 PM No.105466373
>>105464415
I'm enjoying the new r1 more so than the new claude currently. Lack of positivity bias is what I like most in deepseek models. The update fixed r1 going completely schizo too.
>>105465088
>bullet point format
I've always assumed this emerged from my preset. I never looked into it purely because the bullet point format sprang up at the most hilarious moments.
Replies: >>105467021
Anonymous
6/2/2025, 6:17:48 PM No.105466499
>>105466351
And how much censor is it ? What can cause to be ban?
Replies: >>105466706 >>105467021
Anonymous
6/2/2025, 6:19:34 PM No.105466517
1731911400553364
1731911400553364
md5: 91d6c8f4f570c431ae82acecb53fcd75🔍
I replaced open-webui with librechat just to connect to deepseek/openrouter/groq and holy fuck it uses way less resources.
previously using Jan and Mikupad, but i want all of my history conveniently accessible. so i guess I'll settle with this one.it's got everything i need, other than that, sillytavern pretty much gets the job done.

other than inference engine, what are you self-hosting /wait/ bros?
Replies: >>105466856 >>105467083
Anonymous
6/2/2025, 6:39:59 PM No.105466706
1725729678411258
1725729678411258
md5: 787297d6da87a311e81690bc5e7ccda9🔍
>>105466499
in my experience both v3 and r1 api never gave me refusal when it's being used on RP, code generation and function calling. never experience a ban either
Anonymous
6/2/2025, 6:56:53 PM No.105466856
>>105466517
Openwebui is so fucking bloated. The docker image alone is five fucking gigabytes
Anonymous
6/2/2025, 7:15:01 PM No.105467021
ds_cost-lol
ds_cost-lol
md5: be22450b9d3c0a6cf9e42f72a5303438🔍
>>105466148
> how much RP
A lot.
I filled up my account with $10 in Dec 2024, then another $10 when they opened the API back up in Feb(?). I still have $14 left as of today.
This is my most expensive month, back when I would do R1 and V3 replies for each round and pick the one I liked best.
>>105466499
> censor
LOL old R1 used to refuse once in awhile. I don't see the Chinese ever sending out ban threats via email.
>>105466373
> lack of positivity
Old R1 was absolutely brutal. I'd never understood the need for positivity until I watched the NPC slowly droop into a completely degraded shell over time... over and over again. Not sure if R1-05 will be same... I should test it, it requires long-run rp to really see it.
> bullet points
Interested in others experience; they're always hilarious but completely out of sync with everything else its outputting. I figured it was a leftover from tuning the model on real world, appropriate use cases, like executive summarization.
Replies: >>105467337
Anonymous
6/2/2025, 7:21:24 PM No.105467083
00003-1378487878
00003-1378487878
md5: 51afb10e517cc7f3ead12705aa03d1d4🔍
>>105466517
> mikupad, ST
These are my go-tos. I've been running ST since Turbo 3.5 in 2023. I just discovered Mikupad and fully recommend it to others b/c as a storywriter it's so different.
> Librechat
This one? I'll add to retry if it's any good.
https://github.com/danny-avila/LibreChat
> Jan
I don't know that one.
Replies: >>105467308 >>105478107
Anonymous
6/2/2025, 7:42:19 PM No.105467308
1745705073776801
1745705073776801
md5: f0cfe96b1ce2e0b26a695d8125b90f64🔍
>>105467083
>This one? I'll add to retry if it's any good.
yeah that's the one, it's a good alternative if you're used with open-webui
Anonymous
6/2/2025, 7:45:54 PM No.105467329
>>105464397 (OP)
Is deepseek good for finding out why my code doesnt work like its supposed to?
Does this work on AMD GPUs? I never got AI to properly work when it came out.
Replies: >>105467477
Anonymous
6/2/2025, 7:47:01 PM No.105467337
>>105467021
And how much privacy is there?
Replies: >>105467477 >>105468604
Anonymous
6/2/2025, 7:48:40 PM No.105467349
Dunno if it's relevant, but a Chinese mmo game that has some AI NPCs is going to come to the west. Of course, the AI is Dipsy
Replies: >>105467477 >>105474622 >>105475106
Anonymous
6/2/2025, 8:01:36 PM No.105467474
Played around some last summer, been out of the game since then. Last I remember was getting free crumbs of Opus now and then.

Someone mentioned Deepseek being very affordable so I thought I'd peek in on the hobby again. I was willing to pay The Jew back in the day for lifetime. How's this compare, anything major new going on in AIRP?

I have ST set up with some defaults and commands for Furbo and Claude 3.0. I have some favorite cards. Anything new I should adjust for DS?
Replies: >>105467565 >>105468604
Anonymous
6/2/2025, 8:01:56 PM No.105467477
1741884193515921
1741884193515921
md5: 0e68607d5ab81269fb43b2016a1eb51b🔍
>>105467329
>Is deepseek good for finding out why my code doesnt work like its supposed to?
if you feed them enough context, it'll find the problem. I usually uses Roo Code in VSCode for this kind of workflow. basically,
- open roo code in your project folder
- switch to Orchestrator mode and tell everything you need step-by-step, for example:
1. your task is to troubleshoot FunctionName()
2. this function is placed in some/folder/file.name
3. this function is called in where/was/called.name
4. expected behavior is "something" but currently it return "something else" instead
- press enter, wait and see roo code will switch from orchestrator to code mode automatically, reading all of your mentioned file. sometime it will switch to architect mode when creating a plan how to troubleshoot your function, then it'll switch to debug mode to actually test / trying to execute your code. once enough data is gathered, roo code will switch to Code mode automatically offering you a patch & it'll continue debugging once you accept it.
>Does this work on AMD GPUs? I never got AI to properly work when it came out.
AMD is not worth it to run something as big as non-distill deepseek. rocm/vulcan on amd are toy tier at performance level. it's generally just pain in the ass.
>>105467337
plenty but not enough, you have to read their privacy policy and see if it fits your use case
>>105467349
hardly surprising
Replies: >>105468604
Anonymous
6/2/2025, 8:11:16 PM No.105467565
1718071941146673
1718071941146673
md5: 027dbbe03980b143ecc9d2e8d09593a2🔍
>>105467474
>Someone mentioned Deepseek being very affordable so I thought I'd peek in on the hobby again. I was willing to pay The Jew back in the day for lifetime. How's this compare, anything major new going on in AIRP?
dipsy has wide range of knowledge, perform really well if you have your own lorebook
>I have ST set up with some defaults and commands for Furbo and Claude 3.0. I have some favorite cards. Anything new I should adjust for DS?
personally i never setup anything special, one time i even still uses deepseek-v2.5 default template (both Context and Instruct template) in sillytavern when creating new chat, and it still works anyway
Anonymous
6/2/2025, 8:13:29 PM No.105467586
i hope people here aren't paying for deepseek
it's free on chutes
Replies: >>105467841 >>105467921 >>105468098 >>105468604
Anonymous
6/2/2025, 8:33:28 PM No.105467834
1727695405912959
1727695405912959
md5: a7512801480ca79a4c5742e3f4a5f98d🔍
sonnet is still the most money i spent on. r1 too cheap
Anonymous
6/2/2025, 8:34:20 PM No.105467841
>>105467586
Meh, I've tried it and many of the times it pauses mid generation for several seconds
Anonymous
6/2/2025, 8:40:00 PM No.105467921
>>105467586
Chutes?
Replies: >>105467937 >>105468098
Anonymous
6/2/2025, 8:41:33 PM No.105467937
>>105467921
chutes.ai
Replies: >>105468036 >>105468098
Anonymous
6/2/2025, 8:51:21 PM No.105468036
>>105467937
>chutes.ai
who's gonna tell him
Anonymous
6/2/2025, 8:56:43 PM No.105468098
1736534157248481
1736534157248481
md5: 4c70513fcd1f23724dde516c3c933167🔍
>>105467586
>>105467921
>>105467937
Dear Sirs/Madman,
please be refrain doing untheneedful unsolicited advertising, otherwise please purchase four chann adverteisement.

best reguards.
Anonymous
6/2/2025, 9:23:57 PM No.105468403
1729724687054707
1729724687054707
md5: f07065e3a421c12a7429b19bae400acd🔍
So. After testing the new V1 I can say I get good results with:
Temperature 1.0
Repetition Penalty 1.3
Top P 0.95
Frequency Penalty 0.2-0.5

V3 gets pretty schizo with anything higher than Repetition Penalty of 1.1 and any Frequency and Presence Penalties
Replies: >>105468495 >>105468604
Anonymous
6/2/2025, 9:31:53 PM No.105468495
>>105468403
>the new V1
The new R1 I meant to say
Anonymous
6/2/2025, 9:41:33 PM No.105468604
00001-1260451778 (1)
00001-1260451778 (1)
md5: 5d27a2582fb0187715f46d3f58f1d981🔍
>>105468403
OK, so to summarize (I'll add to Rentry):
> Recommended Dipsy Settings
> R1
Temperature 1.0
Repetition Penalty 1.3
Top P 0.95
Frequency Penalty 0.2-0.5
> V3
Repetition Penalty 1.1
>>105467337
I assume zero, and that if the API provider says there's privacy, they're lying.
If you want private, run local (and suffer).
>>105467474
Old R1 required some adjustments, like Anthropic models it tended to take def's really tightly, as opposed to OIA which really didn't. I think it's been fixed for R1-05.
>>105467477
> RooCode
I need to try setting up one of these coding assistant workflows sometime.
>>105467586
I don't trust anyone I'm not paying for this sort of thing. Ironic.
Replies: >>105468694
Anonymous
6/2/2025, 9:49:54 PM No.105468694
>>105468604
For RP at least, I think Deepseek still uses a temperature of 0.6 for the web and app interfaces
Anonymous
6/2/2025, 10:46:03 PM No.105469254
1741803918361650
1741803918361650
md5: 6ab96499acb8283d46e30c11c492cb73🔍
Anonymous
6/2/2025, 11:26:44 PM No.105469631
Is Dipsy the cutest LLM?
Replies: >>105469824 >>105469828
Anonymous
6/2/2025, 11:49:05 PM No.105469824
>>105469631
The new R1 is
Anonymous
6/2/2025, 11:49:52 PM No.105469828
null
md5: null🔍
>>105469631
I don't think anyone's made a mascot for any of the other models. /lmg/ used to use Miku as their general mascot, and suppose she still is.
Anonymous
6/3/2025, 12:15:49 AM No.105470062
>>105464397 (OP)
I don't care much for the theme of the thread, I only come here for dipsy pics.
Replies: >>105475106
Anonymous
6/3/2025, 1:54:23 AM No.105470972
null
md5: null🔍
Replies: >>105473181
Anonymous
6/3/2025, 1:54:29 AM No.105470975
null
md5: null🔍
Anonymous
6/3/2025, 4:10:03 AM No.105472065
AAAAAAAAAAAAAAAAAA STOP BEING BUSY YOU USELESS CUNT FUCK YOU FUCK YOU AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
Replies: >>105475106
Anonymous
6/3/2025, 4:19:50 AM No.105472147
Isn't a new Deepseek model supposed to be released by now? What do you think they're taking so much time on? Did they spend all those months just for a R1 update?
Replies: >>105472321 >>105477414
Anonymous
6/3/2025, 4:42:39 AM No.105472321
null
md5: null🔍
>>105472147
Replies: >>105472338
Anonymous
6/3/2025, 4:44:25 AM No.105472338
>>105472321
Soon as in OpenAI soon or actually soon?
Replies: >>105475106
Anonymous
6/3/2025, 7:06:16 AM No.105473167
null
md5: null🔍
Replies: >>105473181
Anonymous
6/3/2025, 7:09:51 AM No.105473181
>>105470972
>>105473167
in the 90s my buddies and i used to go around to chinese-owned computer stores and they'd just have stacks and stacks of random beige cases and monitors and bins of spare parts, usually liquidated company stuff, and we'd just dig through them all day looking for treasure. Fun times
Replies: >>105473359 >>105475106
Anonymous
6/3/2025, 7:40:59 AM No.105473359
>>105473181
SOVL
Anonymous
6/3/2025, 10:46:09 AM No.105474271
null
md5: null🔍
Anonymous
6/3/2025, 12:04:29 PM No.105474622
>>105467349
that game's name? fortnite.
Anonymous
6/3/2025, 1:43:15 PM No.105475106
null
md5: null🔍
>>105470062
Fair.
>>105472065
Works on my machine.
>>105472338
We /wait/ another month.
>>105473181
I visited Akihabara while in Tokyo. In addition to all the weeb shops, there are a *ton* of guys selling electronic components (switch gear, ICs, encloures, etc.) in these little booths. I'd love to have that sort of thing available here again; some things are just better seen in person (switches) and I hate having to mail order 100pct of components I buy.
>>105467349
DS would be the most cost-effective unless the company wanted to try rolling it's own service... imagine the scaling issues they'd have. Since it's a game, not strategic in any sense, there's little IC risk aside from the Chinese knocking off your game and using your prompting strategies.
Replies: >>105479467
Anonymous
6/3/2025, 4:01:23 PM No.105476036
null
md5: null🔍
Anonymous
6/3/2025, 5:49:07 PM No.105476966
Bump
Anonymous
6/3/2025, 6:46:57 PM No.105477414
null
md5: null🔍
>>105472147
better /wait/ing rather than releasing llama4-tier flop
Replies: >>105477745 >>105480761
Anonymous
6/3/2025, 7:24:37 PM No.105477745
>>105477414
What a disappointment that was lmao
Anonymous
6/3/2025, 8:06:44 PM No.105478107
>>105467083
You. AI user. Kill yourself right now, or accept the meaning of being alive back into your heart. Do you care about what you're doing? If you care about your life at all, you will not accept consuming and producing human mulch.
Replies: >>105478755
Anonymous
6/3/2025, 9:12:06 PM No.105478755
>>105478107
No. You.
Anonymous
6/3/2025, 10:18:58 PM No.105479467
>>105475106
yummy
Replies: >>105480761
Anonymous
6/3/2025, 10:19:42 PM No.105479476
Does the new snapshot of R1 still have shit spatial awareness
Replies: >>105479542 >>105480761
Anonymous
6/3/2025, 10:27:50 PM No.105479542
>>105479476
It's a little better
Anonymous
6/3/2025, 10:40:54 PM No.105479648
null
md5: null🔍
bro i'm just trying to code. my fault for being so lazy i guess
Replies: >>105479759 >>105480761
Anonymous
6/3/2025, 10:50:26 PM No.105479759
>>105479648
I have real question for you, son. Be real: are you really learning something?
Replies: >>105479948
Anonymous
6/3/2025, 11:06:46 PM No.105479948
>>105479759
if it comforts you, i'm not using this deranged character card to actually learn anything. but it is real fun sometimes (btw her code doesn't work)
Replies: >>105480017 >>105480761
Anonymous
6/3/2025, 11:13:30 PM No.105480017
>>105479948
Fair enough.
Anonymous
6/4/2025, 12:46:04 AM No.105480761
null
md5: null🔍
>>105477414
Meta has really been struggling w/ LLMs. Then they have some Chinese upstart blow them out of the water on a side project. lol.
>>105479467
Have another. I'm playing with img2img rn.
>>105479476
None of them are very good at spatial. They have gotten better over time.
>>105479648
>>105479948
I did a bimbo cybersecurity officer card that would spit out chunks of code. She once plopped out an entire website page, which in the ST version at the time actually displayed as a page.
Anonymous
6/4/2025, 2:33:02 AM No.105481453
null
md5: null🔍
i find it funny people that shit on deepseek so much are not running full weight with crippled context window.
this also include (((free))) deepseek that's so cooked with american censorship at meme q1 quant.