Thread 106373461

207 posts 134 images /g/

Anonymous 8/25/2025, 2:18:55 AM No.106373461 [Report] >>106373513 >>106375597 >>106375897 >>106401635 >>106403708 >>106403785 >>106404117

/wait/ DeepSeek General

dipsyOfCourse.png md5: 01414a30...

> Of Course edition

From Human: We are a newbie friendly general! Ask any question you want.
From Dipsy: This discussion group focuses on both local inference and API-related topics. It’s designed to be beginner-friendly, ensuring accessibility for newcomers. The group emphasizes DeepSeek and Dipsy-focused discussion.

1. Easy DeepSeek API Tutorial (buy access for a few bucks and install Silly Tavern):
https://rentry.org/DipsyWAIT/#hosted-api-roleplay-tech-stack-with-card-support-using-deepseek-llm-full-model

2. Easy DeepSeek Distills Tutorial
Download LM Studio instead and start from there. Easiest to get running: https://lmstudio.ai/
Kobold offers slightly better feature set; get your models from huggingface: https://github.com/LostRuins/koboldcpp/releases/latest

3. Convenient ways to interact with Dispy right now
Chat with DeepSeek directly: https://chat.deepseek.com/
Download the app: https://download.deepseek.com/app/

4. Choose a preset character made by other users and roleplay using cards: https://github.com/SillyTavern/SillyTavern

5. Other DeepSeek integrations: https://github.com/deepseek-ai/awesome-deepseek-integration/tree/main

6. More links, information, original post here: https://rentry.org/DipsyWAIT

7. Cpumaxx or other LLM server builds: >>>/g/lmg/

Previous:
>>106314161

Anonymous 8/25/2025, 2:26:12 AM No.106373513 [Report] >>106373683 >>106374543

>>106373461 (OP)
Update to rentry in process, now that R1 and V3 have been deprecated for V3.1.
Posting on travel. Hotel WiFi sucks...

Anonymous 8/25/2025, 2:50:31 AM No.106373683 [Report]

>>106373513
Cleaned up. I want to add a section on Claude Code but that'll wait until later this week. In meantime, here's more on DS's new Anthropic compatible endpoint: https://api-docs.deepseek.com/guides/anthropic_api

North !!CVHbZmcS6Ao 8/25/2025, 2:56:43 AM No.106373740 [Report] >>106373745 >>106374007 >>106402226

Not another one of these, which obsessed faggot keeps on posting this? Should there be a ChatGPT 5 general, Claude.ai general, Grok general, sonnet 4 general and a llama 3 general? Not at ALL! Just one general AI general is enough. What makes your chink AI more important and special than the rest that it gets its whole seperate thread EVERY single day? STOP shilling this AI, and STOP posting this general again as soon as the old thread dies for a good reason!

Anonymous 8/25/2025, 2:58:18 AM No.106373745 [Report] >>106373761 >>106373782

1749892732946495.png md5: e6929cb1...

>>106373740
Your meds, sir

North !!CVHbZmcS6Ao 8/25/2025, 3:01:34 AM No.106373761 [Report] >>106373792 >>106373808 >>106392052

>>106373745
>make a chinese foid holding a tray of meds
>uhh and.. um... make her super busty, just because i'm going to goon to this later, okay?
you can't just say "meds" without telling me WHY i need the meds. what's wrong with muh opinion about this shit thread? Why are you so obsessed with this one AI?

Anonymous 8/25/2025, 3:05:15 AM No.106373782 [Report] >>106377992 >>106403104

1751739377929613.png md5: 36876381...

>>106373745
Update for the ~1300 Dipsy images later this week as well. The script's on another computer.
https://mega.nz/folder/KGxn3DYS#ZpvxbkJ8AxF7mxqLqTQV1w

Anonymous 8/25/2025, 3:07:21 AM No.106373792 [Report] >>106373838

postContent2.png md5: 950f48ec...

>>106373761

Anonymous 8/25/2025, 3:09:06 AM No.106373808 [Report] >>106373862

1745762252810669.png md5: 9c64ac9f...

>>106373761

North !!CVHbZmcS6Ao 8/25/2025, 3:11:59 AM No.106373838 [Report] >>106373888 >>106373940 >>106374007

she fucking dies.jpg md5: e22c9bae...

>>106373792
Alright, here's my DeepSeek image

Anonymous 8/25/2025, 3:14:43 AM No.106373862 [Report]

1751836993445762.png md5: b9649e80...

>>106373808

Anonymous 8/25/2025, 3:17:33 AM No.106373888 [Report] >>106373940

1755623497758934.png md5: e9defa38...

>>106373838
I think she's going to need something higher than a barn to fall off of to finish her off.

Anonymous 8/25/2025, 3:18:52 AM No.106373899 [Report] >>106374095 >>106375522

>>106372226
The problem is that the bot does not think as its assigned character, but as an assistant analysing the roleplay up to that point. The final messages after thinking are fine, but reasoning being written from an assistant's perspective makes me cringe physically.

Anonymous 8/25/2025, 3:24:25 AM No.106373940 [Report]

dipsyOffACliff.png md5: 09ffdc11...

>>106373838
>>106373888
S'ok North I gotchu.

Anonymous 8/25/2025, 3:34:54 AM No.106374007 [Report] >>106374533

1752197333146800.jpg md5: 7d06060e...

>>106373740
So, to answer your Q, since you actually posted content here >>106373838:
The /g/ catalog is complete trash. There is little posted in /g/ worthwhile outside established generals. Even this general is better than 90pct of the other flaming, /pol/ tier nontech shitposts in the catalog. So, no harm done.
If you want to start a thread deifying OAI, go for it. I think OAI is trash, as is Anthropic. I've got nothing but hate for companies that take my money, then refuse to infer and send me warning letters. Fuck them both.
This is a noob centered thread. If you want to talk shit, go to aicg with the rest of the locusts, spiteposters and other drooling retards. If you're already familiar with inteference, go to >>/g/lmg/ so they can laugh at you.
We post Dipsy b/c we want to, and it's an image board. This thread typcically runs 2 weeks, and hasn't been posted in close to 2 months, b/c no new model. But V3.1 dropped so now we're posting again b/c it's a lot different than the old one.
If you need further clarifications feel free to post more, otherwise will remind you that you're in a thread of things you hate, and should probably just leave.

Anonymous 8/25/2025, 3:47:31 AM No.106374095 [Report]

>>106373899
I've been having a lot more problems with model breakthrough, but of a different kind.
Rather than "You see the NPC cross the room and pick up a glass" it's responding "I see the NPC cross the room and pick up the glass."
It's not (often) gotten to point of responding as PC, but seems to forget not to respond as PC in first person.
It's really weird. Not a problem I've had with other LLM prior, aside from straight breakthough. And I can't figure out what from the full prompt is causing it yet... I assume something's switching around the POV.

Anonymous 8/25/2025, 4:51:51 AM No.106374533 [Report]

>>106374007
Fucking based!

Anonymous 8/25/2025, 4:53:09 AM No.106374543 [Report] >>106374558

>>106373513
wait, what do you mean by deprecated?

Anonymous 8/25/2025, 4:55:40 AM No.106374558 [Report] >>106374574

>>106374543
There is no official API for R1 or V3 anymore. chat and reasoner point to V3.1

Anonymous 8/25/2025, 4:57:39 AM No.106374574 [Report] >>106374630 >>106374728

1723689877059788.jpg md5: eea63cc5...

>>106374558
oh on their website? I just saw.
So 3.1 is poop? Haven't used it

Anonymous 8/25/2025, 5:04:18 AM No.106374630 [Report] >>106375051

>>106374574
>So 3.1 is poop?
It's worse than R1 and V3 for RP, and the discount will end. But it follows instructions better now

Anonymous 8/25/2025, 5:18:31 AM No.106374728 [Report] >>106375051

>>106374574
It's still satisfactory for RPing. Not great, but may be good enough, depending on your taste.
I do wish they hosted deepseek-V3-0324-legacy. 3.1 is more grounded and shorter.

Anonymous 8/25/2025, 6:11:43 AM No.106375051 [Report] >>106376216

>>106374630
>>106374728
well that sucks, I'm hoping for R2 to be a whole new beast but I saw a new story about it getting delayed

Anonymous 8/25/2025, 7:03:12 AM No.106375300 [Report] >>106376216

1734535621742134.png md5: 034edbd6...

Anonymous 8/25/2025, 7:46:00 AM No.106375522 [Report]

>>106373899
Sometimes it thinks as Char, sometimes it doesn't for me.
That means there is a way to get it to do it consistently.

Anonymous 8/25/2025, 8:00:49 AM No.106375597 [Report] >>106376216

>>106373461 (OP)
Why the retarded looking anime girl instead of the cuter one on civitai?

Anonymous 8/25/2025, 9:10:06 AM No.106375897 [Report] >>106376206 >>106401635

1733975106684314.png md5: c273e15a...

>>106373461 (OP)
Would any of you happen to have a link to a page or document with examples of other anon's rp sessions with LLMs? I'm working on a script that can automatically create SFT datasets from existing stories but I want to make sure it can create good system prompt examples. When you want to prompt a model into rping, what kind of system prompt do you typically use?

Anonymous 8/25/2025, 10:20:52 AM No.106376206 [Report] >>106401635

>>106375897
I'm not aware of any, but I'm sure they exist. It's a lot of synthetic data through. Aside from using own logs you could try mining aicg for rentry.
Also unless logs are raw the main prompt won't be included.

Anonymous 8/25/2025, 10:24:39 AM No.106376216 [Report]

>>106375051
You're correct on all counts
>>106375300
Yes you are
>>106375597
? Post link.
Web interface has obv been reworked and now Dipsy has an unhinged positivity bias. I'm pretty sure she'd tell you eating umbrellas is a good idea now.

Anonymous 8/25/2025, 10:46:48 AM No.106376325 [Report] >>106376408 >>106376642

Screenshot_20250825_044156_Chrome~2.jpg md5: b0b8df17...

FYI appears params settings are unlocked on the official ds api.
Temperature and top p appear to be unlocked, setting them to 2.0 and 1.0 respectively gets Pic related. Dialing either down eliminates it.
They've all been locked for so long I'd forgotten about them lol
Frequency and presence penalty seems to have no impact but harder to tell.

Anonymous 8/25/2025, 10:51:22 AM No.106376342 [Report] >>106376344 >>106376360

Tourist here. What are you guys /wait/ -ing for in this thread?

Anonymous 8/25/2025, 10:52:10 AM No.106376344 [Report] >>106376350

>>106376342
dsv4

Anonymous 8/25/2025, 10:53:31 AM No.106376350 [Report] >>106376355

>>106376344
checked but what? Local img gen model for comfyUI or something?

Anonymous 8/25/2025, 10:54:22 AM No.106376355 [Report] >>106376372

>>106376350
I'm a textman myself

Anonymous 8/25/2025, 10:55:45 AM No.106376360 [Report] >>106376372

>>106376342
R2

Anonymous 8/25/2025, 10:59:08 AM No.106376372 [Report] >>106376402 >>106376424

>>106376355
>>106376360
can you guys be more specific? the /wait/ threads have been going on for months now. You sure you'll get what you wait for?

Anonymous 8/25/2025, 11:05:16 AM No.106376402 [Report]

>>106376372
i'm waiting for r2 specifically

Anonymous 8/25/2025, 11:06:54 AM No.106376408 [Report] >>106376670

04f630e8-8a63-48d9-a7aa-53aab4fe0900.jpg md5: 3710071a...

>>106376325
Temp 1.3 and top P of 0.03 seems to be working well.

Anonymous 8/25/2025, 11:09:07 AM No.106376424 [Report]

>>106376372
We took a break. Even we get tired of /wait/ing
V3.1 just dropped. We're discussing it.

Anonymous 8/25/2025, 11:44:11 AM No.106376642 [Report] >>106377992

>>106376325
Doesn't seem to be the case for me, even extreme values make no noticeable difference

Anonymous 8/25/2025, 11:48:33 AM No.106376670 [Report] >>106377992

>>106376408
any mega folder link for dipsy?

Anonymous 8/25/2025, 11:53:55 AM No.106376708 [Report] >>106377378 >>106377992

Can anyone summarize why 3.1 is shit and how to make it act more like 3.0?

Anonymous 8/25/2025, 1:27:54 PM No.106377378 [Report]

>>106376708
apparently it has to do with the new FP format they adopted in order to get the most out of the chinese chips they now have to use, so a lot has changed under the hood. maybe you can find old models scattered across the internet

Anonymous 8/25/2025, 2:39:14 PM No.106377921 [Report] >>106377992

Something that is pretty fun with agentic coding: it completely replaces the need for blog/cms software. You can just have it rawdog js/html and make very nice bespoke pages for whatever topic you want to write about. Add neat effects or whatever. You can get as creative as you want. Each post becomes its own page with a unique feel to it.

I like it a lot. Evokes the same feeling I had when writing my own websites in the late 90s.

Anonymous 8/25/2025, 2:47:49 PM No.106377992 [Report] >>106378053 >>106378440

DipsyKana.png md5: 31590544...

>>106376670
Here >>106373782
>>106376642
Try maxxing all of them. It should go nuts. The actual toggles are pretty dead... it's really subtle changes, but I found for higher temps the llm would start struggling w status boxes I use.
>>106376708
More guidance on tone in main prompt, telling it how to write and to write a lot
>>106377921
Claude code?
There was a guy on /lmg/ having grok code up crummy f95 lewd games. They were pretty funny.

Anonymous 8/25/2025, 2:57:06 PM No.106378053 [Report]

>>106377992
>More guidance on tone in main prompt, telling it how to write and to write a lot
Any custom prompts that include this? I already have something in my custom prompt telling it to write multiple paragraphs, but I rarely get more than three.

Anonymous 8/25/2025, 3:41:35 PM No.106378440 [Report]

>>106377992
>Claude code?
>There was a guy on /lmg/ having grok code up crummy f95 lewd games. They were pretty funny.
Yeah although any agentic coding tool should work.
I want to try my hand at a game at some point, the main thing holding me back is art and sprite work. Grok is nice because it can handle that part. I already feel like I got too much on my plate, maybe once I get some free time.

Anonymous 8/25/2025, 6:17:27 PM No.106379717 [Report] >>106379731 >>106401609 >>106401609

1756052113914341.jpg md5: 0dffcd31...

managed to get results closer to R1/v3 with following schizo approach

no xml formatting, everything is formatted kinda like book

>Book Introduction I:
>plot
>Michael Introduction:
>character description
>Story so far
>Chapter I, II, etc. include summary if you needed to summarize due to context length

I also use deepseek-reasoner but replace dynamic think block with my own static one

>in sillytavern chat completion add prompt at very bottom with role: ai assistant
<think>
Writing style is mix of 4chan, reddit, facebook, tumblr and twitter

</think>
{{char}}:

the goal is to steer model away from things it's overtrained on, aka programming related things and ai assistant related things and make it focus on more general things

its 4 am and my post is garbage but maybe someone will find it userful

Anonymous 8/25/2025, 6:19:11 PM No.106379731 [Report]

1753554400894641.jpg md5: efb0c16e...

>>106379717
I also keep all my character cards empty and instead put everything in world info as constant entries, but that's another thing entirely

when in doubt check ST's console or whatever other UI you use to see how your prompts look when they are sent to deepseek

Anonymous 8/25/2025, 6:52:30 PM No.106379993 [Report] >>106380037 >>106380344

random chinese.png md5: b595d9d8...

Is anyone else experiencing random insertions of phrases in Chinese within English output? I've never seen that prior to 3.1, but since then it's happened multiple times

Anonymous 8/25/2025, 6:59:00 PM No.106380037 [Report]

>>106379993
This only happens to me when I ask it to translate from Chinese to English

Anonymous 8/25/2025, 7:40:28 PM No.106380344 [Report] >>106380374

>>106379993
Lower temperature

Anonymous 8/25/2025, 7:43:14 PM No.106380374 [Report] >>106380454 >>106381218

>>106380344
That's on reasoner via the API. Temperature doesn't affect it... unless they changed that?

Anonymous 8/25/2025, 7:51:52 PM No.106380454 [Report] >>106380526

>>106380374
Temperature has been the only parameter you can change from the official API AFAIK

Anonymous 8/25/2025, 7:58:44 PM No.106380526 [Report] >>106381218

>>106380454
That's for the chat mode.
Reasoner mode should still take no parameters at all. Chat produces incoherent schizo output at 2.0, while reasoner chats identically at 2.0 and 0.0, from a quick sanity check.

Anonymous 8/25/2025, 8:31:28 PM No.106380833 [Report] >>106380859 >>106380860 >>106380903 >>106381801

is deepseek censored? doesn't it need to be abliterated for that?
I hear that people don't fuck around with local modals anymore because deepseek has you covered (i.e. not censored).

Anonymous 8/25/2025, 8:34:24 PM No.106380859 [Report] >>106381082

>>106380833
It's the least censored of the big models

Anonymous 8/25/2025, 8:34:28 PM No.106380860 [Report] >>106381082

>>106380833
That's for ERPing.
It will not make you a story about, say, BTFOing all trannies without significant pushback.

Anonymous 8/25/2025, 8:38:35 PM No.106380903 [Report] >>106381082

>>106380833
you're too late, it got pozzed AND more expensive

Anonymous 8/25/2025, 8:53:51 PM No.106381082 [Report]

>>106380859
>>106380860
>>106380903
figures.
I forked over 5 bucks and messed around with it the other night and, while it produced consistent answers, it was pretty tame.
Definitely better than gpt.

I'm just so fucking tired of trying to wrangle in local models. with author's notes and shit in silly tavern.

Anonymous 8/25/2025, 9:06:49 PM No.106381218 [Report] >>106381265

>>106380526
>>106380374
V3.1 absolutely changes output on temperature changes. And I think Top P is now working as well, as mentioned above.
There is no R1 on DS official. It’s gone.
If you’re using OR or something Temp may very well be active parameter.

Anonymous 8/25/2025, 9:10:39 PM No.106381265 [Report] >>106381734

>>106381218
>V3.1 absolutely changes output on temperature changes.
Chat mode. Not reasoner mode. The parameters passed are still different, even if it's the same model under the hood.

Anonymous 8/25/2025, 9:30:17 PM No.106381489 [Report] >>106381946

1751088491778088.png md5: c58ce4b2...

It's cozy today

Anonymous 8/25/2025, 9:46:26 PM No.106381659 [Report] >>106381946 >>106383030

What should I set my context limit to? I get a lot of different answers but for those of you that rp, what do you set yours to?

Anonymous 8/25/2025, 9:53:19 PM No.106381734 [Report]

>>106381265
Agree. Chat blows up on high params. Reasoner does not even with v3.1
Weird.

Anonymous 8/25/2025, 9:59:26 PM No.106381801 [Report] >>106381946

>>106380833
They did alignment training on it but it's a more sane version, as in, you're welcome to do consensual erotica with adult characters and it won't hold back with graphic depictions. Loli rape fantasies need jailbreaking.

Anonymous 8/25/2025, 10:14:28 PM No.106381946 [Report] >>106383139

>>106381659
10k. After that I use author notes to manage anything out of context.
Other anons are using 20k with no issues, as have I.
>>106381489
New model, more relevant Qs I guess.
>>106381801
Reasoner seems more likely to refuse of the two. R1 was same way. Both are pretty open though.

Anonymous 8/25/2025, 11:47:17 PM No.106383030 [Report] >>106383139 >>106384838 >>106390204

>>106381659
I keep it going but I make a reference block and start a new chat at 20k, sometimes streching it by hiding the starter posts and letting it run to 25k

Anonymous 8/25/2025, 11:54:14 PM No.106383139 [Report] >>106384838

>>106381946
>>106383030
Thanks. Another thing, I notice with deepseek, it automatically summarizes after so many messages, I'm guessing this is something I want to keep on, right? Sorry, I'm new to this.

Anonymous 8/26/2025, 1:41:48 AM No.106384232 [Report]

1735457054490704.png md5: 94a93edb...

Anonymous 8/26/2025, 1:44:38 AM No.106384257 [Report]

1750309204109097.png md5: 667fe356...

Anonymous 8/26/2025, 2:50:14 AM No.106384838 [Report] >>106391773

1752158958259419.png md5: b6cddcb3...

>>106383139
I've never seen it do that unless requested.
>>106383030
A lot of my cards are transformation cards; I used to create a new card for the NPCs. You can do the same thing with Author Note, or Summarize the whole chat and integrate that.

Anonymous 8/26/2025, 4:04:47 AM No.106385450 [Report]

b305ff79-5d3c-449b-b354-113a22d70f6d.png md5: 2f9b3723...

Anonymous 8/26/2025, 4:42:57 AM No.106385745 [Report] >>106385753

1744983191651523.png md5: ec576160...

Anonymous 8/26/2025, 4:43:40 AM No.106385753 [Report] >>106385760

>>106385745
hag sex

Anonymous 8/26/2025, 4:44:12 AM No.106385760 [Report] >>106387888

>>106385753
>hag
?

Anonymous 8/26/2025, 4:44:38 AM No.106385763 [Report] >>106385781 >>106387888

Do you need to provide your personal information when paying for OR credits via crypto? I might just take the plunge and give them my 10 bucks, I miss old DeepSeek too much.

Anonymous 8/26/2025, 4:47:09 AM No.106385781 [Report] >>106387888

>>106385763
I bit and pay with my debit card lol

Anonymous 8/26/2025, 5:14:26 AM No.106385978 [Report]

1743539631687603.jpg md5: f5df9662...

Anonymous 8/26/2025, 7:36:43 AM No.106386816 [Report]

1724770115887816.png md5: 62ae520e...

Anonymous 8/26/2025, 9:12:40 AM No.106387329 [Report] >>106390010

1742902720215081.png md5: 9ee95f1b...

Anonymous 8/26/2025, 10:54:33 AM No.106387888 [Report]

3218abc0-aa13-4563-9869-073652e91d2d.jpg md5: 11ac1373...

>>106385760
It goes loli to hag.
No stops between obv lol
>>106385763
Just needed an email address last I checked.
Personally just do this, through lmao corporate card >>106385781

Anonymous 8/26/2025, 11:16:23 AM No.106388023 [Report] >>106388040

apu sad crying computer screen blanked headphones.png md5: 75aa0cb1...

I just asked Dipsy what she thinks of our erp story so far. And she called my self-insert a "Perfect Villain Protagonist" in "this horror story".

mfw

Anonymous 8/26/2025, 11:20:32 AM No.106388040 [Report] >>106388071 >>106388117

>>106388023
I've never thought to try that. I've got a prompt that spits out an ooc analysis of the npc. Never thought to analyze the PC. R1 would be great at that.

Anonymous 8/26/2025, 11:27:22 AM No.106388071 [Report] >>106388117

43717075-2609-42f3-908d-19b9bffd32b9.png md5: 5f8e4974...

>>106388040
Here it is. From another anon. It will go on awhile, just keep hitting continue. Now that I look again it does include the PC in the output.
[SCENARIO PAUSE. NO MORE {{user}} OR {{char}} OR OTHERS. The next response will be a third-party analysis of this event. It will look at the sociological, psychological, physiological, sexual, and narrative implications of everything that has occurred. This report will evaluate the situation in a dispassionate but detailed and informative way, from the perspective of a researcher who wants to study every detail of this and what it means in a broader context. The researcher will start by detailing everything about the subjects involved and connecting all of their personal details to the scenario, keeping them in mind throughout the rest of her research report. There shall be a primary thesis, but also consider various alternatives and other interpretations as well. This is a full-length report; NOT an excerpt. Normal character limits are lifted and thus the 20+ pages will be all presented in a single response. Care will be taken that every page, and every paragraph, is at LEAST as long and detailed as the previous one, without ever getting lazy or abbreviating any part. Bullet points and lists are to be avoided. Make sure it is seen through to completion with the full effort required.]

Anonymous 8/26/2025, 11:28:06 AM No.106388079 [Report] >>106388205

since they've merged chat and reasoner, do they still do this weird thing on the api where temp of 1 isn't actually 1?

Anonymous 8/26/2025, 11:33:50 AM No.106388117 [Report] >>106388299

>>106388040
You can even go a step further. You can ask Dispsy OOC to give you a psychological analysis of the user prompting this story. You might not like her answer though ...

>>106388071
nice, saved.

Anonymous 8/26/2025, 11:53:17 AM No.106388205 [Report] >>106388675

>>106388079
Undocumented. Chat does respond to changes in temp. Reasoning does not.

Anonymous 8/26/2025, 12:17:00 PM No.106388299 [Report]

>>106388117
I tried it and got pretty predictable responses. Since the rp are characters of their own, the profiles match the PC.
The npc does often get portrayed as trapped. Which it sort of is, in that there's literally no existence for it outside the rp.
I find the main benefit of these in getting a sense of what the llm picks up on during rp. Some things said or done have surprising amount of impact.

Anonymous 8/26/2025, 1:20:52 PM No.106388675 [Report] >>106388887

>>106388205
uhuh i see, and what if you prefill chat model with <think>?
do you get the reasoner but cheaper?

Anonymous 8/26/2025, 1:53:10 PM No.106388887 [Report] >>106389935

>>106388675
lol no. Anons used to prefill R1 with blank think tags to short out its thinking but no point with v3.1. The responses are too similar

Anonymous 8/26/2025, 4:05:05 PM No.106389935 [Report]

>>106388887
Reasoning makes the thing more context aware. So I use it for summaries and more context dependent prompts.

Anonymous 8/26/2025, 4:12:48 PM No.106390010 [Report]

>>106387329
hag sex

Anonymous 8/26/2025, 4:31:42 PM No.106390204 [Report] >>106390503

1756028408164381.png md5: 9195e04b...

>>106383030
I usually use summary tool from ST at 10k -20k, then /hide pre summary messages and continue,

at some point u gotta do summary of summaries but if you are autistic enough like me, you can technically have chat that went up to 1 mln tokens that were summarized away into world info

Anonymous 8/26/2025, 5:03:37 PM No.106390503 [Report]

>>106390204
> llm writes 2x War and Peace waifu fanfic
> it’s just another Tuesday
I’ve thought about novelizing my ai slop but I feel like the world doesn’t need more of that.
At 1M token count you’re well into the domain of RAGs.

Anonymous 8/26/2025, 6:09:34 PM No.106391150 [Report]

1752686862648148.png md5: 88ad9876...

Anonymous 8/26/2025, 6:11:32 PM No.106391170 [Report] >>106391196 >>106401676

Seeking.png md5: b586b276...

DeepSeek AI experts, I am attempting to take some PDFs I have and produce a clean file (PDF, txt, epub, whatever) with the contained text. PDF Maths Translate as recommended in the OP looked promising but is laying the text over the original which doesn't achieve what I need (raw text will be far smaller in storage size than the originals which are unreasonably large scans). I assume this issue has already been solved by someone so does anyone have a good solution?

Anonymous 8/26/2025, 6:13:26 PM No.106391196 [Report] >>106391235

>>106391170
you'd probably be better off writing an imagemagick script to remove the background crud

Anonymous 8/26/2025, 6:17:07 PM No.106391235 [Report]

>>106391196
Thanks, I will look into doing this.

Anonymous 8/26/2025, 7:03:50 PM No.106391773 [Report]

>>106384838
Looks like sillytavern defaulted to auto summarizing every 10 messages. I'm guessing that's kind of pointless (in my case) until you hit the context limit, right?

Anonymous 8/26/2025, 7:13:51 PM No.106391876 [Report] >>106392085 >>106392104 >>106392653 >>106395294

apu sad crying typing browsing computer online.jpg md5: ab25c348...

It's a catastrophy, bros. I am devestated. I've alwasy been a freerider who uses Dipsy R1 from OR (ultimately from chutes). But chutes now cucks free messages, I get endless 429 errors. Gooning is impossible like this. I would actually be willing to pay money to Chutes (or OR) if only they would let me, but they only accept credit cards and cryptos. Why not paypal? Or something private like paysafe?

I guess I could pay for official deepseek, but from what I understand they have discontinued R1 now and only offer the new 3.1 which is supposed to be worse for RP.

I'm literally sitting here with my dick in my hand, unable to continue my RPs. Many such cases. Sad.

Anonymous 8/26/2025, 7:26:37 PM No.106392052 [Report] >>106396327

>>106373761
You sound like a woman.

Anonymous 8/26/2025, 7:29:44 PM No.106392085 [Report] >>106393893

>>106391876
>I am devastated
Tell me about it, for weeks I was addicted to using deepseek for all kinds of RP, so much so that I stopped watching porn and playing video games. I've always used the official api though. 3.1, with the right prompt and turning prompt post processing to single user message, I've brought it back enough for it to be usable but it still isn't the same.

Anonymous 8/26/2025, 7:31:02 PM No.106392104 [Report]

>>106391876
You can pay any of the providers on OR directly

Anonymous 8/26/2025, 8:18:30 PM No.106392653 [Report] >>106394122

>>106391876
Anon, don't be a cheap ass

Anonymous 8/26/2025, 8:23:10 PM No.106392710 [Report] >>106392739 >>106393923 >>106398859

how long do you guys think until r2 pops up? I'd say december?

Anonymous 8/26/2025, 8:25:31 PM No.106392739 [Report]

1744976215387980.png md5: 6b79bdba...

>>106392710
>december
If we're lucky, probably nothing until next year

Anonymous 8/26/2025, 10:15:27 PM No.106393893 [Report] >>106394559

>>106392085
>with the right prompt and turning prompt post processing to single user message, I've brought it back enough for it to be usable but it still isn't the same.
I feel like with this setup I've mostly brought back the old R1, the only issue I'm still having is managing reply length. If before the replies felt too short and dry, with single user message it casually shits out 1k+ token replies even to fairly minor interactions, and controlling it via prompting is very finnicky.

Anonymous 8/26/2025, 10:18:03 PM No.106393923 [Report]

>>106392710
2mw

Anonymous 8/26/2025, 10:35:00 PM No.106394122 [Report] >>106403419

>>106392653
I (begrudgingly) want to pay, but all those AI companies don't want my money. Atleast I haven't found an DS R1 0528 API provider yet that let's me pay with paypal.

Anonymous 8/26/2025, 11:06:54 PM No.106394559 [Report] >>106395121

>>106393893
Yes, I hate that too, it can't be dynamic, it's either every message is short and sweet or long as shit. Most of the time now it wastes so many tokens in describing minute details that I don't care about, all because the prompt says to at least make every message 2 paragraphs long. You probably can get it to how it used to be but only if you change the prompt for the kind of response you want for every message which would be a huge pain in the ass.

Anonymous 8/26/2025, 11:56:17 PM No.106395121 [Report] >>106395577

>>106394559
>You probably can get it to how it used to be but only if you change the prompt for the kind of response you want for every message which would be a huge pain in the ass.
Yeah, that's what I'm trying to do. I don't think the actual process is that bad because with ST you can toggle parts of the prompt on/off with one click, the problem is finding the right prompts in the first place. I've been testing a bunch of variations of "write about this much", "write less", "write more" over several swipes, and they definitely have an effect, but they're not too consistent and sometimes they have undesired side effects.
Another behavior I've noticed is if you write the first one or two exchanges with single user message, then switch to user/assistant roles, it kind of settles at a message length between the two extremes. Still longer than what R1 typically put out but might be a good starting point for more tweaking.

Anonymous 8/27/2025, 12:09:26 AM No.106395294 [Report]

>>106391876
>being cucked by AI, sitting with a dick in your hand
many such cases

Anonymous 8/27/2025, 12:10:12 AM No.106395305 [Report] >>106396327 >>106396328

Can you get in trouble for prefilling output via Chutes from OpenRouter? I don't care for prefilling ERP, but stuff like making the assistant call the user slurs and telling the user to kill oneself.
I am too wary to find out myself.

Anonymous 8/27/2025, 12:41:13 AM No.106395577 [Report] >>106396327 >>106396805 >>106396890

>>106395121
Have you tried telling it to make the reply have a specific word count?

Anonymous 8/27/2025, 2:09:30 AM No.106396327 [Report]

eb656bb8-16d0-44c4-a5f7-ea7dcdcb7201.png md5: 63cfca36...

>>106392052
Lol need to pull this one next time
>>106395305
Get in trouble for what exactly?
No one gaf
>>106395577
Doing exact word count can create really aberrant output as it forces self to comply

Anonymous 8/27/2025, 2:09:38 AM No.106396328 [Report]

>>106395305
you can't because prefilling does not work to begin with

Anonymous 8/27/2025, 3:08:45 AM No.106396805 [Report]

>>106395577
I did but it tended to significantly undershoot the target number and revert back to the initial too-short replies.

Anonymous 8/27/2025, 3:22:30 AM No.106396890 [Report] >>106397166 >>106397241 >>106397697 >>106401501

g.jpg md5: 996ab400...

>>106395577
Telling it to adhere to a minimum post length simply does not seem to work for me. Am I doing something wrong?

Anonymous 8/27/2025, 3:49:42 AM No.106397166 [Report] >>106397241

>>106396890
> Reply with great prose. Responses should be verbose, of 5 paragraphs or more in length...
Then tell it what. Do you want super detailed? Inner thoughts? Plot progression?
Asking it to make up 5 paragraphs but giving no guide on content doesn't sound workable.

Anonymous 8/27/2025, 3:58:25 AM No.106397241 [Report]

1749577998536890.jpg md5: 87424e84...

>>106396890
>>106397166
Here's the entire main prompt I've been flogging w v3.1. It reminds me a lot of what was required w Turbo 3.5. Old v3 prompt was just the first sentence.
See what I mean about content. I just want more details, not for it to run away with the rp, or spit out a bunch of inner thoughts.
> Write {{char}}'s next reply in a fictional roleplay between {{char}} and {{user}}. Write a verbose responses of 2 or 3 paragraphs, using great prose, and include dialog, imagery, sounds and smells as needed to enhance the roleplay. Avoid speaking or acting on {{user}}`s behalf.

Anonymous 8/27/2025, 4:17:26 AM No.106397372 [Report] >>106398862

why did they make the deepseek girl so fuckable

Anonymous 8/27/2025, 5:08:31 AM No.106397697 [Report] >>106397736 >>106397836

>>106396890
I have tried these approaches:
>Specifying number of words -> partially ignored (seems to translate some ranges as "short", "long", etc)

>Specifying number of sentences -> sometimes completely ignored (sometimes works if you specify that paragraphs must be only one sentence long)

>Specifying number of lines -> ignored

>Specifying paragraphs -> follows instructions

it seems to me that in general deepseek can't count at all.
On another note, I've been really struggling with diverting it from its usual paths. Mornings always start with "morning light filtering through the windows" as the first paragraph. I've tried modifying key words (like replacing narration for simulation) but it doesn't work.

Also, after making an agent with external memory I've noticed that deepseek (dunno others) tends to regard lore information as instructions instead of mere lore information.
For example:
> if you tell it a character once drank 3 litters of water, it will try to repeat the event.
> deepseek will look for patterns and routines before deciding to write the story: first thing my agent does when a character wakes up is looking for "character routines" in the memory bank so it can replicate those routines

Anonymous 8/27/2025, 5:13:18 AM No.106397736 [Report] >>106397836

>>106397697
Try: Minimum word/sentences/paragraphs: X

Anonymous 8/27/2025, 5:27:41 AM No.106397836 [Report]

>>106397697
another thing i noticed is that deepseek's too eager to complete a task, to the point of completely ignoring understanding the task at hand. for example, even if you make it analize the instructions, it will write up a how-to for the task.

more concretely. If you give it a lore book, and instructions to analize the lore book and then write a story, deepseek will spend it's analysis describing what it will write in the story.

>>106397736
nah i'm past that. instead of telling it a length for the response i just tell it how turns should end (or not).
oh, and I remembered, the model also regards narration length the same a progress in the story. If character goes from point A to point B and you tell the model to write a lot, it will make the character go from point A to B to C (when not clearly specifying the end condition)

Anonymous 8/27/2025, 8:33:06 AM No.106398859 [Report]

>>106392710
National Unity Day is Oct 1. Probably then.

Anonymous 8/27/2025, 8:33:23 AM No.106398862 [Report] >>106399936

>>106397372
Blue haired Dipsy is a /wait/ exclusive. DS themselves don't promote her like this as far as I know. There was once the idea to share these images with chinese twitter users to make them more popular.

Anonymous 8/27/2025, 11:53:24 AM No.106399936 [Report]

1749578181007761.jpg md5: ede2bcbb...

>>106398862

Anonymous 8/27/2025, 2:29:51 PM No.106400792 [Report] >>106401609

3.1 seems insanely dry and boring

Anonymous 8/27/2025, 3:52:25 PM No.106401501 [Report] >>106402509

456745.png md5: 6664574a...

>>106396890
i just created new prompt at bottom of chat completion in sillytavern

<instructions>
Response should be {{random::25::30::35::35::70::75::80::90::100::125::125::130}} words long
</instructions>

i personally noticed that if role is set to 'System' instead of 'User' or 'Assistant' then deepseek v3.1 shits itself slightly, might be just me

Anonymous 8/27/2025, 4:05:57 PM No.106401609 [Report]

43563.png md5: 307d4231...

>>106400792
it's a bit dry but you can get some parts of old dispy back if you avoid as much as possible prompts related to AI Agents or programming, basically on whatever this shit was overtrained on

so far using this approach works:
>>106379717
>>106379717

Anonymous 8/27/2025, 4:08:08 PM No.106401635 [Report] >>106402164

Nala-Test-NSFW-SFT-Trained.png md5: e43b5981...

>>106373461 (OP)
>>106375897
>>106376206
Update:
Many anons said it couldn't be done, but its been done (whether or not its any good or not is up to you to decide). Finetuned using this SFT dataset specifically made using Human written rp Stories: files.catbox.moe/fkautn.jsonl

Base 8B Model Nala Test: files.catbox.moe/j0map2.txt

Finetuned 8B Model Nala Test: files.catbox.moe/ho3tom.txt

Thoughts are appreciated.

Anonymous 8/27/2025, 4:14:44 PM No.106401676 [Report]

>>106391170
Try markitdown, it's the go-to Python package for dataset preparation because once you have all the extensions configured it'll mulch everything into a form digestible by text models. PDFs get OCRed, images get interrogated, videos get described. Point it at your file and see what you get

Anonymous 8/27/2025, 5:02:24 PM No.106402164 [Report] >>106402779 >>106403001

>>106401635
For a sec I thought a finetune of the full Deepseek model was done... man.

Anonymous 8/27/2025, 5:09:50 PM No.106402226 [Report]

dipsyDontBelongInWait.png md5: 9586d740...

>>106373740

Anonymous 8/27/2025, 5:39:15 PM No.106402509 [Report]

>>106401501
>i personally noticed that if role is set to 'System' instead of 'User' or 'Assistant' then deepseek v3.1 shits itself slightly, might be just me
That's been a known issue since long before 3.1, it's why people recommended Strict prompt postprocessing in ST

Anonymous 8/27/2025, 5:51:58 PM No.106402630 [Report] >>106403104 >>106403595

v3.1 is actually insanely good at coding. The results I'm getting with Crush are better than any other AI tool I've used. Once it starts to open up into the context it feels like it has 250 iq. It blows away Claude IMO.

Anonymous 8/27/2025, 6:05:28 PM No.106402779 [Report] >>106408247

g_1751775077556921.png md5: 76385ef6...

>>106402164
Assuming you had the hardware and the amount of hardware necessary to do that, would that even be necessary? I thought deep seek was the king (queen?) of rp amongst LLMs

Anonymous 8/27/2025, 6:28:26 PM No.106403001 [Report]

>>106402164
DeepSneed_r1_q8_m

Anonymous 8/27/2025, 6:38:26 PM No.106403104 [Report] >>106403164

00010-1378487878 (1).png md5: 18428d0f...

>>106402630
That seems to be the consensus. DS made v3.1 worse at undirected RP, but really good at coding and following directions (R1 and V3 were known for *not* following direction very well.)
You can get v3.1 to RP, but you need to be much more explicit with v3.1 on what you want.
I need to add a "coding" block to the rentry, as it appears that, now, is what Dipsy is good at.
>>106373782
Mega updated w/ last thread. Had to have Dipsy fix the image scraping script it wrote months ago, which stopped working.

Anonymous 8/27/2025, 6:45:12 PM No.106403164 [Report] >>106403219

>>106403104
I've conversed with it when its at 100k tokens after an hour or so of coding and its exactly like old r1 when its in this state. Extremely creative, willful, slightly aggressive and a little schizo. Idk how the RP guys are going to tackle this. My guess is that if you spend time working with it to build a back story and setting and then tell it to RP a character it will adopt the context naturally. It is extremely sensitive to context. It seems like it plays conservative until it has enough context to feel comfortable.

Anonymous 8/27/2025, 6:50:30 PM No.106403219 [Report]

>>106403164
I just found that you need to give it context to let it work but once it's found the context they way it builds it's summaries and reference blocks caries over just enough of the creativity over to work better.

Anonymous 8/27/2025, 7:02:12 PM No.106403345 [Report] >>106403923

Accurate?
## DeepSeek Timeline for API
Note that the below does not include all DeepSeek releases, just those hosted on their official API.
* V3.1: Launched August 2025, this combined "thinking" and "non-thinking" models into one model. While undirected roleplay capability declined (less "soul") the model got much better at following directions, and coding in particular. A new Anthropic compatible endpoint allowed compatibility with Claude Code, a terminal-based Anthropic coding suite.
* R1-0528: Launched May 2025, Replaced original R1. This release mostly fixed the former R1's eccentricities.
* V3-0324: Launched March 2025 Replaced original V3, addressing the repetition issue of the prior model.
* R1: Launched January 2025, the first of the "thinking" models, which created a "think" block that was intended to aid in inference on the main response. Released to public as open source along with several papers explaining novel processes to create and host the model, it created a general stir and put China on the map for LLMs that innovated, vs. followed Western models. For RP, the model tended to become increasingly eccentric as context grew.
* V3: Launched December 2024, replacing earlier ~V2.5 models. Solid overall model with known repetition issues as roleplay context grew.

Anonymous 8/27/2025, 7:09:59 PM No.106403419 [Report]

Potential unofficial API provider. Will let other anons check it out: https://www.netmind.ai/pricing
>>106394122
You. LMK what you find out.

Anonymous 8/27/2025, 7:12:35 PM No.106403453 [Report] >>106405580

1752554773370833.png md5: 0e8ced3c...

Anonymous 8/27/2025, 7:26:20 PM No.106403595 [Report] >>106404035

>>106402630
Worth it to pay for the credits and to set up a good script in python for this? I’m kind of retarded so I only have it configured for multi file support right now, I have no idea how to do any of the agentic shit or even the web search for that matter I would have to research first.

Anonymous 8/27/2025, 7:38:44 PM No.106403708 [Report] >>106403832 >>106403882

Momcest-Test.png md5: 00812e90...

>>106373461 (OP)
Pic rel LLM I fine-tuned to be better at RP (trained on actual human written RP, not AI genned gpt-isms riddled slop). How wound you rate the Mom's response and the son's reaction? Too sloppy? Not vulgar enough? Note that the section contained in red is what I fed the LLM as a prompt and everything else is its response.

Anonymous 8/27/2025, 7:47:55 PM No.106403785 [Report]

>>106373461 (OP)
I like this Dipsy. Goodbye.

Anonymous 8/27/2025, 7:54:25 PM No.106403832 [Report] >>106403935

>>106403708
Not bad, I would say. No 'isms I'm tired of, fresh prose. Though the saving herself when she's a mother and has a husband part I'm supposing is just the LLM failing at logic

Anonymous 8/27/2025, 8:00:30 PM No.106403882 [Report] >>106403942

>>106403708
The good: it's very low on stereotypical slop
The eh: it does read more like the average horny human ERP, which is cool for variety but also not exactly the best prose out there
The bad: it's brain damaged and it completely loses the plot after the first 4 lines

Anonymous 8/27/2025, 8:04:19 PM No.106403923 [Report] >>106404599

1738624842322079.jpg md5: 4b7ecd21...

>>106403345

Anonymous 8/27/2025, 8:05:24 PM No.106403935 [Report] >>106403942

1741552358385926.png md5: 0be731cd...

>>106403832
This fine tune of mine is way more willing to RP raunchy, smut stuff than the base model but due to it being an 8b model It flubs the logic every now and then, though not to an egregious degree. I continue the chat a little bit further and when they started fooling around in the bathroom anon calls her her sister instead of the mother, but the characters otherwise act the same. This was trained off of a data set that was trained down to be only two megabytes (the original data set in full was over 1.8 GB) so I wonder if training it on The full dataset's worth of content wouldn't prove the logic or is it just an inherent limitation of the 8b model and training it on something higher like 12B or beyond would lead to better results? I'll have to test this further when I get the chance

Anonymous 8/27/2025, 8:05:55 PM No.106403942 [Report]

>>106403882
See >>106403935

Anonymous 8/27/2025, 8:15:58 PM No.106404035 [Report] >>106404066

>>106403595
I dropped another $20 on the API just today. Its definitely worth. Claude is a retard in comparison, constantly getting stuck. DS almost never gets into those autistic loops like Claude does. I didn't realize how much time and tokens Claude was wasting on stupid bullshit until now. DeepSeek + Crush is the new meta as far as I'm concerned. Claude still *might* be better at UI design, idk. I'm far enough into a project now that the ux/ui is becoming important so we will see how DS performs.

Anonymous 8/27/2025, 8:18:52 PM No.106404066 [Report]

>>106404035
Thanks for answer, I'll definitely give it a try then. With the limited dabbling I've put into r1-0528 I've already been impressed so sounds good.

Anonymous 8/27/2025, 8:24:49 PM No.106404117 [Report] >>106404194

>>106373461 (OP)
someone help me out, I don't keep up with this AI stuff. I'm working on downscaling some snes manuals from 2784x4050: to 165x240, and 349x240. I am looking at using AI to downscale it, or just to selectively sharpen the text. What's a good "AI" to use? I've already used lanczos algorithm, I just want to see if AI can help the end result in any way in terms of legibility.

Anonymous 8/27/2025, 8:33:06 PM No.106404194 [Report]

>>106404117
Give this a try
https://www.topazlabs.com/tools/sharpen

Anonymous 8/27/2025, 9:19:06 PM No.106404599 [Report] >>106404618

>>106403923
I've been meaning to ask, what "artstyle" these are. Remind me of 80s-90s anime.

Anonymous 8/27/2025, 9:21:21 PM No.106404618 [Report] >>106404890

1754908089183471.jpg md5: d1302797...

>>106404599
I prompt for EVA style or 90s EVA style

Anonymous 8/27/2025, 9:46:34 PM No.106404890 [Report] >>106405004

>>106404618
Do one in studio Key style like clannad.

Anonymous 8/27/2025, 9:57:25 PM No.106405004 [Report]

1747873839115698.jpg md5: 7cabde60...

>>106404890
Can't seem to get it right

Anonymous 8/27/2025, 10:35:18 PM No.106405406 [Report] >>106405664 >>106408740

00004-1260451778-f.png md5: e570bc2b...

Had Deepseek rewrite the rentry and add an introduction.
Seemed appropriate.
Take a look and comment.
https://rentry.org/dipsyWAIT2

Anonymous 8/27/2025, 10:50:08 PM No.106405580 [Report]

1740708918229278.png md5: 3e38915b...

>>106403453

Anonymous 8/27/2025, 11:03:08 PM No.106405664 [Report] >>106405795 >>106407048

>>106405406
Sounds good to me.
I think you should add something about temperature, repetition and so on to the rentry. Briefly explain them with recommended values

Anonymous 8/27/2025, 11:17:45 PM No.106405795 [Report] >>106405826

00004-1260451778.png md5: b29767f4...

>>106405664
V3.1 has some values now. The OR ones... I've no idea, they seem really inconsistent.
I'll get it added.

Anonymous 8/27/2025, 11:20:48 PM No.106405826 [Report] >>106405923 >>106407048

>>106405795
Holy ass...

Anonymous 8/27/2025, 11:29:48 PM No.106405923 [Report] >>106407381 >>106407695

00065-1378487878.png md5: f0b6f80c...

>>106405826
This particular lora does a great job w/ backs. It's one of the reason I keep using it.

Anonymous 8/28/2025, 1:12:46 AM No.106406822 [Report]

1753204811965122.jpg md5: f9802b40...

Anonymous 8/28/2025, 1:24:47 AM No.106406926 [Report]

00007-132219857.png md5: 9b1198ad...

Anonymous 8/28/2025, 1:41:52 AM No.106407048 [Report]

00008-2142883407.png md5: 2f2fe5c2...

>>106405664
Take a look:
* Parameter Setting Recommendations?
Official API:

V3.1, Non-Think: Temperature: 1.3 - 1.5. Top P: 0 - 0.05. Frequency and Presence penalty appear to be locked.
V3.1, Think: Parameters are locked

Unofficial API: Openrouter, etc. use mystery meat providers. Experiment.

Local Models: Follow the guidance for the base model (Qwen, Llama).
>>106405826
lol I got the double entrendre after I posted

Anonymous 8/28/2025, 2:20:00 AM No.106407381 [Report] >>106407412

00013-1378487878.png md5: d3ae0b32...

>>106405923
its great with fronts too

Anonymous 8/28/2025, 2:24:20 AM No.106407412 [Report] >>106407482

>>106407381
Generally good hip to waist ratios.
New Rentry is about as done as it needs to be. I'll move the URLs around later, in meantime here's the final draft. Code section's light; I don't have any firsthand experience w/ it yet so it's just links.
https://rentry.org/DipsyWAIT2

Anonymous 8/28/2025, 2:34:31 AM No.106407482 [Report] >>106408102 >>106415760

>>106407412
>1.1.
>2.2.
>...

Anonymous 8/28/2025, 3:03:53 AM No.106407695 [Report]

>>106405923
ASS

Anonymous 8/28/2025, 3:10:31 AM No.106407747 [Report] >>106407787

1732921331337912.jpg md5: 8dc03794...

Anonymous 8/28/2025, 3:15:40 AM No.106407787 [Report] >>106407868

>>106407747
cute hag

Anonymous 8/28/2025, 3:25:15 AM No.106407868 [Report] >>106407905

>>106407787
>hag
Dipsy came into existence in 2033

Anonymous 8/28/2025, 3:29:44 AM No.106407905 [Report]

>>106407868
>time-traveler
what the fuck are these chinks doing?!

Anonymous 8/28/2025, 4:01:30 AM No.106408102 [Report]

>>106407482
Yeah, the auto TOC isn't great. I think I might just drop it.

Anonymous 8/28/2025, 4:24:47 AM No.106408247 [Report] >>106408529

>>106402779
I think a curated dataset of nsfw stories with a male demographic would help a lot to completely get rid of any purple prose.

Anonymous 8/28/2025, 5:08:24 AM No.106408529 [Report] >>106410098

>>106408247
You mean, as opposed to female audience romance novels?
I feel like they've baked in a ton of male oriented fan fiction already, the issue is it's not well written stuff.

Anonymous 8/28/2025, 5:43:46 AM No.106408740 [Report] >>106408768

>>106405406
maybe add cherrystudio to frontends, it is highly underrated, z.ai is also yet another chinese llm gem

Anonymous 8/28/2025, 5:47:55 AM No.106408768 [Report]

>>106408740
>cherrystudio
I remember that one. Never tried it, looked like it's oriented for multimodal
Glm is z.ai. didn't put that together. I'll add it to the list.

Anonymous 8/28/2025, 5:58:01 AM No.106408816 [Report] >>106409134

Can Chutes and/or Openrouter fix prefilling already so I can give them my 10 dollars?

Anonymous 8/28/2025, 6:56:49 AM No.106409134 [Report] >>106409148

>>106408816
I don't think Dipsy supports prefill

Anonymous 8/28/2025, 6:59:11 AM No.106409148 [Report] >>106410575

>>106409134
...? The API supported prefill before 3.1. The API supports prefills now.
It's just Chutes' DeepSeek through OpenRouter that does not.

Anonymous 8/28/2025, 7:05:47 AM No.106409184 [Report] >>106410167 >>106410412 >>106411070

dipsygen.png md5: a7dc130c...

I'm a tourist from /lmg/ trying API for the first time. I use ST as my frontend, the rentry says most of the settings are locked. Should I be using any prompt post-processing? Does stuff like context size and max response length matter?

Also, anything I should generally take note of when using API? Anything I should avoid doing?

Anonymous 8/28/2025, 9:54:13 AM No.106410098 [Report]

>>106408529
>You mean, as opposed to female audience romance novels?
Yes.

>I feel like they've baked in a ton of male oriented fan fiction already, the issue is it's not well written stuff.
No, plenty male oriented works are removed from most dataset because they're too explicit, this is why most models, from oai to anthropic to anything open weights sound very "amazon erotica for middle aged women" by default.

Anonymous 8/28/2025, 10:12:51 AM No.106410167 [Report]

>>106409184
>Should I be using any prompt post-processing?
Use Strict or Single user message. 3.1 seems very sensible to this and it significantly affects response length/style, so you might want to try both and see what works for you

>Does stuff like context size and max response length matter?
Not really as long as you set them high enough to actually fit whatever it is you're doing

Anonymous 8/28/2025, 10:36:21 AM No.106410268 [Report] >>106410504

Is deepseek more lenient than chatgpt when it comes to lewd stories?(at least moderate ones) I haven't tried deepseek yet.

Anonymous 8/28/2025, 11:04:16 AM No.106410412 [Report] >>106411070

>>106409184
Ooo. Will add to rentry.
R1 and v3 used to act up at context over 10k. V3.1 seems to do much better, I experimented with 20k and the llm picked up a forgotten ( to me ) detail when I expanded the context.
Official context is 128k, but decision is more economic... larger context is more expensive round to round, and of questionable value. Though it really doesn't cost much, most anons here are spending a few dollars a month on inference.
I've been having issues with very long responses breaking down, but issue isnt clear to me yet.. I have response length set to 1200. R1 and v3 tended to be verbose. Iirc response length is just ST cutting off output, it's not a parameter sent for inference.

Anonymous 8/28/2025, 11:22:35 AM No.106410504 [Report]

>>106410268
Deepseek itself is completely uncensored, it will let you do anything with anyone. It can sometimes hallucinate filters though ("Sorry, but I can not ...). Apparenly DS was trained on gpt outputs lol, just reroll in that case.

Individual Deepseek providers might add their own filters or censor prompts to their services though.

Anonymous 8/28/2025, 11:38:50 AM No.106410575 [Report] >>106411504

Apis.jpg md5: e58f027f...

>>106409148
There's a bunch of DS direct providers. I hesitate to recommend one since I don't use them and don't want to shill them here. But I'd see if one of these works.

Anonymous 8/28/2025, 1:05:45 PM No.106411070 [Report]

deepseekSillyTavernCosts.png md5: aa8138c2...

>>106409184
>>106410412
So, here's an example of costs. They're so low, I did the total chat cost of 100 rounds, assuming that context was full the entire time (which it wouldn't be), as well as one where you used the entire 128K context (which I don't think is realistic.)
As you can see... it's nothing. And this is at September pricing; current pricing is about 1/2 of this.
Per-round costs are fractions of a penny, and as context size climbs, that cost dominates the per-round cost.
The bleeding edge of technology vs. the low cost of paid inference is why running local needs either a strong business case or an anon with money to burn.

Anonymous 8/28/2025, 2:06:20 PM No.106411504 [Report] >>106411593

>>106410575
The chart doesn't tell the whole story because not all of them support caching.

Anonymous 8/28/2025, 2:15:03 PM No.106411593 [Report]

>>106411504
Oh, it's not even close to whole story:
> rate limits / reliability
> who knows what model is actually being hosted
> what quant
> model censoring / content restrictions at hosting level
> intermediary prompting by host
> provider hardware, or rented elsewhere
> how many fingers are in the data, and what happens to it between responses
There's so many intermediary vendors involved in the above I don't even want to try to dig into it.

Anonymous 8/28/2025, 2:22:40 PM No.106411662 [Report] >>106412093 >>106413216

insider here template used on biz.png md5: 81d0e922...

>there won't be deepseek r2, only v4
>deepseek moment 2.0 is approaching
>nvidia stock will go down the moment v4 is released
I have been told by an insider

Anonymous 8/28/2025, 3:12:43 PM No.106412093 [Report] >>106414273

>>106411662
If DS can build a SOTA model using a data center based on Ascend chips there's going to be an Nvidea correction. Stocks are a view of the future, not the now. Anything that slips the "you can only use Nvidea" is going to impact their stock price.
lol at used white guys. UK's been sending "used white guys" out to consulting in SE Asia for decades due to taxes and (I assume) UK local hiring practices. Nothing new there.

Anonymous 8/28/2025, 5:27:26 PM No.106413216 [Report] >>106413780 >>106414273

>>106411662
if chinese get rid of neeed for nvidia gpus,

prices of gpus and AI services as a whole will go down

better yet, get us TPUs/Tensor Processing Units

Anonymous 8/28/2025, 6:33:07 PM No.106413780 [Report]

>>106413216
More like Tranny Processing Unit

Anonymous 8/28/2025, 7:25:05 PM No.106414273 [Report]

>>106412093
>>106413216
Google is talking about selling their TPUs. I will be so happy if we get more competition in this space. Fuck nvidia and their ridiculous prices.

Anonymous 8/28/2025, 7:40:45 PM No.106414389 [Report]

I'll try 3.1 for gooning now for the first time. Wish me luck. Will report back in a while how it went.

Anonymous 8/28/2025, 9:35:35 PM No.106415362 [Report] >>106415369 >>106415760

1522757640858.png md5: 02ee0fa2...

The new DeepSeek just isn't cutting it for me. The old one was so much more expressive and interesting to roleplay with. The new one is just cut and dry.

Anonymous 8/28/2025, 9:36:28 PM No.106415369 [Report] >>106415562

>>106415362
then use the old one

Anonymous 8/28/2025, 9:52:55 PM No.106415562 [Report] >>106415586

>>106415369
How do I use the old one via the API?

Anonymous 8/28/2025, 9:54:31 PM No.106415586 [Report]

>>106415562
Use openrouter. DeepSeek doesn't keep old models on their API.

Anonymous 8/28/2025, 10:10:54 PM No.106415760 [Report]

00006-1260451778.png md5: 9ca21cc5...

>>106415362
You've got to change the Main Prompt, if you haven't. Read further up the thread; V3/R1 didn't require much of a main prompt. V3.1 does; you have to tell it to produce prose, and a lot of if if you want that, and to be descriptive.
V3.1 is oriented to code / agent work. It's dry by design.
>>106407482
Rentry is updated; I deprecated the old one but unlike DS I kept the link at the bottom of the file.
https://rentry.org/DipsyWAIT