Search Results

Found 2 results for "7dd85704347aa6994c2b6942b147037d" across all boards searching md5.

Anonymous /g/106117698#106118629
8/2/2025, 8:13:27 PM
>>106117698
Is there anything like a local Ani from Twitter yet? The repos I've reviewed only tie models together via the UI, that is, generate TTS on page A, generator an image on page B.

If not, I will consider developing my own Robo-wife via a web app which would make API calls to a home server (>=6GB GPU required)

>Robo-wife Phone Camera -> Image -> Server -> Image tagger ---(0.2 second delay)---> Image tags -> Server
>Robo-wife Phone Microphone -> Audio -> Server -> Speech to Text ---(0.2 second delay)---> Server
>Server considers image tags and audio, and creates a response (1 to 5 seconds) -> text to speech ---(0.4 second delay)---> Robo-wife -> Phone Speaker and Screen

Could also add in OCR to provide reading capabilities.

I think it'd be possible to get near real time responses with streaming response tokens from the server.
Anonymous /g/106118295#106118295
8/2/2025, 7:34:01 PM
Do any of you know of an open source local virtual assistant/companion similar to Ani from Twitter?

I am considering developing a Robo-wife via a web app which would make API calls to a home server (>=6GB GPU required)

>Robo-wife Phone Camera -> Image -> Server -> Image tagger ---(0.2 second delay)---> Image tags -> Server
>Robo-wife Phone Microphone -> Audio -> Server -> Speech to Text ---(0.2 second delay)---> Server
>Server compiles image tags and audio into a response (1 to 5 seconds) -> text to speech ---(0.4 second delay)---> Robo-wife -> Phone Speaker and Screen

Could also add in OCR to provide reading capabilities.

I think it'd be possible to get near real time responses with streaming response tokens from the server.