off-topic

2025-09-14T03:13:35.119019Z

I know LLMs are all the rage, but I need to talk about just how good ChatGPT's voice recognition is. It's absolutely amazing, and I just can't deal with other voice recognition anymore. Why isn't this talked about more? The model they use for voice to text is simply the best; it's the first voice recognition I've used that understands me as well as a human would.

shiyi.gu 2025-09-15T14:25:51.733389Z

I always work with claude, use chatgpt for dictation and no patience, use gemini for discussing buddhism. Highly recommend Buddhism part 😆, Gemini knows it too well.

raspasov 2025-09-17T04:33:51.214819Z

I gotta give it to them, ChatGPT voice recognition is quite amazing, including their chatbot. For overall LLM response quality, I don’t think they are very special. Many LLMs, at the premium levels, are quite good, or perhaps better. Most recently, I’ve been defaulting to Gemini 2.5 Pro for most technical questions. I’m guessing there’s also something about the combination of Gemini and the (obviously) superior Google search index that makes it stand out.

liebs 2025-09-14T14:23:52.770119Z

maybe the LLMs are all talking about it amongst themselves and we just can't hear them

🤣 1
shiyi.gu 2025-09-14T18:00:41.380999Z

Gemini doesn’t know I always open ChatGPT first to transcribe. ChatGPT doesn’t know I always paste its transcription into Gemini.

😂 5
silian 2025-09-15T05:07:09.103159Z

@shiyi.gu Is this true? chatgpt for voice dictation, Gemini as your general purpose?

Mno 2025-09-26T14:49:46.997619Z

I used it to try and practice my French and it was pretty good, especially interactive mode on the app, however I think there was a limit for interactive mode and I stopped using it to preserve it haha

Mno 2025-09-26T14:50:34.814889Z

It's the only one that seems to understand me when I switch from French to English to Spanish mid prompt

2025-09-14T03:19:50.467549Z

And on that note, I wish there was a voice recognition LLM hybrid specialized for coding dictation. Something where I could be like: "defn add let foo 2 + 2 in the body use map to sum it actually rename it to bar" And it'll edit in almost realtime what I say, like a human would.

Nathan Smith 2025-09-14T15:32:05.762519Z

Check out https://whalequench.club/

👀 2
silian 2025-09-15T05:12:39.828089Z

I have high hopes & rooting for Neuralink. Conveying word-thoughts with fingers or thumbs is so clumsy. For example, to get "conveying" in the previous sentence I had to attempt it 4 times. (I'm lying horizontally, stretching my neck, using my index finger to glide type.)

john 2025-09-18T00:47:10.038499Z

@didibus out here, vibe coding reality in #off-topic

2025-09-18T00:55:13.809539Z

@feedmyinbox02_clojuri Have you tried the ChatGPT voice dictation? Bet it would get conveying right the first time

✍️ 1
2025-09-18T00:57:16.762279Z

I think what they need to add is a mixed mode that fuses the speech-to-text with agentic LLM text editing. Because right now if you just say: "I wanted to ask no delete that I am in need of help" it'll just type that. We need to merge it with the agent intelligence so it understands which words are the actual text to type or edit and which are higher-level directions.

2025-09-18T00:58:11.116899Z

If someone wants to steal this idea and make a startup with VC funding go for it by the way haha

john 2025-09-18T01:12:35.980199Z

Eye tracking. Head tracking. Multi party conversation tracking

2025-09-18T01:58:52.331239Z

Zoom 2.0?

john 2025-09-18T03:00:48.299629Z

I guess it'd have to integrate into your os

john 2025-09-18T03:01:17.979659Z

Control the mouse, keyboard, see the screen, camera, etc

silian 2025-09-18T04:24:30.262679Z

@didibus interesting idea. I still dislike speech; hate to have to vocalize but it's an option. As you say, it knows no difference between instructions and a string. It's strings all the way down.

silian 2025-09-18T04:27:31.188589Z

If neural patterns producing strings versus “instructions for your string-producer” are distinct, Neuralink would be nice.

2025-09-18T15:08:39.763079Z

Well call me old fashioned, but I'd rather speech than brain surgery 🤣

2025-09-18T15:09:58.160939Z

But I do think that speech sucks because the tech sucks. Like, talking to my wife or coworkers isn't annoying or difficult. If I could talk to a computer the same way, it'd be great.