
First, most of the English-speaking world is not made up of native speakers.

"As of 2022, there were about 400 million native speakers of English. Including people who speak English as a second language, estimates of the total number of Anglophones vary from 1.5 billion to 2 billion."

Second, all the popular models I tested did well with that query, including Gemini on Android (aka "ok Google"). Only Apple's did not.

https://en.m.wikipedia.org/wiki/English-speaking_world



I am not sure why you bring up the English-speaking world. Anyway, I am also not sure why we think that query is a good benchmark. As for the models you tested with it: are they local models running on a wireless device, or do they use a datacenter and only send the text back and forth?


I'm fairly sure Siri still sends user voice samples to a data center. At least for a while, it used to use multipath TCP to decrease latency over multiple available network connections if I'm not misremembering.

Some modern Apple devices support "local Siri", but with reduced voice recognition performance and only a limited subset of capabilities.



