While I dislike the ToS change and don't use Zoom, I understand why they're doing it. How else can they train a closed caption system like everyone else has? They need data for it. Transcripts are becoming a killer feature for me in meetings. I can understand now why my dad couldn't type: he had a secretary to transcribe everything. It's super efficient. I just pull junior devs or product into a meeting, turn on transcription, and we talk through a problem. No one has to take notes; we just talk and diagram. I've started turning it on in war rooms as well. Every 30 minutes I stop the transcript and start a new one. A few minutes later I can share the previous segment's transcript with managers so they get a detailed progress update. Better than typing really detailed things into Slack, and better than an audio recording.
Edit: some people pointed out that Whisper would do a good job with transcription, but there are other things, like tweaking the model, which is essentially training it, and building their own summarization systems, which may be bespoke per customer. At my work we use an AI that answers HR and other kinds of questions. It's more or less trained on our company-specific questions and actually does a great job, but that does mean we have to allow our data to be used for AI training. We're also using this system for first-tier tech support and in some of our developer channels for very common questions, and it works great because it spots those common questions and gets an answer out before a human is even able to pay attention. Both of those approaches could be enabled by these terms of service changes.
> How else can they train a closed caption system like everyone else has? They need data for it.
This is where zero-knowledge federated learning comes in. Unfortunately, this is very much a tomorrow technology (it needs the infrastructure to support it). Why invest in privacy-preserving methods for training machine learning models tomorrow when you can steal users' private information today (or even better, bully them into handing it over by being the de facto video conferencing platform that everyone needs to use because of network effects)?