More

requilence · 2025-09-25T08:52:13 1758790333

Great project! I’ve had a similar experience with Rewind and the related privacy concerns. A quick thought: if I recall correctly, Rewind performs OCR locally, so it only needs to send textual data. Since you’re focusing on macOS, you could rely on VNRecognizeTextRequest and skip the extra OCR complexity. It might also help to detect and mask sensitive information with lightweight models (e.g., BERT), especially when leveraging cloud-based AI.

jerryliu12 · 2025-09-25T22:03:04 1758837784

Woah didn't know about VNRecognizeTextRequest, that's super cool thanks for flagging!

requilence · 2025-07-27T06:31:12 1753597872

Self‑hosting swaps one risk for another. A cleaner option may be local‑first apps that speak an open sync protocol: your data lives on your phone/laptop/homeserver; an optional, end‑to‑end‑encrypted relay just moves opaque blobs for backup and multi‑device access. Why this matters: the No. 1 failure I see isn’t servers getting hacked. It’s users losing the only copy of their files. * ~70 million phones are lost annually; most are never recovered. https://snohomishcountywa.gov/Archive.asp?ADID=543 * Consumer hard‑drive AFR hovers around 1‑2 %—≈7 % fail within five years. https://blocksandfiles.com/2024/05/02/disk-failure-rates-in-... * 29 % of people have already suffered data loss from deletion, ransomware, or disk death. https://www.worldbackupday.com/en With odds like these, an off‑device, unreadable backup is basic hygiene, not a luxury. Disclaimer: I work on Anytype, which follows this model: local ownership, open‑source sync protocol, source‑available clients.

requilence · 2025-07-17T10:55:24 1752749724

Thank you, I made an update to the original post with your explanation, and because you stated that the output was a pure hallucination, I also attached one of them.

requilence · 2025-07-16T00:48:02 1752626882

In one of the responses, it provided the financial analysis of a not well-known company with a non-Latin name located in a small country. I found this company; it is real and numbers in the response are real. When I asked my ChatGPT to provide a financial report for this company without using web tools, it responded: `Unfortunately, I don’t have specific financial statements for “xxx” for 2021 and 2022 in my training data, and since you’ve asked not to use web search, I can’t pull them live.`.

Xx_crazy420_xX · 2025-07-16T06:01:40 1752645700

Did you try to ask it to provide data of the company, by explicitly invoking hallucination in the model?

Right now there is no real proof, untill you confirm that the data it provided cannot be hallucinated (which could be not feisable).

Also, acknowledging the response fron OpenAI staff dismissing it, would you mind sharing PoC?

requilence · 2025-07-17T10:58:26 1752749906

I've updated the original post with technical details and an output example.

krainboltgreene · 2025-07-16T04:32:23 1752640343

I’m struggling to understand why you are so adamant that this is proof.

requilence · 2025-07-16T00:30:48 1752625848

they have security.txt file on their domain and mentioned it in some other place

requilence · 2025-07-16T00:24:58 1752625498

Right, thank you for the suggestion. Just added a paragraph to the original blog post.

tabletcorry · 2025-07-16T01:06:09 1752627969

Your added paragraph appears to suggest the opposite, that this was an LLM response. Was the "leaked data" a response from an LLM directly?

JyB · 2025-07-16T23:37:38 1752709058

Yes apparently which makes this report pretty flimsy.

tptacek · 2025-07-17T02:57:00 1752721020

Upthread, OpenAI's security team confirms it's a false report; it's a variant of the empty-prompt hallucination.

JyB · 2025-07-17T05:41:21 1752730881

Incredible that so many people still don't understand what an LLM is. Especially ones that you would expect to grasp it.

requilence · 2025-07-15T23:29:54 1752622194

Reported a flaw to OpenAI that lets users peek at others' chat responses. Got an auto-reply on May 29th, radio silence since. Issue remains unpatched :( Avoided their bug bounty due to permanent NDAs preventing disclosure even after fixes. Following standard 45-day disclosure window—users should avoid sharing sensitive data until this is resolved.

jonrouach · 2025-07-16T00:07:27 1752624447

you're sure it's not their "feature" that calling the api with empty string returns random hallucinations?

https://jarbon.medium.com/gpt-prompt-bug-94322a96c574

requilence · 2025-07-16T00:13:15 1752624795

No, definitely not the empty string hallucination bug. These are clearly real user conversations. They start like proper replies to requests, sometimes reference the original question, and appear in different languages.

jonrouach · 2025-07-16T00:46:15 1752626775

i had the exact same behavior back in 2023, it seemed like clearly leakage of user conversations - but it was just a bug with api calls in the software i was using.

https://snipboard.io/FXOkdK.jpg

postalcoder · 2025-07-16T02:53:39 1752634419

There was an issue with conversation leakage, though. It involved some bug with Redis.

I felt like it was a huge deal at the time but it’s surprisingly hard to quickly google it.

Sebguer · 2025-07-16T02:59:06 1752634746

It was the classic "oh no we did caching wrong" bug that many startups bump into. It didn't expose actual conversations though, only their titles: https://openai.com/index/march-20-chatgpt-outage/

postalcoder · 2025-07-16T04:25:04 1752639904

ah there it is. thanks for jogging my memory. funny to think of how niche chatgpt was considered then to now.

JyB · 2025-07-16T00:24:30 1752625470

I don’t see anything here that would prevent a LLM from generating these. Right?

requilence · 2025-07-16T00:50:42 1752627042

In one of the responses, it provided the financial analysis of a not well-known company with a non-Latin name located in a small country. I found this company; it is real and numbers in the response are real. When I asked my ChatGPT to provide a financial report for this company without using web tools, it responded: `Unfortunately, I don’t have specific financial statements for “xxx” for 2021 and 2022 in my training data, and since you’ve asked not to use web search, I can’t pull them live.`.

BoiledCabbage · 2025-07-16T14:18:10 1752675490

> numbers in the response are real.

OpenAI very well may have a bug, but I'm not clear on this part. How do you know the numbers are real?

I understand you know the name is the company is real, but how do you know the numbers are real?

It's way may than anyone should need to do, but the only way I can see someone knowing this is contacting the owners is the company.

Sebguer · 2025-07-16T01:08:36 1752628116

Do you understand what a hallucination is?

jojobas · 2025-07-16T02:21:21 1752632481

Coming up with accurate financial data that you can't get it to report outright doesn't seem like one.

Sebguer · 2025-07-16T02:50:44 1752634244

Models do not possess awareness of their training data. Also you are taking at face value that it is "accurate".

refulgentis · 2025-07-16T02:25:12 1752632712

I don't understand the wording

Accurate financial data?

How do we know?

What does using not-web-search not having the data have to do with the claim that private chats with the data are being leaked?

01HNNWZ0MV43FF · 2025-07-16T03:47:37 1752637657

> I found this company; it is real and numbers in the response are real.

???

refulgentis · 2025-07-16T04:37:48 1752640668

Which of my questions does that answer?

queenkjuul · 2025-07-16T08:32:01 1752654721

That the financial data is accurate?

refulgentis · 2025-07-16T13:42:17 1752673337

It's an ourobos - he can't verify it's real! If he can, its online and available by search.

JyB · 2025-07-16T23:36:35 1752708995

Therefore what are the odds that this is just the LLM doing its thing versus "a vulnerability". Seem like a pretty obvious bet.

addandsubtract · 2025-07-16T10:19:00 1752661140

New Touring Test unlocked! Differentiate between real and fake hallucinations.

DANmode · 2025-07-16T20:13:54 1752696834

So THAT'S what the "GT" means on all of these GPU model names!

999900000999 · 2025-07-16T00:16:31 1752624991

Users should always avoid sharing sensitive data.

A lot of AI products straight up have plan text logs available for everyone at the company to view.

pyman · 2025-07-16T00:29:28 1752625768

It's not just about sensitive data like passwords, contracts, or IP. It's also about the personal conversations people have with ChatGPT. Some are depressed, some are dealing with bullying, others are trying to figure out how to come out to their parents. For them, this isn't just sensitive, it's life-changing if it gets leaked. It's like Meta leaking their WhatsApp messages.

I really hope they fix this bug and start taking security more seriously. Trust is everything.

milkshakes · 2025-07-16T01:55:59 1752630959

maybe you should stop trusting random people on the internet making extraordinary claims without proof then?

baby_souffle · 2025-07-16T02:05:59 1752631559

Isn't "assume vulnerable" The only prudent thing to do here?

refulgentis · 2025-07-16T02:26:53 1752632813

No? Yes? Mu?

After some hemming and hawing, my most cromulent thought is, having good security posture isn't synonymous with accepting every claim you get from the firehose

milkshakes · 2025-07-16T02:32:05 1752633125

everything is vulnerable. the question is, has this researcher demonstrated that they have discovered and successfully exploited such a vulnerability. what exactly in this post makes you believe that this is the case?

999900000999 · 2025-07-16T03:13:36 1752635616

https://arstechnica.com/tech-policy/2025/07/nyt-to-start-sea...

ameliaquining · 2025-07-16T03:50:57 1752637857

This is going to be subject to the legal discovery process with the usual safeguards to prevent leaks; in particular, the judge will directly supervise the decision of who needs access to these logs, and if someone discloses information derived from them for an improper purpose, there's a very good chance they'll go to jail for contempt of court, which is much more stringent than you can usually expect for data privacy. You can still quite reasonably be against it, but you cannot reasonably call it "plain text logs available for everyone at the company to view".

ameliaquining · 2025-07-16T00:22:12 1752625332

Which ones? Do you just mean tiny startups and side projects and the like or is this a problem that major model providers have?

poniko · 2025-07-15T23:52:06 1752623526

The NDA part feels really murky.

tptacek · 2025-07-16T00:10:18 1752624618

It's pretty standard for bounty programs. If you don't like it, which is reasonable, do what this researcher did and just post independently.

asadotzler · 2025-07-16T00:47:16 1752626836

That's an exaggeration. Most industry leaders do not require NDAs, only coordinated disclosure.

Mozilla's program, which has been around longer than most, doesn't. Google and Microsoft don't. Meta and Apple don't.

This is water carrying, intentional or not, for a terrible practice that should be shamed, so that it doesn't become standard.

tptacek · 2025-07-16T00:48:18 1752626898

My understanding is that all Bugcrowd bounties do by default.

You can shame it all you want, but you can also just publish your bugs directly. Nobody has to use the Bugcrowd platform. You don't even have to wait 45 days; I don't buy these "CERT/CC" rules.

asadotzler · 2025-07-18T22:42:50 1752878570

You said it was pretty standard for bug bounty programs, and I disagreed pointing to several of the largest and longest lived bug bounty programs, none of which do that, and your response is pointing out that one particular platform does it?

Even among 3rd party platforms, of which there are several bigs, the NDAs are not a platform requirement, just an option for participating firms.

NDAs are not the norm. Don't mislead people who would otherwise get into this game with non-issues they need not worry over.

tptacek · 2025-07-19T12:15:43 1752927343

OpenAI's security team commented on the thread themselves that they believe they simply accepted the Bugcrowd defaults. I think you're trying to find a controversy that just isn't here.

pyman · 2025-07-16T00:37:32 1752626252

The bug bounty world is a funny one. I remember one complaining that their bug was dismissed and fixed after they signed an NDA, no payout, nothing. Another one got $100 instead of $5,000 because the company downgraded the severity from high to low. So they ended up with little or no money, and no recognition either. Not sure if these were edge cases, but it does make you wonder how fair the process really is.

tptacek · 2025-07-16T00:45:17 1752626717

If you're dealing with large companies, a good rule of thumb is that the bounty program is incentivized to pay you out. Their internal metrics improve the more they pay; the point is to turn up interesting bugs, and the figure of merit for that is "how much did we have to spend". At a large company, a bounty that isn't paying anything out is a failure.

All bets are off with small random startups that do bug bounties because they think they're supposed to (most companies should not run bounties). But that's not OpenAI. Dave Aitel works at OpenAI. They're not trying to stiff you.

Simultaneous discovery (either with other researchers or, even more often, with internal assessments) is super common. What's more, you're not going to get any corroboration or context for them (sets up a crazy bad incentive with bounty seekers, who litigate bounty results endlessly). When you get a weird and unfair-seeming response to a bounty from a big tech company, for the sake of your own sanity (and because you'll probably be right), just assume someone internal found the bug before you did, and you reported it in the (sometimes long) window during which they were fixing it.

pyman · 2025-07-16T01:07:31 1752628051

Interesting insights, thanks for sharing

com2kid · 2025-07-16T03:20:47 1752636047

I see other users conversations on my Gemini dashboard, not sure who to even complain to.

Software quality is... Minimal now days.

fcpguru · 2025-07-15T23:36:43 1752622603

well done, sounds very reasonable and following the rules.

requilence · 2025-07-15T23:49:41 1752623381

Appreciate it. Just trying to do the right thing by both OpenAI and users here.

maxlin · 2025-07-16T00:12:52 1752624772

Permanent NDA's? Oof. It's like their plan is to just try to force the lid down till they reach ASI or something lol

tptacek · 2025-07-16T00:36:34 1752626194

Again: NDAs are bog standard bounty terms.

requilence · on July 20, 2023

We do releases in the Github Actions CI. So you can inspect the CI logs and published artefacts(desktop/android). Then you can compare the binaries checksums. I would appreciate ideas on how we can make it more transparent

requilence · on Oct 28, 2020

Hello, I’m Roman and I’m a technical co-founder at Anytype. We are building a new operating environment that breaks down barriers between apps and gives back privacy and data ownership to users.

Anytype runs locally and exchanges data directly in a peer-to-peer way without exposing it to intermediaries even when users work across devices and with each other. Because of this, Anytype is free without storage and upload limits. We plan to open-source it with the public release.

This is our early demo and I’d like to get feedback on what we are building, so we can incorporate it into Anytype. Also, if you have any questions, I’d love to answer them.

randomchars · on Oct 28, 2020

Looks neat, but a few thoughts...

The video is from January, what changed since then?

I still only see a Get early access option linking to typeform, and no actual download/signup.

> We plan to open-source it with the public release

If it'll be open source, why not open source it now?

Also, according to: https://news.ycombinator.com/showhn.html

> Show HN is for something you've made that other people can play with. HN users can try it out, give you feedback, and ask questions in the thread.

sharipova · on Oct 28, 2020

I'm glad you liked the video.

Yes, it's from January and it was based on our prototype.

Now we re-built it from scratch and invited first 100+ users who are currently using Anytype and giving us a lot of feedback.

We are inviting alpha testers regularly from our community channel on telegram.

As a lot of things change (we re-wrote Anytype from scratch just recently), we want to open source with open beta.

Yes, I think it was a mistake to have Show in the submission

Oshyan · on Oct 28, 2020

It is indeed in a closed test process right now, with onboarding new testers regularly. I think the selection is random from the pool of people who have filled out their interest forms on Telegram.

ebalit · on Oct 30, 2020

This looks really nice! I'll definitely register to the waitlist.

Can you tell us more about your business model?

requilence · on Jan 8, 2020

1. In case of public pages, as far as any of the nodes has your page saved it will be available.

2. In case the data is shared with someone(e.g. teammate or family) you always be able to restore it with the seed phrase. In case of private-only data, our plan is to encourage users to have multiple devices(mobile+desktop) to have a data backup

3. We plan to implemet per-device private key(like the keybase does) and use it to decrypt files' private keys. In case the device's private key was stolen you can remove this device from the list and it will not be able to receive new changes