Nothing lasts forever. Good times will come and go and so would bad times.
I think as humans we are used to small time frames which are proportional to our own lifetime.
But the world: say climate, population, geology etc. moves at a much different cycle, if at all you can call it a cycle since none of the iterations are exactly the same.
So the lesson is this: change is coming. Change will always be coming. Embrace it.
If you like something, you have to struggle to preserve it as much as you can, for as long as you can, but you can never make it permanent.
> we shouldn't stop insisting that things change for the better
I never said we shouldn't.
What I meant by "Change will always be coming. Embrace it.", is to accept it as a reality, be ready for it and prepare for it. That means, be ready to resist negative change and accept positive change.
Even after successfully resisting negative change, the end state may still be different than before. This is what we have to accept and be ready for, mentally.
My read is that levkk is conflating concurrency with "real production apps" and this whole thread is starting to surface that "real production apps" and "high concurrency" are not measuring the same thing at all.
Sqlite is used in real production apps more than any other database.
Sqlite is also weak at any sort of write concurrency.
I am really sick of AI generated videos. I don't have anything against AI videos per se but the fact that it's so easy to generate videos that people are churning out really really bad quality videos out there.
There's another phenomenon I have been noticing more and more lately: the frequent scene cuts. One scene lasts a mere 3 seconds before being cut to a different scene. Whether it is entirely different scene, or a different angle of the same thing or zoomed in/out.
I am not sure if this phenomenon is due to AI but I sense some correlation there.
> There's another phenomenon I have been noticing more and more lately: the frequent scene cuts. One scene lasts a mere 3 seconds before being cut to a different scene. Whether it is entirely different scene, or a different angle of the same thing or zoomed in/out.
I find this awful, but it's not a remotely new thing.
Vim swap files (.swp) are periodically saved out to disk. In the event of a crash, reopening a file will present you with a screen asking if you want to recover the file. I doubt it’s character by character, but I’m sure vim has a setting to speed it up
While fraud does require intention to deceive, I get the sentiment that hallucinated citations shouldn't be dismissed as simply carelessness. It should be something stronger than that: gross negligence or something MUCH stronger! There should absolutely be repercussions for this.
But let's not call it fraud. That word is reserved for something specific.
EDIT: someone else said "reckless disregard" equals intent or something to that effect. So I looked it up.
It appears so that is the case. "Reckless Disregard Equals Intent" in legal language.
But I am not sure if this particular clause should apply here. Perhaps it depends on what kind of research is being published? For e.g., if it is related to medical science and has a real consequence on people's health, we can then apply this?
I do believe this policy is appropriate to deal with the reckless disregard of posting hallucinated references.
It's a conscious decision to not take the time to check your AI output, and instead waste a whole bunch of other people's time letting them essentially do that for you in duplicate.
Feels like that should disqualify you from participation for a bit. Intent or no intent.
> Feels like that should disqualify you from participation for a bit. Intent or no intent.
Exactly! For a bit!
Yet this is not for a bit! This is a lifetime disqualification, and that's been my entire grip the whole time! Is nobody reading this?
"The penalty is a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted at a reputable peer-reviewed venue."
> Then how long are they disqualifying them from submitting prior to acceptance, if not lifetime?
Well, "lifetime ban" means "you are not allowed back in". Their ban specifically allows you back in (after a specified period) subject to fulfilling a single constraint.
It's conditional acceptance back in, which is not the same as a lifetime ban which is unconditional.
The intent to deceive is there. The deception is lying when you submit it that it is a scholarly piece of work in which amongst many other things you know the citations are accurate. This false representation was knowingly and intentionally made at the time of submission.
The citation being incorrect is merely the proof of deception not the (relevant) deception itself.
Fraud is the correct description provided (and this is practically a guarantee) you intended to benefit from the submission of the paper (e.g. by bolstering your resume).
If I violate the letter of the ToS when clicking submit you can correctly argue that I have technically committed fraud! Yet that is almost never what anyone actually means when having discussions like this one.
Fraud in a scientific context generally refers to fabricated research results. At least personally I agree with GP that hallucinated citations are generally something akin to laziness thus not fraud but rather some sort of professional negligence.
Fraud in the scientific world has generally taken the form of fabricated results, but I don't agree that the word has transitioned away from the common and legal meaning of deception in order to get a benefit.
Even if it had though, I'd be perfectly comfortable calling this fraud in this discussion based on the common meaning of the word. Just because we're talking about a scientific context does not mean we need to use the scientific-jargon versions of words - we're not in a scientific context ourselves.
---
And I'd disagree that this is just about the "letter of the ToS". While that is perhaps a necessary component in order to prove the deception, this is really about the cultural expectations of the community that merely happened to have been encoded in the ToS. The fraud would still occur without the ToS, it would merely be next to impossible to show you didn't simply misunderstand the cultural norms and what your actions would lead others to believe.
I disagree with your implicit assertion regarding the common meaning of the term in this context. I believe that the term fraud as commonly used when discussing things in a scientific context has always (for at least my entire life) been taken to refer to knowingly and intentionally falsified research results (also falsified appointments, falsified affiliations, falsified authorships, etc).
> deception in order to get a benefit.
The point being that reckless or negligent conduct is not commonly taken to constitute deception. There's a reason we have different terms for these things.
Sure, you can say "well he exhibited reckless disregard for his professional duties when he opted not to bother reading the citation section that the LLM shat out, and reckless disregard is sufficient to meet the legal bar for fraud, and also the ToS specifically says that you certify that you validated all references manually so bam! two counts of fraud legally speaking" and you wouldn't be wrong but the distinction between "legally fraud" and "fraud as is commonly meant when talking about scientific papers" is essential to effective conversation in this particular instance.
> Just because we're talking about a scientific context does not mean we need to use the scientific-jargon versions of words
The context is essential because (obviously) it affects how people interpret the meaning of your words. A fraudulent submission to a scientific journal has a specific and well understood meaning in common usage.
If you still disagree with me imagine polling a bunch of tenured career researchers about what they would think if they read the statement "X caught submitting a fraudulent paper to journal Y". I can just about guarantee you that none of them are imagining hallucinated citations.
We're not "discussing things in a scientific context" here. We're in the context of a startup/programmer news aggregator discussing scientific news. We are not "a bunch of tenured career researchers" discussing amongst ourselves so the jargon appropriate for that context is not the appropriate jargon - rather we need to use the jargon that the startup/programmer news aggregator crowd would understand.
That said even in a scientific context I still disagree and your example at the end is a fine starting point. By comparison imagine one of the profs said told the others that their house was burgled. The others would probably be thinking that things like TVs or computers or money was stolen, and not that the thief simply stole all their spoons. That doesn't make having all your spoons stolen not burglary. Likewise the profs expect that the results or authorships are where the fraud occurred because those are the best places to extract value with fraud, not by avoiding the simple act of writing the paper with correct citations. That doesn't mean fraudulently using an LLM to hallucinate a paper from your (we'll suppose for sake of argument) actual results is any less fraudulent though, it's just an unexpected form of fraud.
Edit: I want to be clear that this is not my argument: "well he exhibited reckless disregard for his professional duties when he opted not to bother reading the citation section". I see other people making that argument, and I'm not sure if they're right or wrong that that's another reason why it is fraud, but I'm certain that we don't even need to reach that question.
My argument is that it is fraud to represent the paper as a scholarly work when you don't know that it is correct. It is not that you are taking a risk it might be wrong, it is that you are actively representing that you know it is correct and if you do not know that you are committing fraud even if it happens to be so. This is a case of intentional deception, the deception being the representation that this is scholarly work, not reckless disregard for the truth as to the accuracy of the citations.
There are actually a surprising (IMO) number of career researchers on this site. Regardless, disregarding the context specific meaning will at absolute best result in a disjointed conversation where people are talking past each other. Worse, in this instance people are debating how arxiv (and other venues) ought to handle these sorts of things at which point you are well and truly into the territory where you need to get the field specific terminology right.
I concede that I was sloppy when I referred to what the researchers would be imagining. I should have phrased it as asking them if they thought that transgression X constituted fraud.
Regardless, hopefully you can see the idea that I was attempting to communicate? The burglary example isn't equivalent because while the spoons are unexpected the end result is still an event that most people would agree constituted burglary and resulted in noticeable harm to the victim.
I'm struggling to adjust your example on the fly but perhaps if it were the contents of the yard waste bin that had been pilfered? That's still technically burglary but I think most people would view it quite differently and might question the wisdom of prosecuting it.
I think the key difference here comes down to motivations as well as impact. Falsifying results (for example) is an active attempt to counterfeit the core value proposition of the endeavor and the end result of that is proportional - personal benefit directly as a result of the falsification and significant damages to anyone sufficiently bamboozled by the fiction long enough to base any decisions on it. Whereas no one using an LLM to generate just the bibliography is doing that to get ahead (at least not on its own) and any damages are limited to the reader wasting a few minutes trying to figure out the extent of the issue and who to contact about it.
> I should have phrased it as asking them if they thought that transgression X constituted fraud.
I'm not sure you're right that they wouldn't think it's fraud at first glance, but I also think it's the wrong test. Words have meanings that result in an actual truth to the matter that is sometimes non-obvious.
A theorem isn't false just because it intuitively seems false until you spend decades trying to prove it and finally find the proof. Requiring people be cleanshaven isn't obviously discriminatory, until someone points out the existence and prevalence of pseudofolliculitis barbae and perhaps that an action being discriminatory doesn't actually require intent.
So I'd propose the correct test (and again, this would only be for a discussion in an academic context, which this is not) looks something like giving them access to the common dictionary definitions, and legal definitions, of fraud, and then asking them to define it. Then taking their definition and applying it to the facts at hand.
I don't think your average academic thinks academia has a special meaning of the word. I think they would simply acknowledge the dictionary definition as the correct one, and as a result I think this is plainly fraud even as the word is used in an academic context.
---
I don't think anyone defines fraud as only deception relating to the core value proposition of the instrument of the deception.
I think (though might well be misunderstanding) that reckless disregard is taken to be an intentional choice but that it does not imply that the outcome itself was intentional. The difference between intentionally doing something that you know for a fact has a high risk of failure but you can't necessarily predict the outcome versus intentionally seeking a particular legally disallowed outcome.
But what LPisGood was saying is that reckless disregard (as opposed to explicit intent) is sufficient to meet the legal bar for fraud.
Nothing lasts forever. Good times will come and go and so would bad times.
I think as humans we are used to small time frames which are proportional to our own lifetime.
But the world: say climate, population, geology etc. moves at a much different cycle, if at all you can call it a cycle since none of the iterations are exactly the same.
So the lesson is this: change is coming. Change will always be coming. Embrace it.
If you like something, you have to struggle to preserve it as much as you can, for as long as you can, but you can never make it permanent.
reply