Risk management kills any attempt at bold choices, decisions are steered at the modelable and the low risk. There space is thus shrunk. When there were fewer media behemoths there were more variations on the risk models and the pattern was less descernable.
This is the real reason the ultra rich are buying media companies. They expect the existing copyright laws to prevail in court and to either make significant revenue licensing IP for training or to take large stakes in AI companies in return for the IP.
If this happens then free and open content (the Wikipedia model, more or less) becomes a hugely impactful "commoditize the complement" play for the big AI and tech firms. Every good piece of open content is something that AI firms don't have to license from a proprietary supplier. And if models trained on entirely open content can write an acceptable "first draft" of something new, that's huge acceleration.
Seems like a bad bet to me. It looks like authors are going to lose this case setting the precedent that you not only don’t need to license training data, obtaining it illegally (for free) is totally okay.
Didn't Anthropic's case already set the precedent that training itself is fine? It's not like copyrighted novels are a large portion of human-generated text data. It's just the stuff that's easier to get because it's preserved in bulk.
Video transcription has more or less been solved. Imagine how much data Google has in YouTube transcripts. And the longer these AI chat bots operate the more data they manage to collect for training as well (I think Google making it so you can easily upvote or downvote a response by the bot is a good idea).
I only use the free tiers of any particular app. It forces you to really think about you want the tool to do as opposed to treating it as the 'easy' button.
There's nothing at all wrong with Wikipedia but it needs sources to cite since it doesn't allow original research and the World Factbook is an important one.
Looking at the underlying study, this isn’t evidence of bias. It’s evidence of correlation between Republicans and negative sentiment.
If you look at the sentiment for public figures given, the bottom one is, for example, Brett Kavanaugh. Well, he was credibly accused of sexual assault during his confirmation hearings, which was a huge deal at the time. Someone with that on their record will probably be read as negative, but, I mean, not the editors’ fault!
> His policies resulted in the deaths of tens of millions of people in China during his reign, mainly due to starvation, but also through persecution, prison labour in laogai, and mass executions
What's "kid gloves" about that?
Let's contrast with the the farthest thing from a leftwing dictator we can find, the quintessential rightwing one, Adolf Hitler. Here's the intro to his Wikipedia page:
> Adolf Hitler[a] (20 April 1889 – 30 April 1945) was an Austrian-born German politician who was the dictator of Germany during the Nazi era, which lasted from 1933 until his suicide in 1945. He rose to power as the leader of the Nazi Party,[b] becoming the chancellor of Germany in 1933 and then taking the title of Führer und Reichskanzler in 1934.[c] Germany's invasion of Poland on 1 September 1939 under his leadership marked the outbreak of the Second World War. Throughout the ensuing conflict, Hitler was closely involved in the direction of German military operations as well as the perpetration of the Holocaust, the genocide of about six million Jews and millions of other victims.
When it comes to politics and studies... We all should know by now to research those sources too, right?
"The Manhattan Institute for Policy Research is an American 501(c)(3) nonprofit[5] conservative think tank."
It is a report generated by this conservative organization (that presumably gets donations from many other conservatives). Is there a chance that the report itself is suspect?
"Bias" here is just sentiment analysis. The report (from a conservative think-tank) is not about factual errors. Plus, the effect they find shows only for US politics, where there is really not much of a "left".
> Findings show that Wikipedia entries are more likely to attach negative sentiment to terms representative of right-leaning political orientation than to their left-leaning counterparts
Is that a bias or just reality?
Right leaning politicians in the US include people paying underage girls for sex, people screaming about "Jewish Space lasers", people obviously stealing money in plain sight with crypto pumps and dumps, people running away from responsibility, people getting caught engaging in sexual acts in public, and on and on and on. Their left-wing equivalents are... extremely mild by comparison. What, some run of the mill corruption and sexual comments that resulted in resignations?
If go past "right wing is associated with more negative things", and look into what those negative things are, you'd realise it's just reality. Just because there are two parties and two categories of political leanings doesn't mean they are somehow equal.
It is bias if your "factual reality" over-exaggerates the "facts" for the "bad guys" and under-exaggerates / completely neglects to report on the "good guys"
But to put it with John Steward, what if reality itself has a left leaning bias?
What if left leaning people have empirically broader empathy [0] which could imply that right leaning people have in tendency worse personalities. I guess you would attest yet another biased article here.
There are also genuinely good guys and bad guys. Reality, itself, has a bias. To think that ideology doesn't correlate at all with how moral you might act is, frankly, stupid. Not all positions are created equal.
Duckdb is wonderful. I have several multi-TB pipelines that I moved over from spark and dask. It's so much easier to think in terms of knew machine that gets it's resources used efficiently instead of distributed/cloud processes. It even let me take some big data things back to on-prem from aws.
reply