Below are posts associated with the “copyright” tag.
digital labor and generative AI: what Stack Overflow CEO Prashanth Chandrasekhar gets wrong
This morning, while getting ready for the day, I spent some time catching up on podcasts, including Nilay Patel’s interview of Stack Overflow CEO Prashanth Chandrasekhar on a recent episode of Decoder (a podcast I’ve spent a lot more time listening to since it went ad free for subscribers). I ditched the Stack Exchange network a year and a half ago over digital labor concerns—I was literally being prevented from deleting my own content from the site, which is bonkers—and I’m honestly not sure why I bookmarked the interview for listening a few days ago. I think it was more than a hate listen, though: For all of my own feelings about generative AI, I make an effort to be open minded, and I was interested in the headline for the interview: “Stack Overflow users don’t trust AI. They’re using it anyway.”
🔗 linkblog: OpenAI’s billion-dollar Disney deal puts Mickey Mouse and Marvel in Sora
Involving Disney, who infamously stiffed Alan Dean Foster on Star Wars royalties, so clearly demonstrates how the underlying issue with generative AI isn’t copyright, it’s labor.
what is the correct monkey paw threshold?
One of the great “be careful what you wish for” stories is The Monkey’s Paw in which a family receives a magic item that grants wishes but discovers to their horror that all the wishes are granted in terrible, horrible ways. I can’t remember when I last read the story (though I’m confident I have—maybe in high school?), but monkey paw has stuck in my brain as the metaphor for this idea that wishes can go terribly, terribly wrong, so you really ought to think them through.
🔗 linkblog: People Are Crashing Out Over Sora 2’s New Guardrails
Look, maybe this is a genuine misstep on OpenAI’s part, but it still feels to me like the company started with the guardrails off so that it could use this kind of user backlash to push the Overton Window in conversations with rightsholders.
Also, remember that we small potatoes rightsholders will never be able to have our voices heard like Disney or Nintendo.
🔗 linkblog: OpenAI wasn’t expecting Sora’s copyright drama
Something feels off here. An AI CEO who claims they genuinely didn’t anticipate copyright and deepfake concerns is either dumb or playing dumb. I can’t help but suspect the latter, which is arguably worse, since it suggests an effort to shift the discourse before complaints come in.
🔗 linkblog: OpenAI’s Sora 2 Copyright Infringement Machine Features Nazi SpongeBobs and Criminal Pikachus
I continue to believe that cracking down on intellectual property is not the right way to resist AI, but Koebler does a great job of describing how maddening it is that big companies are going to get away with worse infringement than individual people taking advantage of fair use.
why I think labor, not copyright, is the foundational problem with AI scrapers
This morning on Bluesky, I saw some posts about a class action lawsuit against Anthropic for their use of pirated, copyrighted materials in training their generative AI models. One of the sources of these copyrighted materials was the LibGen database, which I took a peek at nearly six months ago to confirm what I was already sure to be true: that my scientific writing was also collected as training material by companies like Anthropic or Meta. I don’t love that big tech companies are profiting off of my work in this way, and I’m sympathetic to the authors who are taking legal action against Anthropic. However, as I’ve written repeatedly over the past few years (you can find some of those thoughts—and others—by scrolling through here, I don’t know that copyright is the right way of responding to this kind of abuse.
🔗 linkblog: AI industry horrified to face largest copyright class action ever certified
Again, I’m not sure copyright is the way to go in fighting immoral generative AI companies (that the ALA and EFF are on Anthropic’s side seems important to me), but “we have to be able to do this to be successful” still strikes me as such a hollow, self-serving argument.
Jacques Ellul and success as the only techbro metric
When I was in grad school, a faculty member in my program told me a story about his then-quite-young son, who was having a grand old time climbing on top of the kitchen table and then leaping off of it to the floor below. (Truth be told, my memories of this conversation are fuzzy, and the son might have been engaged in some otherwise dangerous behavior.) The father tried to tell the son to stop doing this, warning: “You could have hurt yourself!” The son’s response? “But I didn’t!” Sure, the action had been potentially dangerous, but the landing had been a success, and the son didn’t see what the big deal was.
🔗 linkblog: OpenAI's viral Studio Ghibli moment highlights AI copyright concerns | TechCrunch
Generative AI products make me mad, I don’t like them, and I’m not going to defend them. That said, if this gets framed as a copyright problem, is there any way to give Studio Ghibli (or Pixar or the Seuss estate) power to cry foul here that doesn’t also shut down fan art, parodies, and the like? I’m skeptical, and that’s why I think “labor” is the more productive—if more legally ambiguous—framing here.
thoughts on academic labor, digital labor, intellectual property, and generative AI
Thanks to this article from The Atlantic that I saw on Bluesky, I’ve been able to confirm something that I’ve long assumed to be the case: that my creative and scholarly work is being used to train generative AI tools. More specifically, I used the searchable database embedded in the article to search for myself and find that at least eight of my articles (plus two corrections) are available in the LibGen pirate library—which means that they were almost certainly used by Meta to train their Llama LLM.
🔗 linkblog: OpenAI declares AI race “over” if training on copyrighted works isn’t fair use'
I believe that scraping the internet to profit off of generative AI is ethically problematic BUT I concede that it should be fair use BUT this is still a soulless and terrible argument.
hoopla and other apps that make digital books worse than physical ones
I have mixed feelings about the digital library app hoopla—which offers access to ebooks, electronic comics, and other media that my library doesn’t necessarily carry in physical format—but it’s so dang useful that I keep using it despite some hesitations (see this post for some recent complaints). Tonight, though, as I tried to wrap up the introduction to the English translation of Jacques Ellul’s Théologie et technique (which I ought to just buy in French-language physical format, since its publishing house offers 5€ shipping to the U.S.), I noticed something that really made me mad.
🔗 linkblog: Tintin and the fascists'
As someone who once owned Tintin au pays des Soviets (but also sold it more out of distaste of the art than the politics) and who is currently rereading a less-but-still-problematic classic Belgian bande dessinée, I really appreciated these reflections.
I hadn’t realized that Tintin was about to enter the public domain, and that’s exciting! I’m more optimistic that Tintin can be reclaimed from its fascist, racist roots than Werd is. I worry that that’s because I haven’t fully confronted those roots, but I hope it’s because there’s an anarchist Tintin parody/remix out there that’s already demonstrated that the character can be repurposed for other means. I regret not buying the book when I had a chance to a month or two ago, but maybe Tintin in the public domain will allow me to find other subversive takes on the character.
proposing legislation on Creative Commons for the 2025 Community of Christ World Conference
Even after many years of attending, being a member of, and now serving in Community of Christ, I’m still alternately surprised by how many things are the same as my Latter-day Saint upbringing and how many things are different. In the latter category, even though I’ve intellectually understood this for a while, it still surprises me that the World Conferences of Community of Christ (renamed from General Conferences in the 1960s) are sites of debate and discussion rather than a series of sermons.
🔗 linkblog: Big publishers think libraries are the enemy'
A good take by Molly White. I remember when I stopped thinking about ebooks in terms of screens (as opposed to paper) and started thinking about them in terms of DRM (as opposed to free use). DRM helps the already powerful at the expense of everything else, and I want to do more to push back against it.
🔗 linkblog: Second Circuit Says Libraries Disincentivize Authors To Write Books By Lending Them For Free'
Deeply appreciate Masnick’s writeup. I don’t know the ins and outs of the law, and that’s given me some pause in being upset about the ruling. To see a lawyer find fault with so much gives me greater confidence in my own frustration.
The most dangerous part, though, doesn’t require a law degree to understand. The logic of the findings poses a threat to all libraries, not just this one.
🔗 linkblog: The Internet Archive just lost its appeal over ebook lending'
In a weird kind of Streisand effect, I’ve only started using the Internet Archive library since this lawsuit began and it’s a fantastic service. I won’t pretend to know the ins and outs of copyright law, but this sucks.
🔗 linkblog: Ex-Google CEO says successful AI startups can steal IP and hire lawyers to ‘clean up the mess’'
What reckless hubris. As I wrote earlier today, I’m in favor of more liberal IP law, but not so that businesses can swallow up content to profit from it.
trying to remember that Disney sucks (even if I like a lot of their IP)
When I was slowly making my way through David Graeber and David Wengrow’s The Dawn of Everything last month, I was having trouble processing all of the ideas in the ambitious, dense book, so I was surprised when one idea sounded familiar: schismogenesis. A few years ago, Cory Doctorow wrote an essay using schismogenesis as a theme. Here’s Doctorow’s explanation of the concept from the original book, and the beginning of his thesis in the post:
📺 tvblog: ❤️❤️❤️❤️🖤 for The Acolyte
This was good! Not perfect: there were some rough edges, it’s hard to take some scenes seriously if you’ve watched The Good Place, and there’s a bad case of the Force working as the plot needs it to. Despite all that, though, I love the franchise leaning into a “actually, the Jedi kind of suck” story, and there were some interesting fight scenes and compelling story elements. Happy to see Star Wars experimenting like this… though, of course, it should be moving into the public domain so that everyone (not just Disney) can do that experimentation.
follow up on not having control over my own research
Back in December, I wrote a frustrated post about an article I’d submitted to a special issue that was now being repackaged into an edited volume, in which my research would appear as a chapter. At the time, I wrote about how frustrated I was at the lack of control I had over my own research output. I might well have consented to having my work reprinted in this new format, but I was frustrated that my consent was neither sought nor necessary for the process. I concluded that post with a quip that:
🔗 linkblog: Journalists “deeply troubled” by OpenAI’s content deals with Vox, The Atlantic'
In a roundabout way, I think this helps demonstrate why scraping data for generative AI isn’t a question of copyright. Even when there is a legal agreement, it can still be exploitative—it’s a question of digital labor.