> The most surprising thing to me in this is that the non-profit will still exist.
I'm surprised people are surprised.
>> That entity will scrape the internet and train the models and claim that "it's just research" to be able to claim that all is fair-use.
a lot of people and entities do this though... openAI is in the spotlight, but scraping everything and selling it is the business model for a lot of companies...
Scraping the web, creating maps and pointing people to the source is one thing; scraping the web, creating content from that scraping without attributing any of the source material, and arguing that the outcome is completely novel and original is another.
In my eyes, all genAI companies/tools are the same. I dislike all equally, and I use none of them.
> creating content from that scraping without attributing any of the source material, and arguing that the outcome is completely novel and original is another.
That's the business model of lots of companies. Take, collect and collate data, put it in a new format more useful for your field/customers, resell.
I'm surprised people are surprised.
>> That entity will scrape the internet and train the models and claim that "it's just research" to be able to claim that all is fair-use.
a lot of people and entities do this though... openAI is in the spotlight, but scraping everything and selling it is the business model for a lot of companies...