
> The most surprising thing to me in this is that the non-profit will still exist.

I'm surprised people are surprised.

>> That entity will scrape the internet and train the models and claim that "it's just research" to be able to claim that all is fair-use.

A lot of people and entities do this, though... OpenAI is in the spotlight, but scraping everything and selling it is the business model for a lot of companies.



Scraping the web, creating maps and pointing people to the source is one thing; scraping the web, creating content from that scraping without attributing any of the source material, and arguing that the outcome is completely novel and original is another.

In my eyes, all genAI companies/tools are the same. I dislike them all equally, and I use none of them.


> creating content from that scraping without attributing any of the source material, and arguing that the outcome is completely novel and original is another.

That's the business model of lots of companies. Take, collect and collate data, put it in a new format more useful for your field/customers, resell.


Not with copyrighted content, though.


Absolutely with copyrighted content, it just depends on what you're doing with it.



