Reddit locks down its public data in new content policy, says use now requires a contract

Share via:


Reddit on Thursday is rolling out a new policy aimed at balancing its desire to license its content to larger tech companies, like Google, and protecting users’ privacy. The newly announced “Public Content Policy” will now join Reddit’s existing privacy policy and content policy to guide how Reddit’s data is being accessed and used by commercial entities and other partners. Related to this, the company also announced a subreddit dedicated to researchers working with Reddit’s data.

The announcement comes shortly after Reddit’s stock market debut, which sees the company positioning itself to grow revenue not only from the ads that run on its platform and API usage by developers but also from its corpus of data. The company in its IPO prospectus said it had already made $203 million through data licensing agreements and expects that number to increase over time.

While Reddit hadn’t historically blocked access to its data for AI training purposes, it changed its course last year. Reddit CEO Steve Huffman told The New York Times that it didn’t make sense for Reddit to continue to give “all of that value to some of the largest companies in the world for free,” signaling the company’s plan to move into the data licensing space.

With those efforts now well underway, the new Public Content Policy will further lock down access to Reddit’s data without an agreement.

“Unfortunately, we see more and more commercial entities using unauthorized access or misusing authorized access to collect public data in bulk, including Reddit public content,” Reddit writes in its blog. “Worse, these entities perceive they have no limitation on their usage of that data, and they do so with no regard for user rights or privacy, ignoring reasonable legal, safety, and user removal requests. While we will continue our efforts to block known bad actors, we need to do more to restrict access to Reddit public content at scale to trusted actors who have agreed to abide by our policies. But we also need to continue to ensure that users, mods, researchers, and other good-faith, non-commercial actors have access.”

In other words, access to Reddit data for research and other non-commercial efforts will continue, but those entities that wants to use Reddit’s data for other purposes — including for AI training — will have to pay. In a graphic shared on the blog, Reddit makes this clear, saying that businesses interested in using Reddit data to “power, augment or enhance your product for any commercial purposes” requires a contract.

Image Credits: Reddit

Advertisers, meanwhile, are directed to an ads API for managing campaigns and tracking their performance.

Because the company is essentially just a large website, indexable by search engines, this new policy aims to lock down Reddit content from any unauthorized collection while also respecting users’ rights.

For instance, Reddit says that its partners will have to upload users’ decisions to delete their content. So if users don’t want their personal posts to become fodder for future AI engines, they should be able to opt out. Partners are also restricted by the new policy from using Reddit’s content to identify individuals or their personal information, including for ad targeting. Partners also can’t use Reddit content to spam or harass its users or to conduct “background checks, facial recognition, government surveillance, or help law enforcement do any of the above.”

The policy additionally restricts access to adult media and clarifies that Reddit won’t sell its users’ personal information. The company notes also that it will never license non-public content like private messages or non-public account information, like users’ emails or browsing history, among other things.

To help researchers who want to use Reddit data for non-commercial purposes, the company has established a new subreddit, r/reddit4researchers. The company says it’s partnering with OpenMined to also develop a program to guide and grow researchers’ collaboration with Reddit.



Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Popular

More Like this

Reddit locks down its public data in new content policy, says use now requires a contract


Reddit on Thursday is rolling out a new policy aimed at balancing its desire to license its content to larger tech companies, like Google, and protecting users’ privacy. The newly announced “Public Content Policy” will now join Reddit’s existing privacy policy and content policy to guide how Reddit’s data is being accessed and used by commercial entities and other partners. Related to this, the company also announced a subreddit dedicated to researchers working with Reddit’s data.

The announcement comes shortly after Reddit’s stock market debut, which sees the company positioning itself to grow revenue not only from the ads that run on its platform and API usage by developers but also from its corpus of data. The company in its IPO prospectus said it had already made $203 million through data licensing agreements and expects that number to increase over time.

While Reddit hadn’t historically blocked access to its data for AI training purposes, it changed its course last year. Reddit CEO Steve Huffman told The New York Times that it didn’t make sense for Reddit to continue to give “all of that value to some of the largest companies in the world for free,” signaling the company’s plan to move into the data licensing space.

With those efforts now well underway, the new Public Content Policy will further lock down access to Reddit’s data without an agreement.

“Unfortunately, we see more and more commercial entities using unauthorized access or misusing authorized access to collect public data in bulk, including Reddit public content,” Reddit writes in its blog. “Worse, these entities perceive they have no limitation on their usage of that data, and they do so with no regard for user rights or privacy, ignoring reasonable legal, safety, and user removal requests. While we will continue our efforts to block known bad actors, we need to do more to restrict access to Reddit public content at scale to trusted actors who have agreed to abide by our policies. But we also need to continue to ensure that users, mods, researchers, and other good-faith, non-commercial actors have access.”

In other words, access to Reddit data for research and other non-commercial efforts will continue, but those entities that wants to use Reddit’s data for other purposes — including for AI training — will have to pay. In a graphic shared on the blog, Reddit makes this clear, saying that businesses interested in using Reddit data to “power, augment or enhance your product for any commercial purposes” requires a contract.

Image Credits: Reddit

Advertisers, meanwhile, are directed to an ads API for managing campaigns and tracking their performance.

Because the company is essentially just a large website, indexable by search engines, this new policy aims to lock down Reddit content from any unauthorized collection while also respecting users’ rights.

For instance, Reddit says that its partners will have to upload users’ decisions to delete their content. So if users don’t want their personal posts to become fodder for future AI engines, they should be able to opt out. Partners are also restricted by the new policy from using Reddit’s content to identify individuals or their personal information, including for ad targeting. Partners also can’t use Reddit content to spam or harass its users or to conduct “background checks, facial recognition, government surveillance, or help law enforcement do any of the above.”

The policy additionally restricts access to adult media and clarifies that Reddit won’t sell its users’ personal information. The company notes also that it will never license non-public content like private messages or non-public account information, like users’ emails or browsing history, among other things.

To help researchers who want to use Reddit data for non-commercial purposes, the company has established a new subreddit, r/reddit4researchers. The company says it’s partnering with OpenMined to also develop a program to guide and grow researchers’ collaboration with Reddit.



Source link

Disclaimer

We strive to uphold the highest ethical standards in all of our reporting and coverage. We StartupNews.fyi want to be transparent with our readers about any potential conflicts of interest that may arise in our work. It’s possible that some of the investors we feature may have connections to other businesses, including competitors or companies we write about. However, we want to assure our readers that this will not have any impact on the integrity or impartiality of our reporting. We are committed to delivering accurate, unbiased news and information to our audience, and we will continue to uphold our ethics and principles in all of our work. Thank you for your trust and support.

Website Upgradation is going on for any glitch kindly connect at office@startupnews.fyi

More like this

M4 MacBook Pro teardown reveals nearly identical internals to...

While the new M4 MacBook Pros got a...

Apple @ Work: Understanding Apple’s Private Wi-Fi Address feature

Apple @ Work is exclusively brought to you...

Apple Card will soon stop offering 3% cash back...

Earlier this week, Apple announced some additional 3%...

Popular

Upcoming Events

Startup Information that matters. Get in your inbox Daily!