Reddit is upping its AI game again, this time with the implementation
  of an LLM-powered harassment filter for the benefit of its army of
  volunteer moderators.

  The existence of the new Harassment Filter was discovered during an
  [1]APK teardown by Android Authority, which spotted its existence in
  version 2024.10.0 of the official Reddit app for Android. That version
  of the official Android Reddit app has appeared in various unofficial
  sources online, but the Google Play Store still shows 2024.08.0 as the
  most recent generally available version.

  The app contains lines of code referencing a large language model
  trained to assist moderators in preventing publication of harassing
  posts, and is backed up by a Reddit [2]help page updated last week that
  reflects the existence of the new tool.

  "The Harassment Filter is an optional community safety setting that
  lets moderators automatically filter posts and comments that are likely
  to be considered harassing," the help page reads. "The filter is
  powered by a Large Language Model (LLM) that's trained on moderator
  actions and content removed by Reddit's internal tools and enforcement
  teams."

  The filter can be enabled in a Reddit community's mod tools, but
  individual moderators will need to have permissions to change subreddit
  settings to enable it.

  The harassment filter can be set to low ("filters the least content but
  with the most accurate results") and high ("filters the most content
  but may be less accurate"), and also includes an explicit allow list to
  force the AI to ignore certain keywords, up to 15 of which can be
  added.

  Once enabled, the filter creates a new tag in the moderation queue
  called "potential harassment," which moderators can review for
  accuracy.

  Reddit's help page says the feature is now available on desktop and the
  official Reddit apps, though it's not clear when the feature was added.
  We asked Reddit for more info, but haven't heard back.

More like (A)IPO

  Reddit, which is busily planning its IPO, has been doing everything it
  can to drum up investor support ahead of its [3]imminent stock market
  debut. As with all things to do with raising cash in 2024, that means
  sticking AI anywhere it can.

  The popular social media platform [4]signed a deal with a previously
  unknown AI business last month to let its models slurp up user
  conversations to train products, earning it $60 million for its
  efforts. While that figure hasn't been explicitly linked to Google,
  both Reddit and the Chocolate Factory announced an expanded partnership
  days after news of the AI deal broke.
    * [5]Thar be safe harbor: Reddit defeats third attempt to unmask
      digital pirates
    * [6]Reddit signs AI training deal with Google – and why OpenAI's
      Altman could be the winner
    * [7]Reddit: If you want to slurp our API to train that LLM, you
      better pay for it, pal
    * [8]Reddit confirms BlackCat gang pinched some data

  [9]According to Reddit, the deal gave Google access to its Data API,
  and through it "an efficient and structured way to access the vast
  corpus of existing content on Reddit … including supporting new ways to
  display Reddit content and providing more efficient ways to train
  models."

  Google, meanwhile, said that Reddit will gain the ability to integrate
  Google's [10]Vertex AI "to enhance search and other capabilities on the
  Reddit platform" as part of the deal. For those unfamiliar, Vertex is
  Google's tool for integrating Gemini capabilities into enterprise
  environments.

  How Reddit users – a notoriously [11]unruly bunch – will react to news
  that AI is being foisted on moderators isn't clear. Redditors haven't
  necessarily [12]reacted well to the company's planned IPO, with some
  referring to the plan as a "pump-and-dump" scheme to enrich Reddit
  leadership at the expense of the site.

  Reddit has experienced the effect of a blowback before, with
  [13]thousands of the site's subreddits going dark last year in protest
  of a plan to begin charging developers to use the site's APIs,
  effectively killing many third-party Reddit clients.

  Reddit knows its users are a risk – it even disclosed its concerns in
  an SEC [14]filing last month regarding its IPO plans. Ostensibly a tool
  to assist moderators, the addition of AI could also be seen as a subtle
  threat – protest again and we'll simply replace you volunteers with
  Gemini, at least until it gets trained on enough Reddit data to start
  siding with the protesters. ®

References

  1. https://www.androidauthority.com/reddit-ai-model-fight-harassment-apk-teardown-3422993/
  2. https://support.reddithelp.com/hc/en-us/articles/23856209638932-Harassment-Filter
  3. https://www.bloomberg.com/news/articles/2024-03-04/reddit-s-meme-stock-forum-users-threaten-to-bet-against-its-ipo
  4. https://www.theregister.com/2024/02/20/reddit_content_ai_deal/
  5. https://www.theregister.com/2024/02/13/reddit_pirates_win/
  6. https://www.theregister.com/2024/02/22/reddit_google_license_ipo_altman/
  7. https://www.theregister.com/2023/04/18/reddit_charging_ai_api/
  8. https://www.theregister.com/2023/06/20/reddit_confirms_blackcat_extortion_attempt/
  9. https://www.redditinc.com/blog/reddit-and-google-expand-partnership
 10. https://cloud.google.com/vertex-ai
 11. https://www.bloomberg.com/news/articles/2024-03-06/reddit-s-ipo-success-hinges-on-infamously-unruly-user-base
 12. https://www.reddit.com/r/technology/comments/1ayve0f/a_lot_of_redditors_hate_the_reddit_ipo_reddit/
 13. https://www.theregister.com/2023/06/12/reddit_api_outage/
 14. https://www.sec.gov/Archives/edgar/data/1713445/000162828024006294/reddits-1q423.htm#i1b9a579e78a34dfa99f7f26daeec195b_2017