Data leak

Socialarks Elasticsearch Exposure β€” 214 Million Social Media Profiles Scraped

πŸ“… 2021-01-01 🏒 Socialarks Elasticsearch database
Primary Source β†—

Incident Details

In January 2021, security researchers at vpnMentor discovered a publicly accessible Elasticsearch database belonging to Socialarks β€” a Chinese social media management company that offers social media marketing and customer relationship management services. The database contained approximately 408GB of data comprising approximately 214 million social media user profiles scraped from Facebook (11.5M profiles), Instagram (8.4M profiles), and LinkedIn (66.8M profiles). The scraped LinkedIn data included real names, email addresses, phone numbers, locations, job titles, work history, education history, and social media connections. Facebook data included private contact details not normally publicly available including personal phone numbers. The data appeared to have been scraped in violation of the platforms’ terms of service. LinkedIn and Facebook had both been previous targets of large-scale scraping. vpnMentor researchers found that Socialarks CEO Nolist Chen’s personal information was included in the dataset. Socialarks secured the database after notification. The exposure included profiles of high-profile executives including Facebook CEO Mark Zuckerberg, LinkedIn CEO Ryan Roslansky, and various politicians. The breach raised questions about data brokerage practices and the legality of large-scale social media scraping, particularly given GDPR and similar data protection obligations.

Technical Details

Initial Attack Vector
Socialarks β€” a Chinese social media management company β€” left an Elasticsearch database exposed publicly without authentication; the database contained scraped and aggregated social media profile data collected by Socialarks from LinkedIn, Facebook, Instagram, and other platforms
Vendor / Product
Socialarks Elasticsearch database
Software Package
Elasticsearch

Timeline

  1. 2021-01-01 Breach occurred
  2. 2021-01-11 Publicly disclosed