Firestore Database#

Firestore is Google’s serverless NoSQL document database.

Why use it for scraping?#

  • Flexible schema (scraped data formats change often).
  • Real-time updates (build dashboards that update instantly when the scraper runs).
  • Generous free tier.

Example#

from google.cloud import firestore

db = firestore.Client()
doc_ref = db.collection("job_postings").document("job_123")
doc_ref.set({
    "title": "Data Scientist",
    "company": "Google",
    "scraped_at": firestore.SERVER_TIMESTAMP
})