Tag view

#storage

Cross-subject tag search for related interview cards.

Clear

Results update as you type. Press / to jump straight into search.

Tagged with storage

4 cards

DevOps Easy Theory

AWS S3 basics

S3 is durable object storage used for files, backups, logs, static assets, and data lake style workloads.

  • Object storage not block storage
  • Highly durable
  • Great for static files and archives

AWS S3 basics

ETL / Data Engineering Easy Theory

Parquet vs CSV vs JSON

CSV is simple but weakly typed, JSON is flexible but verbose, and Parquet is compressed columnar storage optimized for analytics.

  • CSV is easy to inspect
  • JSON handles nested structure
  • Parquet is best for warehouse scans

Parquet vs CSV vs JSON

ETL / Data Engineering Easy Theory

What is a data lake?

A data lake stores large volumes of raw or semi-structured data cheaply for later processing.

  • Raw and flexible storage
  • Schema can be applied later
  • Needs governance to avoid becoming messy

What is a data lake?