Tools in Data Science

    Spreadsheet: Excel, Google Sheets

    You’ll use spreadsheets for data cleaning and exploration. The most popular spreadsheet program is Microsoft Excel, followed by Google Sheets.

    You may already be familiar with these. If not, make sure to learn the basics of both.

    Go through the Microsoft Excel video training and make sure you cover:

    • Intro to Excel
    • Rows & columns
    • Cells
    • Formatting
    • Formulas & Functions
    • Tables
    • PivotTables

    For an introduction to Google Sheets, watch this video:

    Google Sheets Tutorial for Beginners (49 min)
