Software¶
Pages: Course Description · Class Schedule · Homework · Software
During the course, we will mainly be using Jupyter Notebook. Jupyter Notebooks can either be run locally (e.g., using Anaconda) or in the cloud (e.g., using Google Colab). We will be distributing the lecture materials as Google Colab notebooks, but you are welcome to download them and run them locally.
You can choose either way to run these notebooks depending on how much setup and installation you want to do on your local machine. We will be using Colab for live lectures to minimize the disruption due to local installation issues.
Please talk to the PI during office hours if you face any challenges with the software.
Python Version¶
We recommend Python 3.12 for this course. Any version 3.9 or newer will work.
Running Notebooks in the Cloud (Colab)¶
Google Colab lets you run Jupyter notebooks in the cloud without installing Python locally. It is ideal for this course's live coding and homework. This guide covers launching notebooks, using GPUs, managing files, installing packages, and ensuring reproducibility.
Quick Start¶
- Log in to your Google account.
- (Optional) Claim the free pro account for education to access more compute resources here
- Go through the official Colab tutorial here.
- Open the shared Colab link from the schedule or homework pages.
- Click Open in Playground to make an editable copy.
- Execute a cell with Ctrl+Enter(Cmd+Enteron macOS).
- Add a new cell with the + Code button or by pressing Bwhile a cell is selected. Colab saves your changes to Google Drive and requires no local installation.
You can find more details below.
Launching Colab¶
- Open a shared link from the syllabus or homework page. It will open in Colab.
- If the notebook is view-only, click the dropdown next to the notebook title and choose Open in Playground mode or use File → Save a copy in Drive to make your own editable copy.
- Run a cell with Ctrl+Enter(Cmd+Enteron macOS). Run and advance withShift+Enter.
Open from GitHub¶
- Use File → Open notebook → GitHub and paste a repo or file URL.
- Alternatively, append a GitHub URL to Colab: https://colab.research.google.com/github/<user>/<repo>/blob/<branch>/<path>.ipynb.
Runtimes, GPU/TPU, and Sessions¶
- Check current runtime type via Runtime → Change runtime type.
- Hardware accelerator: None, GPU, or TPU.
- Recommended for this course: GPU for deep learning content.
- Reset or restart via Runtime → Restart runtime.
- Colab sessions are ephemeral:
- Idle timeout: typically ~90 minutes without execution.
- Max session length: often 12 hours; GPU sessions may be shorter and quota-dependent.
- Local files in /contentvanish after the session ends; use Drive to persist.
Installing Packages¶
Use pip inside a cell. Installs persist only for the session.
- If you have a requirements.txt(e.g., from homework), upload it or fetch it, then:
- Prefer version pins for reproducibility (e.g., numpy==1.26.4).
Files and Data¶
Temporary workspace¶
- The working directory is /content. Anything saved here is temporary.
Upload and download¶
- Upload small files via the left sidebar → Files → Upload, or programmatically:
- Download a file you generated:
Mount Google Drive (persistent storage)¶
- Mount your Drive at /content/drive:
- Your files appear under /content/drive/MyDrive/. Example save/load:
import pandas as pd
path = '/content/drive/MyDrive/cme193/data.csv'
df = pd.read_csv(path)
df.to_csv('/content/drive/MyDrive/cme193/output.csv', index=False)
Download from the web or Drive (by link)¶
- Using wgetorcurlin a cell works well for public URLs:
- For Google Drive file IDs (publicly shared), use gdown:
Notebooks, Shortcuts, and UI Tips¶
- Insert cell below: B. Insert cell above:A.
- Delete cell: DthenD.
- Interrupt execution: Runtime → Interrupt executionor stop button.
- Clear outputs: Edit → Clear all outputs.
- Variable inspector: use %whosin a cell to list variables.
Reproducibility Best Practices¶
- Add a setup cell at the top of each notebook that installs and verifies versions:
import sys, platform
print('Python', sys.version)
print('Platform', platform.platform())
import numpy as np, pandas as pd, sklearn
print('numpy', np.__version__)
print('pandas', pd.__version__)
print('scikit-learn', sklearn.__version__)
- Pin versions in pipinstalls to avoid surprises.
- Keep data in Drive and reference it via absolute paths.
- Rerun the setup cell after switching runtime types (e.g., enabling GPU).
Using GPUs and PyTorch/TensorFlow¶
- After enabling GPU in runtime settings, verify availability:
import torch
print('CUDA available:', torch.cuda.is_available())
print('Device count:', torch.cuda.device_count())
- Choose device in your code:
- For TensorFlow 2, verify GPU:
Connect to Local Runtime (optional)¶
You can run Colab’s front-end against your own machine’s Jupyter kernel.
- Install Jupyter and the Colab Jupyter extension locally:
- Start a local server:
- In Colab: Connect → Connect to local runtime and enter http://localhost:8888/.
Common Issues and Fixes¶
- Package missing after restart: Reinstall in the first cell or install from requirements.txt.
- Module version mismatch: Pin versions and restart runtime after install.
- Out-of-memory errors: Use a smaller batch size, reduce dataset size, or switch to a GPU runtime.
- FileNotFoundError: Confirm path under /contentvs/content/drive/MyDriveafter mounting.
- Permission errors on Drive: Ensure the file is in your Drive or shared with your account and that you have mounted Drive.
- If you hit GPU quota limits, switch to CPU for non-deep-learning exercises or try again later.
Running Notebooks Locally¶
There are several ways to install Python locally; we recommend using conda. There are two common conda distributions: Anaconda (comes with a GUI and preinstalled packages) and Miniconda (bare-bones CLI tools). Please refer to the Anaconda guide for more details. You can choose either version depending on your preference. If you are not comfortable with the command line, we'd recommend using Anaconda, which comes with a GUI called Anaconda Navigator.
Anaconda (recommended and comes with a graphical user interface)¶
- Install Anaconda and choose the Python 3.12 distribution. This bundles Python, Jupyter Notebook, and common scientific packages.
- Open Anaconda Navigator (found in your Start menu on Windows, or Applications on macOS). On the home screen, locate and click the "Launch" button under Jupyter Notebook.
- From the Notebook Dashboard, navigate to the directory where you want to store your notebook. Click the "New" button on the right side and select the desired kernel (e.g., Python 3.12). This will open a new, untitled notebook in a new browser tab.
- Click on "Untitled" at the top of the notebook to rename it.
- Jupyter Notebooks are composed of cells. You can have:
- Code Cells: For writing and executing code (e.g., Python).
- Markdown Cells: For adding text, explanations, and formatting.
- Executing Code: Type your code into a code cell and press Shift + Enter to execute it. The output will appear directly below the cell.
- Saving: Notebooks are automatically saved periodically, but you can also manually save by clicking the "Save" icon or going to "File" > "Save and Checkpoint."
- Sharing: Notebooks can be downloaded in various formats (e.g., .ipynb, HTML, PDF) for sharing.
Miniconda (lightweight but requires command-line interface)¶
- Download Miniconda (Python 3.12) from the official site: https://docs.conda.io/en/latest/miniconda.html
- Install Miniconda and open a new terminal (macOS/Linux) or Anaconda Prompt (Windows).
- Create a clean environment for this course:
- Activate the environment:
- Install Jupyter and the IPython kernel into this environment:
- (Optional but recommended) Register the kernel so it shows up in Jupyter as a named option:
- Launch Jupyter Notebook:
   
   If a browser tab does not open, copy the http://127.0.0.1:8888/...orlocalhost:8888URL from the terminal into your browser (the port8888might be different if you have other services running on your machine).
- In a notebook, select the kernel via Kernel → Change kernel → Python (cme193) if you registered it in step 6.
Need extra packages? Use conda install <package> or pip install <package> within your environment before relaunching the notebook server.