Part 1: Software and Tools for Data Science

Part 1a: Installing Software Tools

Follow the instructions in the following setup guide to install all the required software:

Part 1b: Creating your STAT 107 git repository

Follow our guide to set up git for Data Science Discovery: [git Set Up].

Part 1c: Set up your Python notebook

In Data Science, all of our programming will be done in “notebooks.” Your python install will need a few libraries in order to run the notebooks. Using your command line, run the following:

conda install jupyter
conda install pandas

This will take a bit. You will need to press [Enter] to confirm you want to install of of the packages (the option [y]/n shows that y is default when you choose no option).


Part 2: Complete the lab01 Notebook

Using your command line, navigate to your stat107 repository (cd Desktop -> cd stat107 -> cd [NETID] OR cd OneDrive/Desktop -> cd stat107 -> cd [NetID] for some of you) and fetch the notebook from our release repository by running the following two git commands:

git fetch release
git merge release/lab01 -m "Merging initial files"

ONLY IF you get an error related to unrelated histories, use:

git merge release/lab01 --allow-unrelated-histories -m "Merging initial files"

Open the notebook with the command:

jupyter notebook

Inside of the notebook webpage:

Whenever you are done, you should save the current checkpoint (using File -> Save Checkpoint in the notebook) to save your work. Once your work is saved, you can exit the command line running the notebook with Ctrl + C.


Submitting Your Work

When you have completed working, you should always submit your work (even if you’re not quite finished). We will always grade the latest push you made before the due date (and ignore everything else) — submitting multiple times is okay and encouraged!

Inside of Jupyter:

After exiting Jupyter, your command prompt will return to accept new commands. Using your command prompt, run:

    git add -A
    git commit -m "submission (or any message here)"
    git push origin master

You can verify your submission was made by visiting the web interface to github:

Modified from Wade Fagen-Ulmschneider & Karle Flanagan’s STAT 107 - Fall 2019 guide with permission.