Data Science requires a few tools to help us discover interesting features in our data. We will primarily use two tools and several libraries within each of these tools. The three tools are:
python
, a simple programming language (this allows for the computer to do the work for us).git
, a distributed version control system/repository tool (this runs technology behind “github”).All of these tools are free (and open-source), so it just takes a few minutes for you to install them to get started!
You will need Python 3.6 (or later). We will first check if you have Python already (if you have done Data Science) and install it if you don’t already have it.
python --version
and press Enter.
cd Downloads
bash Miniconda3-latest-MacOSX-x86_64.sh
q
to exit the license screen and all default options are fine.Any modern version of git works. We will first check if you have git and install it if you don’t already have it.
git --version
and press Enter.
git version ...
(or similar), you are all set – no need to install git!Modified from Wade Fagen-Ulmschneider & Karle Flanagan’s STAT 107 - Fall 2019 guide with permission.