1 Setting up your data science environment
The steps below set up a development environment that includes Git and GitHub, Anaconda, and an integrated development environment (IDE):
First, you’ll need to install Git on your computer. Git is a version control system that allows you to track changes in your code and collaborate with other developers. You can download the latest version of Git from the official website: https://git-scm.com/downloads
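Once Git is installed, it is worth confirming that it is reachable from your command line. The following is a minimal sketch, assuming Python is already available on your machine; the function name `git_version` is a hypothetical helper and not part of Git or any library.

```python
import shutil
import subprocess


def git_version():
    """Return the output of `git --version`, or None if Git is unavailable."""
    # shutil.which looks for the executable on PATH, like a shell would.
    git_path = shutil.which("git")
    if git_path is None:
        return None
    result = subprocess.run(
        [git_path, "--version"], capture_output=True, text=True
    )
    # A non-zero exit code means the command could not report a version.
    if result.returncode != 0:
        return None
    return result.stdout.strip()


if __name__ == "__main__":
    version = git_version()
    if version is None:
        print("Git was not found on PATH; install it before continuing.")
    else:
        print(version)
```

If the script reports that Git was not found, reopen your terminal after installation so that the updated PATH is picked up.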
Next, create a GitHub account if you don’t already have one. GitHub is a web-based platform for version control and collaboration that uses Git. You can sign up for a free account at https://github.com/.
Then install Anaconda, a distribution of Python and R that makes it easy to manage dependencies and packages for data science. You can download the latest version of Anaconda from the official website: https://www.anaconda.com/products/distribution.
After installing Anaconda, create a new environment for your data science project: open Anaconda Navigator, click the Environments tab, and then click the Create button. You can then set the name of the environment and the version of Python or R you want to use.
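An environment can also be created from a script instead of through Anaconda Navigator. The sketch below shows one way to do this with the `conda create` command, assuming `conda` is on your PATH. The helper names `conda_create_command` and `create_environment`, the environment name `ds-project`, and the Python version `3.11` are all placeholders chosen for illustration.

```python
import shutil
import subprocess


def conda_create_command(env_name, python_version):
    """Build the argument list for `conda create` without running it."""
    return [
        "conda", "create", "--yes",
        "--name", env_name,
        f"python={python_version}",
    ]


def create_environment(env_name, python_version):
    """Run `conda create`; return True on success, False otherwise."""
    conda_path = shutil.which("conda")
    if conda_path is None:
        # conda is not installed or not on PATH.
        return False
    command = conda_create_command(env_name, python_version)
    # Use the resolved path so the call also works where conda is a script.
    completed = subprocess.run([conda_path] + command[1:])
    return completed.returncode == 0


if __name__ == "__main__":
    # Placeholder values; this only prints the command and creates nothing.
    print(" ".join(conda_create_command("ds-project", "3.11")))
```

Calling `create_environment("ds-project", "3.11")` would actually create the environment. You can then activate it with `conda activate ds-project` in a terminal.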
Finally, you can install your preferred IDE:
Spyder IDE is included in your Anaconda installation. You might want to add the Notebook plugin.
Visual Studio Code, with the appropriate extensions for the languages you use.
RStudio can be downloaded from https://rstudio.com/products/rstudio/download/#download.
Below you will find more detailed video instructions on installing and using the different tools in your development environment.