GitHub Cheat Sheet

GitHub Cheatsheet for Data Scientists

Git is a tool used for code management. It is open source and is very helpful for code development and collaboration. Git uses version control of code, which means every change to the code is recorded by version control in form of a database. In case of a mistake, version control allows us to go back in time, compare it to prior versions and help fix the error while causing the least amount of interruption to people who are working on that code....

February 28, 2023 · Pravi Devineni, PhD
Pandas Cheat Sheet

Data Exploration with Python using Pandas

Pandas is a popular data analysis tool that Install and import Pandas pip install pandas import numpy as np import pandas as pd Pandas Data Structures The core value of Pandas comes through the data structure options it provides, primarily Series (labeled, homogenously-typed, one-dimensional arrays) DataFrames (labeled, potentially heterogenously-typed, two-dimensional arrays) Pandas Series Create Series Create empty Series s = pd.Series(dtype='float64') Create Series from dictionary d = {'a': 1, 'b': 2, 'c': 3} s = pd....

January 18, 2023 · Pravi Devineni, PhD