Cheat Sheet: Exploratory Data Analysis
Cheat Sheet: Exploratory Data Analysis
Package/Method | Description | Code Example |
---|---|---|
Complete dataframe correlation | Correlation matrix created using all the attributes of the dataset. |
|
Specific Attribute correlation | Correlation matrix created using specific attributes of the dataset. |
|
Scatter Plot | Create a scatter plot using the data points of the dependent variable along the x-axis and the independent variable along the y-axis. |
|
Regression Plot | Uses the dependent and independent variables in a Pandas data frame to create a scatter plot with a generated linear regression line for the data. |
|
Box plot | Create a box-and-whisker plot that uses the pandas dataframe, the dependent, and the independent variables. |
|
Grouping by attributes | Create a group of different attributes of a dataset to create a subset of the data. |
|
GroupBy statements | a. Group the data by different categories of an attribute, displaying the average value of numerical attributes with the same category. b. Group the data by different categories of multiple attributes, displaying the average value of numerical attributes with the same category. |
|
Pivot Tables | Create Pivot tables for better representation of data based on parameters |
|
Pseudocolor plot | Create a heatmap image using a PsuedoColor plot (or pcolor) using the pivot table as data. |
|
Pearson Coefficient and p-value | Calculate the Pearson Coefficient and p-value of a pair of attributes |
|
Comments
Post a Comment