Think Stats Exploratory Data Analysis Second Edition by Allen B Downey – Ebook PDF Instant Download/Delivery:
9781491907368, 1491907363
Full download Think Stats Exploratory Data Analysis Second Edition after payment
Product details:
ISBN 10: 1491907363
ISBN 13: 9781491907368
Author: Allen B Downey
If you know how to program, you have the skills to turn data into knowledge, using tools of probability and statistics. This concise introduction shows you how to perform statistical analysis computationally, rather than mathematically, with programs written in Python. By working with a single case study throughout this thoroughly revised book, you’ll learn the entire process of exploratory data analysis—from collecting data and generating statistics to identifying patterns and testing hypotheses. You’ll explore distributions, rules of probability, visualization, and many other tools and concepts. New chapters on regression, time series analysis, survival analysis, and analytic methods will enrich your discoveries. Develop an understanding of probability and statistics by writing and testing code Run experiments to test statistical behavior, such as generating samples from several distributions Use simulations to understand concepts that are hard to grasp mathematically Import data from most sources with Python, rather than rely on data that’s cleaned and formatted for statistics tools Use statistical inference to answer questions about real-world data
Think Stats Exploratory Data Analysis Second Table of contents:
1. Exploratory Data Analysis
- A Statistical Approach
- The National Survey of Family Growth
- Importing the Data
- DataFrames
- Variables
- Transformation
- Validation
- Interpretation
- Exercises
- Glossary
2. Distributions
- Representing Histograms
- Plotting Histograms
- NSFG Variables
- Outliers
- First Babies
- Summarizing Distributions
- Variance
- Effect Size
- Reporting Results
- Exercises
- Glossary
3. Probability Mass Functions
- PMFs
- Plotting PMFs
- Other Visualizations
- The Class Size Paradox
- DataFrame Indexing
- Exercises
- Glossary
4. Cumulative Distribution Functions
- The Limits of PMFs
- Percentiles
- CDFs
- Representing CDFs
- Comparing CDFs
- Percentile-Based Statistics
- Random Numbers
- Comparing Percentile Ranks
- Exercises
- Glossary
5. Modeling Distributions
- The Exponential Distribution
- The Normal Distribution
- Normal Probability Plot
- The Lognormal Distribution
- The Pareto Distribution
- Generating Random Numbers
- Why Model?
- Exercises
- Glossary
6. Probability Density Functions
- PDFs
- Kernel Density Estimation
- The Distribution Framework
- Hist Implementation
- PMF Implementation
- CDF Implementation
- Moments
- Skewness
- Exercises
- Glossary
7. Relationships Between Variables
- Scatter Plots
- Characterizing Relationships
- Correlation
- Covariance
- Pearson’s Correlation
- Nonlinear Relationships
- Spearman’s Rank Correlation
- Correlation and Causation
- Exercises
- Glossary
8. Estimation
- The Estimation Game
- Guess the Variance
- Sampling Distributions
- Sampling Bias
- Exponential Distributions
- Exercises
- Glossary
9. Hypothesis Testing
- Classical Hypothesis Testing
- HypothesisTest
- Testing a Difference in Means
- Other Test Statistics
- Testing a Correlation
- Testing Proportions
- Chi-Squared Tests
- First Babies Again
- Errors
- Power
- Replication
- Exercises
- Glossary
10. Linear Least Squares
- Least Squares Fit
- Implementation
- Residuals
- Estimation
- Goodness of Fit
- Testing a Linear Model
- Weighted Resampling
- Exercises
- Glossary
11. Regression
- StatsModels
- Multiple Regression
- Nonlinear Relationships
- Data Mining
- Prediction
- Logistic Regression
- Estimating Parameters
- Implementation
- Accuracy
- Exercises
- Glossary
12. Time Series Analysis
- Importing and Cleaning
- Plotting
- Linear Regression
- Moving Averages
- Missing Values
- Serial Correlation
- Autocorrelation
- Prediction
- Further Reading
- Exercises
- Glossary
13. Survival Analysis
- Survival Curves
- Hazard Function
- Estimating Survival Curves
- Kaplan-Meier Estimation
- The Marriage Curve
- Estimating the Survival Function
- Confidence Intervals
- Cohort Effects
- Extrapolation
- Expected Remaining Lifetime
- Exercises
- Glossary
14. Analytic Methods
- Normal Distributions
- Sampling Distributions
- Representing Normal Distributions
- Central Limit Theorem
- Testing the CLT
- Applying the CLT
- Correlation Test
- Chi-Squared Test
- Discussion
- Exercises
Index
Colophon
People also search for Think Stats Exploratory Data Analysis Second :
think stats exploratory data analysis second edition
exploratory data analysis statistics
types of exploratory data analysis
think stats exploratory data analysis
Tags:
Allen Downey,Think Stats Exploratory,Data Analysis