Pandas for Everyone: Python Data Analysis, 2nd Edition by Daniel Y. Chen
Author:Daniel Y. Chen [Daniel Y. Chen]
Format: pdf
Tags: <span><div><p><strong>Manage and Automate Data Analysis with Pandas in Python</strong></p><p>Today, analysts must manage data characterized by extraordinary variety, velocity, and volume. Using the open source Pandas library, you can use Python to rapidly automate and perform virtually any data analysis task, no matter how large or complex. Pandas can help you ensure the veracity of your data, visualize it for effective decision-making, and reliably reproduce analyses across multiple data sets.</p><p><strong><em>Pandas for Everyone, 2nd Edition,</em></strong> brings together practical knowledge and insight for solving real problems with Pandas, even if youre new to Python data analysis. Daniel Y. Chen introduces key concepts through simple but practical examples, incrementally building on them to solve more difficult, real-world data science problems such as using regularization to prevent data overfitting, or when to use unsupervised machine learning methods to find the underlying structure in a data set.</p><p>New features to the second edition include:</p><ul><li><p>Extended coverage of plotting and the seaborn data visualization library</p></li><li><p>Expanded examples and resources</p></li><li><p>Updated Python 3.9 code and packages coverage, including statsmodels and scikit-learn libraries</p></li><li><p>Online bonus material on geopandas, Dask, and creating interactive graphics with Altair</p></li></ul><p>Chen gives you a jumpstart on using Pandas with a realistic data set and covers combining data sets, handling missing data, and structuring data sets for easier analysis and visualization. He demonstrates powerful data cleaning techniques, from basic string manipulation to applying functions simultaneously across dataframes.</p><p>Once your data is ready, Chen guides you through fitting models for prediction, clustering, inference, and exploration. He provides tips on performance and scalability and introduces you to the wider Python data analysis ecosystem.</p><ul><li><p>Work with DataFrames and Series, and import or export data</p></li><li><p>Create plots with matplotlib, seaborn, and pandas</p></li><li><p>Combine data sets and handle missing data</p></li><li><p>Reshape, tidy, and clean data sets so theyre easier to work with</p></li><li><p>Convert data types and manipulate text strings</p></li><li><p>Apply functions to scale data manipulations</p></li><li><p>Aggregate, transform, and filter large data sets with groupby</p></li><li><p>Leverage Pandas advanced date and time capabilities</p></li><li><p>Fit linear models using statsmodels and scikit-learn libraries</p></li><li><p>Use generalized linear modeling to fit models with different response variables</p></li><li><p>Compare multiple models to select the best one</p></li><li><p>Regularize to overcome overfitting and improve performance</p></li><li><p>Use clustering in unsupervised machine learning</p></li></ul></div></span>
Publisher: Addison-Wesley Professional
Published: 2022-12-22T00:00:00+00:00
Download
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.
Distributed Machine Learning with Python by Guanhua Wang(3386)
Getting Started with CockroachDB by Kishen Das Kondabagilu Rajanna(2556)
Exploratory Data Analysis with Python Cookbook by Ayodele Oluleye(1398)
R Web Scraping Quick Start Guide by Olgun Aydin(1065)
Getting Started With CockroachDB: A Guide to Using a Modern, Cloud-Native, and Distributed SQL Database for Your Data-Intensive Apps by Kishen Das Kondabagilu. Rajanna(1020)
PostgreSQL 13 Cookbook: Over 120 recipes to build high-performance and fault-tolerant PostgreSQL database solutions by Vallarapu Naga Avinash Kumar(990)
Mastering PostgreSQL 15 - Fifth Edition by Hans-Jürgen Schönig(664)
Apache Hadoop 3 Quick Start Guide by Hrishikesh Karambelkar(428)
Pandas for Everyone: Python Data Analysis, 2nd Edition by Daniel Y. Chen(420)
Learn SQL with MySQL: Retrieve and Manipulate Data Using SQL Commands with Ease by Ashwin Pajankar(371)
SQL Query Design Patterns and Best Practices by Steve Hughes & Dennis Neer & Dr. Ram Babu Singh & Shabbir H. Mala & Leslie Andrews & Chi Zhang(366)
Deploy Node.js on GCP: A comprehensive guide to deploying Node.js on Google Cloud Platform by Jonathan Lin(357)
Configuring Sales and Distribution in SAP ERP by Unknown(331)
Leveling Up with SQL by Mark Simon(306)
Learning Data Science by Sam Lau(301)
Intermediate Python by Oswald Campesato(298)
Pandas Basics by Oswald Campesato(269)
Data Engineering with AWS: A Comprehensive Guide to Building Robust Data Pipelines by Paul Brian(269)
The Definitive Guide to Data Integration by Pierre-Yves BONNEFOY Emeric CHAIZE Raphaël MANSUY Mehdi TAZI(263)
