Introduction
September 17th is Practical Business Python’s anniversary. Last year, I reflected on 5 years of growth. This year, I wanted to take a step back and develop a guide to guide readers through the content on PB Python.
As of this writing, I have 84 articles on the site. They vary from fairly complex and lengthy to quick summaries. When I wrote them, I did it based on my interests at the time and without much thought on progression. Now that I have a decent volume of articles, I want to organize them in a more meaningful way.
My ultimate goal for this site is that I want it to be a resource to help people use Python to automate away many of the repetitive tasks they do on a daily basis with tools like Excel. A secondary goal for is to cover more advanced Python topics that are difficult to do in Excel.
One reason for developing this guide is that at least 90% of my traffic comes from organic search. These users come to the site, read an article and move on. I hope those that find this guide will stay a little longer and find other relevant content and use this as a resource to navigate the Python ecosystem.
Secondly, this guide will be useful to help me understand gaps in the content and keep a mental framework for continuing to develop content. My intent is to update the sections below as I add new content. I will also maintain a link at the top of the Archives page to point people in the right direction.
Table of Contents
Getting Started with Python
Before you begin your Python journey, here are a couple of articles that are helpful for getting everything set up on your system:
- Best Practices for Managing Your Code Library
- Building a Repeatable Data Analysis Process with Jupyter Notebooks
- Exploring an Alternative to Jupyter Notebooks for Python Development
- TalkPython Podcast - Escaping Excel Hell with Python and Pandas
Just as importantly, you need to think about how long this journey will take and what you may need to do to spread the knowledge within your organization:
Pandas Fundamentals
Pandas is a rich library with a lot of functionality. If you are new to pandas, this is the order I would recommend reading the articles:
Basic pandas concepts:
- Common Excel Tasks Demonstrated in Pandas and Part 2
- Excel Filter and Edit - Demonstrated in Pandas
- Tips for Selecting Columns in a DataFrame
- Overview of Pandas Data Types
- Using The Pandas Category Data Type
- Creating Pandas DataFrames from Lists and Dictionaries
- Cleaning Up Currency Data with Pandas
- Stylin with Pandas
Grouping and summarizing data:
- Pandas Pivot Table Explained
- Pandas Crosstab Explained
- Understanding the Transform Function in Pandas
- Pandas Grouper and Agg Functions Explained
- Binning Data with Pandas qcut and cut
Data input and output:
Advanced Pandas Concepts
After you have experience with the basics of pandas, here are some articles that describe more complex topics:
Reporting
One of the key challenges with moving to python is figuring out the best way to share your results with others. Here are several posts that describe alternatives:
- Generating Excel Reports from a Pandas Pivot Table
- Improving Pandas Excel Output
- Creating Advanced Excel Workbooks with Python
- Interactive Data Analysis with Python and Excel
- Creating PDF Reports with Pandas Jinja and WeasyPrint
- Automated Report Generation with Papermill and second part
- Creating Interactive Dashboards from Jupyter Notebooks
Python Libraries
There are several libraries that will be useful for this journey. Some are in the standard library and others are 3rd party applications. All can be useful for task automation:
- Using Pythons Pathlib Module
- Web scraping with Beautifulsoup
- Collecting Data with Google Forms and Pandas
- Adding a Simple GUI to Your Pandas Script
- Building a PDF Splitter Application
- Build a Celebrity Look-Alike Detector with Azures Face Detect and Python
- sidetable - Create Simple Summary Tables in Pandas
Working with Windows
Windows has it’s own quirks. Several articles are helpful for working in a Windows environment:
Data Science Topics
Data science is a broad category and many articles are related to various data science topics. These articles are more focused on specific data science tasks:
Data Visualization
Python’s data visualization landscape is complex and it can be difficult to determine the best tool to use. Here are some posts about the visualization landscape:
Articles on some specific libraries.
Matplotlib:
- Simple Graphing with IPython and Pandas
- Effectively Using Matplotlib
- Creating a Waterfall Chart in Python
- Building a Bullet Graph in Python
Altair:
- Introduction to Data Visualization with Altair
- Intro to pdvega - Plotting for Pandas using Vega-Lite
Bokeh:
- Interactive Visualization of Australian Wine Ratings
- Building Bullet Graphs and Waterfall Charts with Bokeh
Plotly:
Seaborn:
Site Updates
This section contains ad-hoc posts about the site and the technology behind it.
Additional Resources
Book reviews and other recommended resources:
from Practical Business Python
read more
No comments:
Post a Comment