Thursday, May 23, 2019

Data School: Data science best practices with pandas (video tutorial)

The pandas library is a powerful tool for multiple phases of the data science workflow, including data cleaning, visualization, and exploratory data analysis. However, the size and complexity of the pandas library make it challenging to discover the best way to accomplish any given task.

In this in-depth tutorial, which I presented at PyCon 2019, you'll use pandas to answer questions about a real-world dataset. Through each exercise, you'll learn important data science skills as well as "best practices" for using pandas. By the end of the tutorial, you'll be more fluent at using pandas to correctly and efficiently answer your own data science questions.

This is an intermediate-level tutorial, so if you're new to pandas, I recommend starting with my other video series: Easier data analysis with pandas.

If you want to follow along with the exercises at home, you can download the dataset and notebook from GitHub.

Here are some of the topics covered in the video:

  • adjusting for bias in your dataset
  • handling missing values
  • choosing an appropriate plot
  • customizing your plot
  • using the datetime data type
  • filtering using loc versus query
  • using multiple aggregation functions
  • checking for small sample sizes
  • method chaining
  • verifying your results using random samples
  • evaluating a "stringified" Python container
  • applying a custom function to a Series
  • writing lambda functions
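
To give a rough feel for a few of these topics, here is a minimal sketch. It is not taken from the tutorial notebook: the `talks` DataFrame, its column names, and its values are all hypothetical, invented purely for illustration.

```python
import ast

import pandas as pd

# Hypothetical toy data; the tutorial's real dataset is not reproduced here.
talks = pd.DataFrame({
    "event": ["A", "A", "B", "B", "B"],
    "views": [1200, 300, 4500, 800, 150],
    "published": ["2019-01-05", "2019-02-10", "2019-02-11",
                  "2019-03-01", "2019-03-15"],
    "tags": ["['python', 'data']", "['design']", "['python']",
             "[]", "['data', 'science']"],
})

# Use the datetime data type instead of raw strings.
talks["published"] = pd.to_datetime(talks["published"])

# Filtering: loc with a boolean mask versus query with an expression string.
popular_loc = talks.loc[talks["views"] > 500]
popular_query = talks.query("views > 500")

# Multiple aggregation functions at once; "count" exposes small sample sizes.
summary = talks.groupby("event")["views"].agg(["count", "mean", "median"])

# Evaluate a "stringified" list safely, then apply a lambda to the Series.
talks["tag_list"] = talks["tags"].apply(ast.literal_eval)
talks["num_tags"] = talks["tag_list"].apply(lambda tags: len(tags))

# Method chaining: filter, group, aggregate, and sort in one expression.
top_events = (
    talks.query("views > 500")
    .groupby("event")["views"]
    .mean()
    .sort_values(ascending=False)
)
```

In this sketch, `loc` and `query` select the same rows, so the choice between them is mostly about readability. Keeping `count` next to `mean` and `median` makes it easier to spot groups that are too small to trust.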

Let me know if you have any questions. I'm happy to answer them!

P.S. If you like this video, you should check out my interactive pandas course, Analyzing Police Activity with pandas.



from Planet Python
