7 Free Kaggle Micro-Programs for Information Science Novices
Picture by Writer
Do you keep in mind that one information science course you signed up for however by no means bought round to ending? Nicely, you’re not alone.
Most information science newcomers enroll in a number of programs: free or paid. However as a result of information science programs usually cowl a variety of subjects—from programming to information evaluation, visualization, and extra—it takes a number of weeks to work by means of them. And even when they begin sturdy, most learners begin feeling overwhelmed after the primary few modules and fail to make progress. Enter Kaggle (micro)programs.
The sequence of micro-courses from Kaggle are a superb various in case you discover longer programs tougher to get by means of. They’re nice sources to study information science expertise—Python, pandas, machine studying, and extra—with out feeling overwhelmed. The programs are designed such that they take just a few hours to complete, and embody tutorial and follow parts. Now let’s go over some beginner-friendly programs and what they cowl.
Python is likely one of the most generally used languages in information science. Moreover serving to you in your information profession, Python can also be useful if you wish to break into software program engineering sooner or later. The Python course on Kaggle will assist you study the next:
- Python fundamentals (syntax and variables)
- Capabilities
- Booleans and conditionals
- Lists, loops, and checklist comprehensions
- Strings and dictionaries
- Working with exterior libraries
For those who really feel such as you want a good easier intro to programming earlier than diving into Python, you possibly can take a look at the intro to programming course.
As a result of the following programs on Pandas and information visualization require you to be comfy with the contents of this course, you shouldn’t skip the Python course in case you are new to programming with Python.
Hyperlink: Learn Python
When you’re accustomed to fundamental Python you possibly can learn pandas, a strong information evaluation and manipulation library.
By means of a sequence of quick classes and hands-on coding train, the pandas will assist you study to carry out the next operations on pandas dataframes:
- Creating, studying, and writing
- Indexing, choosing, and assigning
- Renaming and mixing
- Abstract features and maps
- Grouping and sorting
- Information sorts and lacking values
Hyperlink: Learn Pandas
Now that you know the way to research information with Python and pandas, it is time to construct on that by studying methods to visualize your information.
The Data Visualization course covers the basics of making useful plots and charts utilizing the Python library Seaborn. The course covers the next:
- Line charts
- Bar charts and warmth maps
- Scatterplots
- Histograms and density plots
- Selecting plot sorts
You additionally must work on a ultimate challenge to use what you realized.
Hyperlink: Learn Data Visualization
SQL is the only most important information science ability that you could study. To grasp why SQL is tremendous necessary for information science, learn “Why SQL is the Language to Learn for Data Science” by KDnuggets contributor Nate Rosidi.
The Intro to SQL course will train you methods to you question information ets with SQL utilizing the BigQuery Python consumer and covers SQL fundamentals, filtering, and writing readable SQL queries:
- Getting began with SQL and BigQuery
- Choose, from, and the place
- Group by, having, and rely
- Order by
- As and with
- Becoming a member of information
Hyperlink: Learn Intro to SQL
Now that you’re comfy with SQL fundamentals, you possibly can take the Advanced SQL course to develop your SQL expertise additional. This course builds on the intro to SQL course and covers the next subjects on combining information from a number of tables and performing extra complicated operations:
- Joins and unions
- Analytic features
- Nested and repeated information
- Writing environment friendly queries
Hyperlink: Learn Advanced SQL
For those who’ve already labored your method by means of the above programs, try to be comfy with programming and information evaluation with Python and SQL. You’re now able to get began with machine studying.
The Intro to Machine Learning course covers:
- How ML fashions work
- Fundamental information exploration
- Mannequin validation
- Underfitting and overfitting
- Random forests
You can too make a submission to a beginner-friendly Kaggle competitors.
Hyperlink: Learn Intro to Machine Learning
The Intermediate Machine Learning course builds on the Intro to Machine Studying course and teaches you methods to deal with lacking values, categorical variables, and keep away from the difficult drawback of information leakage when coaching machine studying fashions.
The subject coated embody:
- Lacking values
- Categorical variables
- ML pipelines
- Cross validation
- XGBoost
- Information leakage
Hyperlink: Intermediate Machine Learning
I hope you discovered this round-up of programs useful.
As talked about, they’re all free. And it solely takes just a few hours to study an important information science ability. So you can begin out in your information science journey one micro-course at a time. Completely satisfied studying!
Bala Priya C is a developer and technical author from India. She likes working on the intersection of math, programming, information science, and content material creation. Her areas of curiosity and experience embody DevOps, information science, and pure language processing. She enjoys studying, writing, coding, and low! Presently, she’s engaged on studying and sharing her information with the developer neighborhood by authoring tutorials, how-to guides, opinion items, and extra.