Knowledge Hub

Scikit-learn Advanced Features | Data Science

Scikit-learn Advanced Features | Data Science

Neither Titanic dataset nor sklearn a new thing for any data scientist but there are some important features in scikit-learn that will make any model pre-processing and tuning easier, to be specific this notebook will cover the following concepts ColumnTransformer ...
Analyze COVID-19 Dataset with Databricks | Databricks Unified Analytics Platform

Analyze COVID-19 Dataset with Databricks | Databricks Unified Analytics Platform

In this article, we will analyze COVID-19 Dataset using Databricks unified analytics platform using the community edition of the platform, which is totally for free and you can use it as your playground to test Apache Spark applications in Python ...
Data Engineering Detailed Roadmap | Data Engineering

Data Engineering Detailed Roadmap | Data Engineering

Data Engineering become a critical part in the past few years in almost any organization that use data heavily in their system, and I am sure you heard a lot about the comparison between data engineers and data scientist and ...
Detailed Guide for String Wrangling in SQL | MySQL | SQL Analysis

Detailed Guide for String Wrangling in SQL | MySQL | SQL Analysis

Extracting information from string columns is almost a repetitive necessity in Data Engineers, Data Scientists, and Business Analysts day to day tasks, and this task can be done using a programming language such as Python, or by SQL depends on ...
Handling Dates and Time in Pandas

Handling Dates and Time in Pandas

Date and Time are part of almost any dataset data scientist, data engineer, or data analyst will work on, so to know how to handle this kind of data is a crucial skill which will save you a lot of ...
Build Data Analysis and Data Discovery Web Application for Data Science projects in few minutes | Data Science | Data Analytics

Build Data Analysis and Data Discovery Web Application for Data Science projects in few minutes | Data Science | Data Analytics

Data preparation and data discovery consume a great amount of time in any data science or data analytics job, one of the solutions is to write a template script that you can use in this phase of your job, but ...