Knowledge Hub

So Which Machine Learning Algorithm to use?

A lot of data science practitioners found the process of selecting a machine learning algorithm overwhelming and confusing. That’s because there are a bunch of algorithms that can do the same task. For example, classification can be done using a ...
Apache Hive Table Types | Apache Hive

Apache Hive Table Types | Apache Hive

Apache Hive is designed to give data engineers and data scientist a SQL like access to the big data available in Hadoop cluster, so we can think of it as a normal RDBMS, in normal RDBMS we have database, and ...
Data Science Roadmap .. Concepts, Tools, and Technologies

Data Science Roadmap .. Concepts, Tools, and Technologies

In this article, we will depict some skills and concepts that must be learned in the journey of becoming a data scientist but first, what is data science?  Data Science is the art of uncovering the insights and trends in ...
Introduction to Hive | Apache Hive

Introduction to Hive | Apache Hive

Hive was initially developed by Facebook in 2007 to help the company handle massive amounts of new data. At the time Hive was created, Facebook had a 15TB dataset they needed to work with. A few short years later, that ...
Setup Talend Open Studio on Linux

Setup Talend Open Studio on Linux

Introduction Talend is an open-source data integration platform. It provides different solutions and services for data integration, data quality, cloud storage, and Big Data. According to the latest Gartner report, Talend named in the leader’s quadrant among other data integration ...
How to choose your ETL solution | Data Integration

How to choose your ETL solution | Data Integration

ETL stands for Extraction Transform Load is a common concept in data engineering, and as we can imply from the name of the concept that this concept has three types of operations, Extract which indicate the process of extracting data ...