• Courses
  • Knowledge Hub
  • Cheat Sheets
  • Market Place
  • Plans and Pricing
  • Contact Us
  • Become an Instructor
DataValley
Category
Data Engineering
Data Modeling
Data Science
Machine Learning
Data Visualization
{{ search }}
Log in Sign Up

Login/Sign Up

Courses Favorites 0

Search

Category
Data Engineering
Data Modeling
Data Science
Machine Learning
Data Visualization
{{ search }}

Menu

  • Courses
  • Knowledge Hub
  • Cheat Sheets
  • Market Place
  • Plans and Pricing
  • Contact Us
  • Become an Instructor

Dimensional Modeling … Design Methodology for Analytics Oriented Data Warehouse | Data Warehouse

August 30, 2020radwa.ali Data Engineering, Data Modeling, Data Warehouse

Data warehouses has been around since the 80s. Throughout these years, it has proven its capabilities to support decision making and business analysis. Data warehouses allow Integrating many source systems such as databases, spreadsheets, and flat files. Cleansing and Transformation can be applied to these data after integration then organizes it in a way that […]

Docker Commands | Dockers

August 22, 2020mahmoud.feteha Containerization, Data Engineering, Docker

In our last blog, we talked about Docker architecture, how to install Docker, and the main differences between Containerization vs Virtualization. Here, we are going to dive into and see how to use the Docker in action. Let’s cap up for what we need here, which is the difference between Docker image and Docker container, […]

Your Guide to NoSQL Databases | Data Engineering

August 19, 2020Ahmed Ibrahem Concepts and Technologies, Data Engineering, NoSQL

One of the major reasons that the era of big data started was the increase in the number of data source and variety of data types that each organization has nowadays, almost any organization has different types of data not only structured data but also it can have unstructured or semi-structured data, and each type […]

Getting Started with Containers & Dockers | Dockers

August 17, 2020mahmoud.feteha Containerization, Data Engineering, Docker

Introduction Containerization revolutionized the software development and it becomes a common building block in today’s architecture, applications, big data environments, and data engineering applications can be deployed and developed inside containers In this article, we will know more containers and its advantage, and we will discuss Dockers which is a container image that packages all […]

Ultimate Guide to choose the best Chart for your Dashboard | Business Intelligence

August 15, 2020nada.abdulwahab Business intelligence, Data Analytics, Data Visualization

Introduction Everyone have a different view to the data, you can extract insights from the data and another one extract different insights from the same data.Also the different audiences have different informational needs, so when you’re building your dashboard ask the decision makers: “What are we trying to extract and know from this analyze to […]

GPT-3, The model simply knows!

August 14, 2020Ahmed Abulkhair Concepts and Technologies, Data Science

Natural Language Processing ( NLP ) has been one of the most challenging areas in deep learning. This is due to several reasons. First, human language is complicated, even for humans themselves!  Consider asking someone about his experience in learning Chinese, for example. Without a doubt, he will tell you that this is difficult. The […]

Aggregation Queries in Apache Hive | Apache Hive

August 13, 2020mtarek Apache Hive, Data Engineering

Introduction Data aggregation is the process of gathering and expressing data in a summary to get more information about particular groups based on specific conditions. HiveQL offers several built-in aggregate functions, such as max, min, avg,..etc. It also supports advanced aggregation using keywords such as Variance and Standard Deviation and different types of window functions. […]

Quick Reference to six D’s of the data field

August 10, 2020Ahmed Ibrahem Concepts and Technologies

For any professional or beginner in the data field, regardless of your specialty or technology you will work on, you will hear about one or more of the following concepts, and we can say it is absolutely important for any data professional to know at least the general concept of any of the following concepts. […]

Azure Data Factory – Modern ETL On Cloud – Data Migration Use Case | Azure Data Factory

August 6, 2020Ahmed Ibrahem Azure, Cloud Computing, Data Engineering, Data Integration, ETL

Introduction ETL is one of the major tasks for any data engineer, and we have many solutions either on-premise or cloud solutions available in the market to implement this concept, in Microsoft Azure, Azure Data Factory is the ETL solution to implement data pipelines using data from the cloud source or data from on-premise sources, […]

Scikit-learn Advanced Features | Data Science

Ahmed Abulkhair Data Analytics, Data Science, Python

Neither Titanic dataset nor sklearn a new thing for any data scientist but there are some important features in scikit-learn that will make any model pre-processing and tuning easier, to be specific this notebook will cover the following concepts ColumnTransformer Pipeline SimpleImputer StandardScalar OneHotEncoder OrdinalEncoder GridSearch The dataset used in this article can be found […]

Learn

Courses
Cheat Sheets
Market Place
Plans and Pricing

About

DataValley is the e-learning platform for everything data science. From beginners to gurus, data geeks of all levels can find something at DataValley to help them enhance their skills.

Contact

DataValley Technologies.

wecare@datavalley.technology

Copyright © 2021 DataValley Technologies.
Search