Talend is an open-source data integration platform. It provides different solutions and services for data integration, data quality, cloud storage, and Big Data. According to the latest Gartner report, Talend named in the leader’s quadrant among other data integration solutions. In this article, we will show you step by step to install and configure Talend open studio on Linux operating system. This article will include also the steps to install java as a prerequisite for Talend.
In the next article we will demonstrate how to use Talend ETL capabilities to implement one of the critical types of Data Warehouse table, which is Slowly Changing Dimension table, but before that in this article we will make sure we have Talend is installed and configured correctly to start using in the coming use cases.
Talend On Linux
Make sure that you have the following prerequisites before starting the installation:
- Talend Open Studio installer files.
Java is a main prerequisite to install Talend, make sure to download the java JRE version for your environment as following
For JDK version 8
$ Sudo apt install openjdk-8-jdk
For JDK version 11
$ sudo apt install openjdk-11-jdk
In the following steps we will download Talend installer files and we will walk through step by step on how to start Talend Open Studio
- Download the Talend zip file from here http://Download the Talend zip file from here
- Once the download complete Extract the file on your hard drive
- Check that java has an environment variable on your machine
Check Java installation
Run the following command to check if you have Java installed and configured probably on your environment or not
$ echo $JAVA_HOME
If the output is empty of the previous command is empty, follow the next steps to add environment variable to point to your Java installation
1- open the ~/.bashrc file using vi editor or nano or any kind of file editors you have on your environment
$ nano ~/.bashrc
2- Add the java environment variable to your environment using the following format
export JAVA_HOME=<directory where java is installed >
Run Talend Open Studio
After extraction of the Talend zip file we just downloaded, open terminal from the directory of where the file was extracted, give access to the user to make changes on the script file
$ chmod +x TOS_DI-linux-gtk-x86.sh
Now start Talend Open Studio by executing the script TOS_DI-linux-gtk-x86.sh from the command terminal
Don’t worry about the failed messages it’s because the installing of third-party libraries on the Windows can’t be installed
As first time user,you need to set up a new project or can also import Demo project, in the next startup you will have a list of existing projects to select from
- Talend Studio requires specific third-party Java libraries or database drivers (.jar files) to be installed to connect to sources and targets. Those libraries or drivers, known as external modules, can be required by some of Talend components or by some connection wizards or by both. Due to license restrictions, Talend may not be able to ship certain external modules within Talend Studio.
- choose the feature to install Third-party libraries (43)
- click finish
- Accept all the license of the selected packages