Talend is an open-source data integration platform. It provides different solutions and services for data integration, data quality, cloud storage, and Big Data. According to the latest Gartner report, Talend is named in the leader’s quadrant among other data integration solutions. In this article, we will show you step-by-step how to install and configure Talend Open Studio on the Linux operating system. This article will also include the steps to install Java as a prerequisite for Talend.
In the next article, we will demonstrate how to use Talend ETL capabilities to implement one of the critical types of Data Warehouse table, which is the Slowly Changing Dimension table, but before that, in this article, we will make sure we have Talend is installed and configured correctly to start using in the coming use cases.
Talend On Linux
Make sure that you have the following prerequisites before starting the installation:
- Talend Open Studio installer files.
Java is a main prerequisite to install Talend, make sure to download the java JRE version for your environment as follows
For JDK version 8
$ Sudo apt install openjdk-8-jdk
For JDK version 11
$ sudo apt install openjdk-11-jdk
In the following steps, we will download Talend installer files and will walk through step by step on how to start Talend Open Studio
- Download the Talend zip file from here http://Download the Talend zip file from here
- Once the download is complete Extract the file from your hard drive
- Check that Java has an environment variable on your machine
Check Java installation
Run the following command to check if you have Java installed and configured on your environment or not
$ echo $JAVA_HOME
If the output is empty of the previous command is empty, follow the next steps to add an environment variable to point to your Java installation
1- open the ~/.bashrc file using vi editor or nano or any kind of file editor you have in your environment
$ nano ~/.bashrc
2- Add the Java environment variable to your environment using the following format
export JAVA_HOME=<directory where java is installed >
Run Talend Open Studio
After extraction of the Talend zip file we just downloaded, open the terminal from the directory where the file was extracted, give access to the user to make changes on the script file
$ chmod +x TOS_DI-linux-gtk-x86.sh
Now start Talend Open Studio by executing the script TOS_DI-linux-gtk-x86.sh from the command terminal
Don’t worry about the failed messages it’s because the installing of third-party libraries on the Windows can’t be installed
As first time user, you need to set up a new project or can also import a Demo project, in the next startup you will have a list of existing projects to select from
- Talend Studio requires specific third-party Java libraries or database drivers (.jar files) to be installed to connect to sources and targets. Those libraries or drivers, known as external modules, can be required by some of Talend components or by some connection wizards or by both. Due to license restrictions, Talend may be unable to ship certain external modules within Talend Studio.
- choose the feature to install Third-party libraries (43)
- click finish
- Accept all the licenses of the selected packages