Tuesday, 19 September 2017

Reading and inserting Arabic using Talend

·         Arabic Data supports UTF-8 characters
·         UTF-8 encodes each Unicode character as a variable number of 1 to 4 octets, where the number of octets depends on the integer value assigned to the Unicode character. It is an efficient encoding of Unicode documents that use mostly US-ASCII characters because it represents each character in the range U+0000 through U+007F as a single octet
Steps
·         First we need create Table to accept UTF-8 encoding format field.
·         For example I need to load the Arabic names into Name filed so the create statement will be like
o   “create table test.L_Arabic (Id int,Name Varchar(255)CHARACTER SET utf8 COLLATE utf8_general_ci NOT NULL);”
·         Design a Talend job
·         tFixedFlowInPut-àtMysqloutput


Configure tMysqloutput with required parameter.
We need to put in Advance setting tab"useUnicode=true&characterEncoding=UTF-8"


Save and Run the job.
Output


2 comments:

  1. its good to know that other languages(Arabic) can be dumped into DB.

    ReplyDelete

java.io.IOException: org.eclipse.aether.deployment.DeploymentException: Failed to deploy artifacts: Could not transfer artifact Return code is: 400, ReasonPhrase: Repository does not allow updating assets: releases.

 HI, by Default in Nexus release branch will not allow redeploy of the Same version of Talend job. if we need to redeploy same job again int...