June 30, 2013
By using above class and method you can generate a sequence in Talend.
The number you select in tFilter the process will leave those rows to reach to the target.
Thank you, for reading!!!!
June 26, 2013
- shared_conn_demo–>> This job holds our two child jobs (parent,child)
- parent_db_conns–>>The primary job is the one in which connection(shared) is registered for our sub jobs.
- child_job_conn–>>This is the third job which will be using the connection created in parent job.
just fill all the enteries required to create a connection here.
and leave all the field blanks.
2) shared_conn_demo:-Now you just have to connect both the jobs and then run them.
Give the trigger as OnSubjobOk
That’s it now you can use or share your connections through out your all jobs and save connections.
Thank you for reading…
June 24, 2013
This post would give a fair intoduction to R-programming .
1)To get started with R, download and install R for your desktop from r-project.org .R is not available for Android or iOS.
2)I would suggest you install RStudio a free IDE which has numerous features.
3)Setting your work directory
Use the setwd() function to change or set your working directory
eg. setwd(“~/directory_name”) , we use only forward slashes in case of Windows as well as Unix.
4)Instaling and using packages
We can take advantage of various add-ons packages which have been developed . To install this package , we use the following command,
to see all installed packeges , use the following command,
Now to use this installed package , load it with the following command
In the coming posts we will cover some basic commands , along with a scenario depicting data-analysis.
June 22, 2013
Late arriving dimensions issues are commonly encountered in a datawarehousing environment.
The snapshot below shows the overall mapping to resolve the late arrviving mapping
In this case we consider a type1 Insert/Update SCD dimension table and use a reusable sequence to resolve this issue:
Here is the snapshop of the over all mapping:
These are the overall steps followed:
1)We have 2 dimension tables namely the product dimension and the store dimension ,which are used for fact table loading (sales fact table).
2) We simulated a condition where in the late arriving dimension issue arises in the product dimension while loading into the sales fact table.
3)Add the source table as the sales oltp table .
4)Add two look up transformation for the product and the store dimension.
5)We have added a sequencer named product_dimension_seq , this transformation is reusable sequence transformation that i have used while loading into the product dimension table , this helps me to come up with an SKEY(surrogate key) in case of an late arriving dimension.
6)In the expression dimension
As you can observe in the expression port, we have an out_prod_SKEY where in the following condition is defined “IIF(ISNULL(product_skey),product_seq_NEXTVAL,product_skey)“
where in case the SKEY is NULL we use the sequencer’s value to generate an SKEY.
6)Now we have to targets in the flow:
i)We use a filter transformation to find the product_SKEY is null and load it into the dimension table with the value from the sequence generator , since the product dimension is an Insert else Update, whenever the dimension arrives the values will be updated.
ii)Now since we have an SKEY for the missing or late dimension , we now load the data into the sales fact table.
This will solve the late dimension issue . Thanks for reading !!!!!!
In case of any queries please do comment…..
June 1, 2013