Something about DataStage, DataStage Administration, Job Designing,Developing, DataStage troubleshooting, DataStage Installation & Configuration, ETL, DataWareHousing, DB2, Teradata, Oracle and Scripting.
Tuesday, October 29, 2013
Monday, October 28, 2013
DataStage Scenario - Problem2
Goal : To get the Unique and Duplicates values from Input Data
Input :
There is a input file which contains duplicates data, Suppose :Friday, October 25, 2013
DataStage Scenario - Problem1
Goal :
Process a text file which contains records arranged in blocks consisting of a header record, details (items, body) and a trailer. The aim is to normalize records and load them into a relational database structure.Details :
Thursday, October 24, 2013
Interview Questions : DataWareHouse - Part 3
What is data warehousing?
"In computing, a data warehouse (DW) is a database used for reporting and analysis. The data stored in the warehouse is uploaded from the operational systems. The data may pass through an operational data store for additional operations before it is used in the DW for reporting.
A data warehouse maintains its functions in three layers: staging, integration, and access.
Staging is used to store raw data for use by developers. The integration layer is used to integrate data and to have a level of abstraction from users. The access layer is for getting data out for users.
The term Data Warehouse was coined by Bill Inmon in 1990, which he defined in the following way:
Tuesday, October 22, 2013
Adding background colors or background images to emails and other Notes documents - Part 2
To add a reusable graphic from an application
- While creating or editing a document, click Text > Text Properties.
- In the properties box, click the arrow next to Text and select Document.
- Click the Background tab.
- Enter the name of the graphic in the text field in the Graphic or Resource section, or use the folder button to select a graphic.
Friday, October 18, 2013
WebSphere Application Server[WAS] fails to start in Linux
The application server fails to start after system is restarted. No messages are generated in the application server logs.
The Metadata server startup script fails to finish. You
must issue the nohup command for the Metadata server
startup script.
Check to ensure that WebSphere Application Server is running.
To connect to the application server
using an Internet browser:
- Start your Internet browser.
- Type the application server address in the form: http//isserver:portnumber, where isserver is the name of the computer where the services tier is installed or its tcp/ip address, and portnumber is the port for connecting to the services tier (by default, 9080).
Thursday, October 17, 2013
How to install Linux / UNIX *.tar.gz tarball files
tar.gz also known as tarball, an archive format for electronic data and software. Most Linux tarball contains a source code for software. If you are new to Linux I recommend using apt-get, rpm and yum command to install all binary packages.
Tarballs are a group of files in one file. Tarball files have the extension .tar.gz, .tgz or .tar.bz2. Most open source software use tarballs to distribute programs/source codes.
Tuesday, October 15, 2013
How many places u can call Routines?
Routines can be called at the following places:
a) In the job properties There is an option to call the Before and After job subroutines.
b) In the job sequence there is an activity called "Routine Activity".From there also the routines could be called.
c) In the derivation part of theTransformer of a parallel job "parallel routines"can be called.
d) In the derivation part of theTransformer of a server job "server routines"can be called.
f)In the server job stages also before and after job subroutines can be called.
g)We have called routines from user variable activity of a sequence as well.(Here basically transforms are called which return a value to the variable of the user variable activity after their execution.)
Friday, October 11, 2013
Difference between OLTP and OLAP
OLTP:
Online Transactional Processing databases are functional orientated, they are designed to provide real-time responses from concurrent users and applications. To be more specific, OLTP databases must provide real-time concurrent (multi-threaded) processing of all SQL transaction (writes/updates and reads). Another characteristic of an OLTP database, is the fact that its state (underlying data) is constantly changing. Examples of OLTP are databases that support e-commerce applications.
OLTP databases are highly Normalized relational databases. This means that there is very little or no data redundancy. This ensures data consistency (part of the ACID standard). Normalization is the process of arranging data into logical, organized groups of tables, reducing data repetition or going so far as to completely eliminating it. As a result the data is logically grouped into tables, and these tables form relationships with one another through the use of primary and foreign keys. There are different levels of normalization and OLTP data models usually meet the 3rd Normal Form also known as the Entity Attribute Relationship Model.
Labels:
Concept
,
database
,
DataWareHouse
,
difference
,
form
,
normal
,
OLAP
,
OLTP
Monday, October 07, 2013
Create a unique counter in datastage
This entry describes various ways of creating a unique counter in DataStage jobs.
A parallel job has a surrogate key stage that creates unique IDs, however it is limited in that it does not support conditional code and it may be more efficient to add a counter to an existing transformer rather than add a new stage.
In a server job there are a set of key increment routines installed in the routine SDK samples that offer a more complex counter that remembers values between job executions.
The following section outlines a transformer only technique.
Tuesday, October 01, 2013
Some Windows Shortcuts
1. CTRL+C (Copy)
2. CTRL+X (Cut)
3. CTRL+V (Paste)
4. CTRL+Z (Undo)
5. DELETE (Delete)
6. SHIFT+DELETE (Delete the selected item permanently without placing the item in the Recycle Bin)
7. CTRL while dragging an item (Copy the selected item)
8. CTRL+SHIFT while dragging an item (Create a shortcut to the selected item)
9. F2 key (Rename the selected item)
2. CTRL+X (Cut)
3. CTRL+V (Paste)
4. CTRL+Z (Undo)
5. DELETE (Delete)
6. SHIFT+DELETE (Delete the selected item permanently without placing the item in the Recycle Bin)
7. CTRL while dragging an item (Copy the selected item)
8. CTRL+SHIFT while dragging an item (Create a shortcut to the selected item)
9. F2 key (Rename the selected item)
Subscribe to:
Posts
(
Atom
)