Posts

Abinitio advance

Why should you complete this course?

Records are the units that components use to process data. Data is read from a data source (a file, a database table, a queue, and so on) as a stream of bytes. Components use the record format on their in ports to parse these bytes into meaningful units of data -- that is, records.

A record is a collection of fields. Each field in the record can be a different data type and size. Some of the fields themselves might be records; these are called "nested records" or "subrecords".

When you design a graph, you write record formats for the ports of components in the graph. These record formats simply define record types that apply to the data on those ports.

Here are the contents of the stores.dml file that you'll use in this course:

record
  decimal(4) store_no = 000;
  string("\0") store_manager = "";
  string("\0") address = "";
  string("\307") city = "";
  string(2)
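To make the idea of "parsing a stream of bytes into records" concrete, here is a minimal Python sketch. It is not Ab Initio code and not how the Co>Operating System works internally; it only mimics the idea. It models just the four fields of stores.dml that are fully visible above, assumes that "\307" means the single byte with octal value 307 (0xC7), and uses made-up sample data. The names FIELDS and parse_records are hypothetical.

```python
# Hypothetical, trimmed version of the stores.dml layout: only the four
# fields that are fully visible in the excerpt are modelled here.
FIELDS = [
    ("store_no", "fixed", 4),             # decimal(4): four bytes of digits
    ("store_manager", "delim", b"\x00"),  # string("\0"): ends at a NUL byte
    ("address", "delim", b"\x00"),        # string("\0"): ends at a NUL byte
    ("city", "delim", b"\xc7"),           # string("\307"): assumed octal 307
]


def parse_records(data: bytes):
    """Split a byte stream into records by walking the field list."""
    records = []
    pos = 0
    while pos < len(data):
        record = {}
        for name, kind, spec in FIELDS:
            if kind == "fixed":
                # Fixed-width field: take exactly `spec` bytes.
                raw = data[pos:pos + spec]
                pos += spec
                record[name] = int(raw.decode("ascii"))
            else:
                # Delimited field: read up to (not including) the delimiter.
                end = data.index(spec, pos)  # ValueError if delimiter missing
                record[name] = data[pos:end].decode("latin-1")
                pos = end + len(spec)
        records.append(record)
    return records


# Made-up sample data: two records laid out back to back.
sample = (
    b"0001Ann Lee\x0012 Main St\x00Boston\xc7"
    b"0042Raj Rao\x005 Hill Rd\x00Chennai\xc7"
)
stores = parse_records(sample)
print(stores)
```

The point of the sketch is that the same bytes mean nothing until a record format says where each field starts and ends: one field is fixed-width, the others run until their delimiter.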

UNIX

1. What is ls -ltra

ls lists the files in the current working directory, but if another directory is specified, then ls will list the files there, and in fact the user may specify any list of files and directories to be listed.

Options:
a) -l : long format, displaying Unix file types, permissions, number of hard links, owner, group, size, last-modified date and filename
b) -t : sort the list of files by modification time (newest first)
c) -a : lists all files in the given directory, including those whose names start with "." (which are hidden files in Unix). By default, these files are excluded from the list.
d) -r : reverses the sorting order (combined with -t, the oldest files come first)

2. How to rename a file abc.dat to xyz.dat

$ mv abc.dat xyz.dat

3. Create a file with time stamp 2015-jan-15

$ touch -t 1501150808 xyz.dat

4. How to create empty files abc.dat and xyz.dat

$ touch abc.dat
$ touch xyz.dat

5. Remove
a) All files in current directory (the * pattern does not match hidden files, whose names start with ".")
$ rm *
b) All files in current directory (ask confirma
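As a rough illustration of what the -t, -r and -a options from question 1 and the touch -t command from question 3 do, here is a small Python sketch. It is an approximation, not a reimplementation of ls: it returns only the names in the order ls -tra would show them, and os.scandir does not report the "." and ".." entries that ls -a prints. The helper names list_ltra and touch, and the file .hidden, are made up for the example.

```python
import os
import tempfile
from datetime import datetime


def list_ltra(directory):
    """Names in `directory`, hidden ones included, oldest modification first.

    Mirrors the ordering of `ls -tra`; unlike `ls -a`, os.scandir does not
    report the "." and ".." entries.
    """
    entries = list(os.scandir(directory))
    # -t sorts newest first, -r reverses it, so sort ascending by mtime.
    entries.sort(key=lambda entry: entry.stat().st_mtime)
    return [entry.name for entry in entries]


def touch(path, when=None):
    """Create `path` if needed and optionally set its timestamps."""
    with open(path, "a"):
        pass
    if when is not None:
        os.utime(path, (when, when))  # (access time, modification time)


with tempfile.TemporaryDirectory() as workdir:
    # 2015-01-15 08:08 local time, the value given to `touch -t 1501150808`.
    jan_15 = datetime(2015, 1, 15, 8, 8).timestamp()
    touch(os.path.join(workdir, "xyz.dat"), jan_15)
    touch(os.path.join(workdir, ".hidden"), jan_15 + 60)
    touch(os.path.join(workdir, "abc.dat"), jan_15 + 120)
    listing = list_ltra(workdir)
    print(listing)
```

The hidden file appears in the listing because nothing filters names starting with ".", which is the effect of -a; dropping that file from the result would correspond to running ls -ltr.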

ETL

Abinitio Scenarios

1. Read each value and find the sum of its digits.

Input
    12345
    1234
    123
    12
    1
Output
    15
    10
    6
    3
    1
Code - Input File --> Reformat --> Output File

2. Find the sum of runs in each over.

Input
Output
Code - Input File --> Rollup with key Change option --> Output File

3. Concatenate the over value according to the given runs.

Input
Output
Code - Input File --> Reformat --> Output File

4.

Input :
    chennai    bangalore  300
    bangalore  chennai    300
    chennai    bangalore  300
    chennai    trichy     200
    trichy     chennai    200
Output :
    chennai    trichy     200
    bangalore  chennai    300
Code - Input File --> Reformat --> Sort --> Dedup Sort --> Output File

Input File dml -
    record
        string(" ") city_1;
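The logic behind scenarios 1 and 4 can be sketched outside Ab Initio in a few lines of Python. This is only an illustration of what the Reformat, Sort and Dedup Sort steps might compute, not the graph itself. It assumes that scenario 4 treats a route and its reverse as the same route, which is what the sample output suggests. The function names digit_sum and unique_routes are hypothetical, and the rows come out in sort-key order, which may differ from the order shown in the sample output.

```python
def digit_sum(value: str) -> int:
    """Scenario 1: add up the digits of one input value."""
    return sum(int(ch) for ch in value.strip())


def unique_routes(rows):
    """Scenario 4: treat A->B and B->A as one route and keep a single row."""
    # Reformat step: put the two city names in a fixed (alphabetical) order.
    normalized = [tuple(sorted((a, b))) + (fare,) for a, b, fare in rows]
    # Sort step: bring identical rows next to each other.
    normalized.sort()
    # Dedup step: keep a row only if it differs from the previous one.
    result = []
    for row in normalized:
        if not result or result[-1] != row:
            result.append(row)
    return result


values = ["12345", "1234", "123", "12", "1"]
sums = [digit_sum(v) for v in values]
print(sums)

routes = [
    ("chennai", "bangalore", 300),
    ("bangalore", "chennai", 300),
    ("chennai", "bangalore", 300),
    ("chennai", "trichy", 200),
    ("trichy", "chennai", 200),
]
deduped = unique_routes(routes)
print(deduped)
```

Normalizing the city pair before sorting is what lets a plain "drop adjacent duplicates" pass remove both exact repeats and reversed routes in one go.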