DataStage Interview Questions & Answers

  • Question 1. Explain Data Stage?

    Answer :

    A data stage is simply a tool which is used to design, develop and execute many applications to fill various tables in data warehouse or data marts.Learn more about DataStage in this insightful blog post now.

  • Question 2. Tell How A Source File Is Populated?

    Answer :

    We can generate a source file in various ways such as by making a SQL query in Oracle, or  by using row generator extract tool etc.

  • Oracle 10g Interview Questions

  • Question 3. Write The Command Line Functions To Import And Export The Ds Jobs?

    Answer :

    To signify the DS jobs, dsimport.exe is used and to export the DS jobs, dsexport.exe is used.

  • Question 4. Differentiate Between Datastage 7.5 And 7.0?

    Answer :

    In Datastage 7.5 various new stages are added for more sturdiness and smooth performance, such as Procedure Stage, Command Stage,etc.

  • Oracle 10g Tutorial

  • Question 5. Explain Merge?

    Answer :

    Merge means to merge two or more tables. The two tables are merged on the origin of Primary key columns in both the tables.Interested in learning DataStage? Well, we have the in-depth DataStage Courses to give you a head start in your career.

  • Shell Scripting Interview Questions

  • Question 6. Differentiate Between Data File And Descriptor File?

    Answer :

    As the name says, data files contains the data and the descriptor file contains the information about the data in the data files.

  • Question 7. Differentiate Between Data Stage And Informatica?

    Answer :

    In datastage, there is a perception of separation, parallelism for node configuration. While, there is no perception of separation and parallelism in informatica for node configuration. Also, Informatica is more scalable than Datastage. Datastage is more easy to use as compared to Informatica.

  • Shell Scripting Tutorial Microstrategy Interview Questions

  • Question 8. Explain Routines And Their Types?

    Answer :

    Routines are basically group of functions that is described by DS manager. It can be called through transformer stage. Routines are of three types such as, parallel routines, server routines and main frame routines.

  • Question 9. How Can We Write Parallel Routines In Data Stage Px?

    Answer :

    We can mention parallel routines in C or C++ compiler. Such routines are also developed in DS manager and can be called from transformer stage.

  • Informatica Interview Questions

  • Question 10. What Is The Procedure Of Removing Duplicates, Without The Remove Duplicate Stage?

    Answer :

    Duplicates can be detached by using Sort stage. We can use the opportunity, as allow duplicate = false.

  • Microstrategy Tutorial

  • Question 11. What Steps Should Be Taken To Recover Datastage Jobs?

    Answer :

    In order to recover presentation of Datastage jobs, we have to first create the baselines. Secondly, we should not use only one flow for presentation testing. Thirdly, we should work in growth. Then, we should appraise data skews. Then we should separate and solve the problems, one by one. After that, we should allocate the file systems to take away bottlenecks, if any. Also, we should not embrace RDBMS in start of testing phase. Last but not the least, we should understand and evaluate the available tuning knobs.

  • IBM Cognos Interview Questions

  • Question 12. Compare And Contrast Between Join, Merge And Lookup Stage?

    Answer :

    All the three are dissimilar from each other in the way they use the memory storage, compare input necessities and how they treat various data . Join and Merge needs minimum memory as compared to the Lookup stage.

  • Oracle 10g Interview Questions

  • Question 13. Describe Quality Stage?

    Answer :

    Quality stage is also called as Integrity stage. It assists in integrating various types of data from different sources.

  • Informatica Tutorial

  • Question 14. Describe Job Control?

    Answer :

    Job control can be best performed by using Job Control Language (JCL). This tool is used to execute various jobs concurrently, without using any kind of loop.

  • Question 15. Contrast Between Symmetric Multiprocessing And Massive Parallel Processing?

    Answer :

    In Symmetric Multiprocessing, the hardware resources are communal by processor. The processor has one operating system and it communicates through shared memory. While in Massive Parallel processing, the CPU contact the hardware resources completely. This type of processing is also called as Shared Nothing, as nothing is common in this. It is quicker than the Symmetric Multiprocessing.

  • Data Warehouse ETL Toolkit Interview Questions

  • Question 16. Write The Steps Required To Kill The Job In Datastage?

    Answer :

    To destroy the job in Datasatge, we have to kill the individual processing ID.

  • IBM Cognos Tutorial

  • Question 17. Contrast Between Validated And Compiled In The Datastage?

    Answer :

    In Datastage, validating a job means, executing a job. While validating, the Datastage engine checks whether all the necessary properties are given or not. In other case, while compiling a job, the Datastage engine checks that whether all the given property are suitable or not.

  • Teradata Interview Questions

  • Question 18. How We Can Run Date Conversion In Datastage?

    Answer :

    We can use date conversion function for this reason i.e. Oconv (Iconv(Filedname,”Existing Date Format”),”Another Date Format”).

  • Shell Scripting Interview Questions

  • Question 19. What Is The Need Of Exception Activity In Datastage?

    Answer :

    All the stages after the exception activity in Datastage are run in case of any unfamiliar error occurs while executing the job sequencer.Learn how the DataStage Training Videos  can take your career to the next level!

  • Data Warehouse ETL Toolkit Tutorial

  • Question 20. Explain Apt_config In Datastage?

    Answer :

    It is the environment variable which is used to recognize the *.apt file in Datastage. It is also used to keep the node information, scratch information and disk storage information.

  • PL/SQL Interview Questions

  • Question 21. Write The Different Types Of Lookups In Datastage?

    Answer :

    There are two types of Lookups in Datastage i.e. Normal lookup and Sparse lookup.

  • Question 22. How We Can Covert Server Job To A Parallel Job?

    Answer :

    We can convert a server job in to a parallel job by using Link Collector and IPC Collector.

  • Teradata Tutorial

  • Question 23. Explain Repository Tables In Datastage?

    Answer :

    In Datastage, the Repository is second name for a data warehouse. It can be federalized as well as circulated.

  • Oracle 11g Interview Questions

  • Question 24. Describe Oconv () And Iconv () Functions In Datastage?

    Answer :

    In Datastage, OConv () and IConv() functions are used to convert formats from one format to another i.e. conversions of time, roman numbers, radix, date, numeral ASCII etc. IConv () is mostly used to change formats for system to understand. While, OConv () is used to change formats for users to understand.

  • Microstrategy Interview Questions

  • Question 25. Define Usage Analysis In Datastage?

    Answer :

    In Datastage, Usage Analysis is done within few clicks. Launch Datastage Manager and right click on job. Then, select Usage Analysis.

  • Oracle 11g Tutorial

  • Question 26. How We Can Find The Number Of Rows In A Sequential File?

    Answer :

    To find rows in chronical file, we can use the System variable @INROWNUM.

  • Abinitio Interview Questions

  • Question 27. Contrast Between Hash File And Sequential File?

    Answer :

    The only dissimilarity between the Hash file and Sequential file is that the Hash file stores data on hash algorithm and on a hash key value, while sequential file doesn’t have any key value to save the data. Hence we can say that hash key feature, searching in Hash file is faster than in sequential file.

  • Informatica Interview Questions

  • Question 28. How We Can Clean The Datastage Repository?

    Answer :

    We can clean the Datastage repository via the Clean Up Resources functionality in the Datastage Manager.

  • Question 29. How We Can Called Routine In Datastage Job?

    Answer :

    We can call a routine from the transformer stage in Datastage job.

  • TeraData DBA Interview Questions

  • Question 30. Differentiate Between Operational Datastage (ods) And Data Warehouse?

    Answer :

    We can say, ODS is a small data warehouse. An ODS doesn’t have information for more than 1 year while a data warehouse have detailed information about the entire business.

  • Question 31. For What Nls Stand For In Datastage?

    Answer :

    NLS stand for National Language Support. It can be used to integrate various languages such as French, German, and Spanish etc. in the data, requisite for processing by data warehouse.

  • Question 32. Can You Explain How Could Anyone Crash The Index Before Loading The Data In Target In Datastage?

    Answer :

    In Datastage, we can crash the index before loading the data in target by using the Direct Load functionality of SQL Loaded Utility.

  • Talend Interview Questions

  • Question 33. Does Datastage Support Gradually Changing Dimensions ?

    Answer :

    Yes,Version 8.5 + supports this feature in datastage.

  • IBM Cognos Interview Questions

  • Question 34. How Complicated Jobs Are Implemented In Datstage To Recover Performance?

    Answer :

    In order to recover performance in Datastage, it is suggested, not to use more than 20 stages in every job. If you need to use more than 20 stages then it is advisable to use next job for those stages.

  • Question 35. Name The Third Party Tools That Can Be Used In Datastage?

    Answer :

    The third party tools that can be used in Datastage, are Autosys, TNG and Event Co-ordinator.

  • Question 36. Describe Project In Datastage?

    Answer :

    When ever we begin the Datastage client, we are asked to join to a Datastage project. A Datastage project have Datastage jobs, built-in apparatus and Datastage Designer or User-Defined components.

  • Data Warehouse ETL Toolkit Interview Questions

  • Question 37. What Types Of Hash Files Are There?

    Answer :

    There are two types of hash files in which are Static Hash File and Dynamic Hash File.

  • Question 38. Describe Meta Stage?

    Answer :

    In Datastage, MetaStage is used to store metadata that is beneficial for data lineage and data analysis.

  • Question 39. Why Unix Environment Is Useful In Datastage?

    Answer :

    It is useful in Datastage because sometimes one has to write UNIX programs such as batch programs to raise batch processing etc.

  • Question 40. Contrast Between Datastage And Datastage Tx?

    Answer :

    Datastage is a tool from ETL i.e. Extract, Transform and Load and Datastage TX is a tool from EAI i.e. Enterprise Application Integration.Learn more about the ETL process in this insightful blog now.

  • Teradata Interview Questions

  • Question 41. What Is Size Of A Transaction And An Array Means In A Datastage?

    Answer :

    Transaction size means the number of row written before committing the account in a table. An array size means the number of rows written/read to or from the table respectively.

  • Question 42. Name The Various Types Views In A Datastage Director?

    Answer :

    There are three types of views in a Datastage Director i.e. Log View, Job View and Status View.

  • PL/SQL Interview Questions

  • Question 43. What Is The Use Of Surrogate Key?

    Answer :

    Surrogate key is mostly used for getting data faster. It uses catalog to perform the retrieval operation.

  • Question 44. How Discarded Rows Are Processed In Datastage?

    Answer :

    In the Datastage, the discarded rows are managed by constraints in transformer. We can either place the discarded rows in the properties of a transformer or we can create a brief storage for discarded rows with the help of REJECTED command.

  • Question 45. Contrast Between Odbc And Drs Stage?

    Answer :

    DRS stage is faster than the ODBC stage because it uses local databases for connectivity.

  • Question 46. Describe Orabulk And Bcp Stages?

    Answer :

    Orabulk stage is used to store big amount of data in one target table of Oracle database. The BCP stage is used to store big amount of data in one target table of Microsoft SQL Server.

  • Question 47. Describe Ds Designer?

    Answer :

    The DS Designer is used to make work area and add many links to it.

  • Question 48. What Is The Need Of Link Partitioner And Link Collector In Datastage?

    Answer :

    In Datastage, Link Partitioner is used to split data into various parts by certain partitioning methods. Link Collector is used to collect data from many partitions to a single data and save it in the target table.