bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizbigdata.docx  · web viewpipe. decision....

99
Question1. Which of the following is a monitoring solution for hadoop? 1. Sirona 2. Sentry 3. Slider 4. Streams Question2. __________ is a distributed machine learning framework on top of spark 1. MLlib 2. Spark Streaming 3. GraphX 4. RDDs Question3. Point out the correct statement? 1. Knox is a stateless reverse proxy framework 2. Knox also intercepts REST/HTTP calls and provides authentication 3. Knox scales linearly by adding more knox nodes as the load increases 4. All of the mentioned

Upload: duongliem

Post on 11-Aug-2018

221 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question1. Which of the following is a monitoring solution for hadoop?

1. Sirona2. Sentry3. Slider4. Streams

Question2. __________ is a distributed machine learning framework on top of spark

1. MLlib2. Spark Streaming3. GraphX4. RDDs

Question3. Point out the correct statement?

1. Knox is a stateless reverse proxy framework2. Knox also intercepts REST/HTTP calls and provides

authentication3. Knox scales linearly by adding more knox nodes as the

load increases4. All of the mentioned

Question4. PCollection, PTable, and PGroupedTable all support a __________ operation.

1. Intersection2. Union3. OR

Page 2: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. None of the mentioned

Question 5. How many types of mode are present in Hama?1. 2

2. 3

3. 4

4. 5

Question6. The IBM ____________ Platform provides all the foundational building blocks of trusted information, including data integration, data warehousing, master data management, big data and information governance.

1. Infostream2. Infosphere3. Infosurface4. Infodata

Question7. ________ is the name of the archive you would like to create.

1. Archive2. Archive name3. Name4. None of the mentioned

Question 8. Ambari provides a _______API that enables integration with existing tools, such as Microsoft System Center.

Page 3: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. Restless2. Web services3. Restful4. None of the mentioned

Question9. _______ forge software for the development of software projects.

1. Oozie2. Allura3. Ambari4. All of the mentioned

Question10. Posting format now uses a __________ API when writing postings just like doc values.

1. Push2. Pull3. Read4. All of the mentioned

Question11. Point out the correct statement

1. Building Pylucene requires CNU make, a recent version of ant capable of building java lucene and a c++ compiler

2. Pylucene is supported on Mac OS X, linux, SOlaries and windows

3. Use of the setuptools is recommended for lucene4. All the mentioned

Page 4: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

5. Question12. ________ builds virtual machines of branches trunk and 0.3 for KVM, VMWare and virtual box.

1. Bigtop-trunk-pakagetest2. Bigtop-trunk-repository3. Bigtop-VM-matrix4. None of the mentioned

Question13. Zookeeper is used for configuration, leader election in cloud edition of

1. Solr2. Solur3. Solar1014. Solr

Question14. How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of Mapreduce?

1. Keys are presented to reducer in sorted order; values for a given key are not sorted

2. Keys are presented to reducer in sorted order; values for a given key are sorted in ascending order

3. Keys are presented to reducer in random order; values for a given key are not sorted

4. Keys are presented to reducer in random order; values for a given key are sorted in ascending order

Question15. Datastage RTI is real time integration pack for:

Page 5: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. STD2. ISD3. EXD4. None of the above

Question16. Which mapreduce stage serves as a barrier, where all the previous stages must be completed before it may proceed?

1. Combine2. Group (a.k.a. ‘shuffle’)3. Reduce4. Write

Question17. Which of the following format is more compression aggressive?

1. Partition compressed2. Record compressed3. Block compressed4. Uncompressed

Question18. _________ is the way of encoding structured data in an efficient yet extensible format.

1. Thrift2. Protocol buffers3. Avro4. None of the above

Page 6: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question19. Which of the following argument is not supported by import-all-table tool?

1. Class name2. Package name3. Database name4. Table name

Question20. Which of the following operating system is not supported by big top?

1. Fedora2. Solaris3. Ubuntu4. SUSE

Question21. Distributed modes are mapped in the _____ file.

1. Groomservers2. Grervers3. Grsvers4. Groom

Question22. ________ is the architectural center of hadoop that allows multiple data processing engines.

1. YARN2. Hive3. Incubator4. Chuckwa

Page 7: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question23. Users can easily run spark on top of amazons____________

1. ‘infosphere2. ‘EC23. EMR4. None of the above

Question24. Which of the following projects is interface definition language for hadoop?

1. Oozie2. Mahout3. Thrift4. Impala

Question25. Output of the mapper is first written on the local disk for sorting and _____ process.

1. Shuffling2. Secondary sorting3. Forking4. Reducing

Question26. HDT projects work with eclipse version _____ and above

1. 3.42. 3.53. 3.64. 3.7

Page 8: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question27. Which of the following language is not supported by spark?

1. Java2. Pascal3. Scala4. Python

Question28. Data analytics scripts are written in __________

1. Hivw2. CQL3. Piglatin4. Java

Question29. Ripper is a browser based mobile phone emulator designed to aid in the development of ______ bases mobile application.

1. Javascript’2. Java3. C++4. HTML5

Question30. If you set the inline LOB limit to ____, all large objects will be placed in external storage.

1. 02. 13. 24. 3

Page 9: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question31. Hadoop archives reliability by replicating the data across multiple hosts, and hence does not require _____ storage on hosts.

1. RAID2. Standard RAID levels3. ZFS 4. Operating system

Question32. The configuration file must be owned by the user running

1. Data manager2. Node manager3. Validation manager4. None of the above

Question33. ________ is non blocking a synchronous event driven high performance web framework

1. AWS2. AWF3. AWT4. ASW

Question34. Falcon provides seamless integration with

1. HCatalog2. Metastore3. HBase4. Kafka

Page 10: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question35. One supported datatype that deserves special mention are:

1. Money2. Counters3. Smallint4. Tinyint

Question36. _______ are chukwa processes that actually produce data

1. Collectors2. Agents3. Hbase table4. HCatalog

Question37. Which of the following hadoop file formats is supported by impala?

1. Sequencefile’2. Avro3. Rcfile4. All of the above

Question38. Avro is said to be the future ___________ layer of hadoop

1. RMC2. RPC3. RDC4. All of the above

Page 11: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question39. ______ nodes are the mechanism by which a workflow triggers the execution of a computation/processing task

1. Server2. Client3. Mechanism4. Action

Question40. The _______ attribute in the join node is the name of the workflow join node

1. Name2. To3. Down4. All of the above

Question41. Yarn commands are invoked by the _____ script

1. Hive2. Bin3. Hadoop4. Home

Question42. Which of the following function is used to read data in PIG?

1. Write2. Read3. Load4. None of the above

Page 12: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question43. Which of the following hive commands is not supported by hcatalog?

1. Alter index rebuild2. Create new3. Show functions4. Drop table

Question44. Apache hadoop development tools is an effort undergoing incubation at

1. ADF2. ASF3. HCC4. AFS

Question45. Kafka users key value pairs in the _________ file format for configuration

1. RFC2. Avro3. Property4. None of the above

Question46. Facebook tackles big data with __________ based in hadoop

1. Project prism2. Prism3. Project big4. Project data

Page 13: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question47. The size of block in HDFCs is

1. 512 bytes2. 64 mb3. 1024 kb4. None of the above

Question48. Which is the most popular NoSQL databases for scalable big data store with hadoop?

1. Hbase2. mongoDB3. Cassandra4. None of the above

Question 49. A ________- can route requests to multiple knox instances

1. Collector2. Load balancer3. Comparator4. All of the above

Question50. Hcatalog is installed with hive, starting with hive release

1. 0.10..02. 0.9.03. 0.11.04. 0.12.0

Question51. Table metadata in hive is:

Page 14: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. Stored as metadata on the name node2. Stored along with the data in HDFCs3. Stored in the metastore4. Stored in zookeeper

Question52. Avro schemes are defined with ________

1. JSON2. XML3. JAVA4. All of the above

Question53. Spark was initially started by ___________ at uc Berkeley AMPlab in 2009

1. Matei Zaharia2. Mahek Zaharia3. Doug cutting4. Stonebreaker

Question54. __________ does rewrite data and pack rows into column for certain time periods

1. Open TS2. Open TSDB3. Open TSD4. Open DB

Question55. Which of the following phrases occur simultaneously

1. Shuffle and sort

Page 15: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

2. Reduce and sort3. Shuffle and map4. All of the above

Question56. ________ command fetches the contents of row or a cell

1. Select2. Get3. Put4. None of the above

Quesiotn57. _______ are encoded as a series of blocks

1. Arrays2. Enum3. Unions4. Maps

Question58. Hive also support custom extensions written in

1. C#2. Java3. C4. C++

Question59. How many types of nodes are present in storm cluster?

1. 12. 23. 3

Page 16: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. 4

Question60. All decision nodes must have a ____________ element to avoid bringing the workflow into an error state if none of the predicates evaluates to true.

1. Name2. Default3. Server4. Client

Question61. ________ is a rest API for Hcatalog

1. Web hcat2. Wbhcat3. Inphcat4. None of the above

Question62. Streaming supports streaming commands option as well as ____________ command options

1. Generic2. Tool3. Library4. Task

Questio63. By default collectors listen on port

1. 80082. 80703. 80804. None of the above

Page 17: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question64. _______ communicate with the client and handle data related operations.

1. Master server2. Region server3. Htable4. All of the above

Question65. We can declare the scheme of our data either in ________ file

1. JSON2. XML3. SQL4. VB

Question66. ________ provides a couchbase server hadoop connector by means of sqoop

1. Memcache2. Couchbase3. Hbase4. All of the above

Question67. Storm integrates with _________ via apache slider

1. Scheduler2. Yarn3. Compaction4. All of the above

Page 18: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question68. Avro-backed table can simply be created by using ___________ in a DDL statement

1. Stored as avro2. Stored as hive’3. Stored as avrohive4. Stored as serd

Question69. Drill analyze semistructured/nested data coming from ______applications

1. RDBMS2. NoSQL3. newSQL4. none of the above

Question70. The hadoop list includes the HBase Database, the apache mahout __________ system and matrix operations.

1. Machine learning2. Pattern recognition3. Statistical classification4. Articficial classification

Question71. Oozie workflow jobs are directed _______ graphs of actions

1. Acyclical2. Cyclical3. Elliptical

Page 19: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. All of the above

Question72. ___ is an open source SQL query engine for apache Hbase

1. Pig2. Phoenix3. Pivot’4. None of the above

Question73. $ pig x tez_local will enable _____ mode in pig

1. Mapreduce2. Tez3. Local4. None of the above

Question74. In comparison to SQl, pig uses

1. Lazy evaluation2. ETL3. Supports pipelines splits4. All of the above

Question75. For Apache _________ users, storm utilizes the same ODBC interfaces

1. C takes2. Hive3. Pig4. Oozie

Page 20: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question76. In one or more actions started by the workflow job are executed when the _________ node is reached, the actions will be killed.

1. Kill’2. Start3. End4. Finish

Question77. Which of the following data type is supported by hive?

1. Map2. Record3. String4. Enum

Question78. Hcatalog supports reading and writing files in any format for which a _____ can be written

1. SerDE2. SaerDear3. Doc Sear4. All

Question79. _______ is python port of the core project

1. Solr2. Lucene core3. Lucy4. Pylucene

Page 21: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question80. Apache storm added open source, stream data processing to _________ data platform

1. Cloudera2. Hortonworks3. Local cloudera4. Map R

Question81. Which of the following is spatial information system?

1. Sling2. Solr3. SIS4. All of the above

Question82. _______ properties can be overridden by specifying them in a job-xml file or configuration element.

1. Pipe2. Decision3. Flag4. None of the above

Question83. CDH process and control sensitive data and facilities:

1. Multi-tenancy2. Flexibility3. Scalability4. All of the above

Page 22: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Qyestion84. Avro supports _________ kinds of complex types

1. 32. 43. 64. 7

Question85. With _________we can store data and read it easily with various programming languages.

1. Thrift2. Protocol buffers3. Avro4. None of the above

Question86. A float parameter defaults to 0.0001f, which means we can deal with 1 error every ________ rows

1. 10002. 100003. 1 million rows4. None of the above

Question87. The ________ data mapper framework makes it easier to use a database with Java or.NET applications

1. iBix2. Helix3. iBATIS4. iBAT

Page 23: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question88. ___________ is the most popular high level java API in Hadoop Ecosystem

1. scalding2. HCatalog3. Cascalog4. Cascading

Question89. Spark includes a collection over _________ operations for transforming data and familier data frame APIs for manipulating semi-structured data

1. 502. 603. 704. 80

Question90. Zookeper’s architecture supports high ________ through redundant services

1. Flexibilty’2. Scalability3. Availability4. Interactivity

Question91. The Lucene ____________ is pleased to announce the availability of Apache Lucene 5.0.0 and Apache solr 5.0.0

1. PMC2. RPC

Page 24: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

3. CPM4. All of the above

Question92. EC2 capacity can be increased or decreased in real time from as few as one to more than ________ virtual machines simultaneousl

1. 10002. 20003. 30004. None of the above

Question93. HTD has been tested on_________- and Juno. And can work 0n kepler as well

1. Raibow2. Indigo3. Idiavo4. Hadovo

Question94. Each kafka partition has one server which acts as the __________

1. Leaders2. Followers3. Staters4. All of the above

Question95. The right numbers of reduces seems to be

1. 0.92. 0.8

Page 25: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

3. 0.364. 0.95

Question96. Which of the following is a configuration management system?

1. Alex2. Puppet3. Acem4. None of the above

Question97. Which of the following is the only for storage with limited compute?

1. Hot2. Cold3. Warm4. All_SSD

Question98. Grooms servers start up with a _______ instance and a RPC proxy to contact the bsp master

1. RPC2. BSP Peer3. LPC4. None of the above

Question99. A ________ represents a distributed, immutable collection of elements of type t.

1. Pcollect2. Pcollection

Page 26: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

3. Pcol4. All of the above

Question100. ________ is used to read data from bytes buffers

1. Write{}2. Read{}3. Readwrite{}4. All of the above

Q101-Which is the default Input Formats defined in Hadoop ?

1. SequenceFileInputFormat

2. ByteInputFormat

3. KeyValueInputFormat

4. TextInputFormat

Q102. Which of the following is not an input format in Hadoop ?

1.TextInputFormat

2. ByteInputFormat

3. SequenceFileInputFormat

Page 27: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. KeyValueInputFormat

Q103. Which of the following is a valid flow in Hadoop ?

1.Input -> Reducer -> Mapper -> Combiner -> -> Output

2. Input -> Mapper -> Reducer -> Combiner -> Output

3. Input -> Mapper -> Combiner -> Reducer -> Output

4. Input -> Reducer -> Combiner -> Mapper -> Output

Q104. MapReduce was devised by ...

1.Apple

2. Google

3. Microsoft

4. Samsung

Q105. Which of the following is not a phase of Reducer ?

1. Map

2. Reduce

3. Shuffle

Page 28: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. Sort

Q106. How many instances of Job tracker can run on Hadoop cluster ?

1.1

2. 2

3.3

4.4

Q107. Which of the following is not the Dameon process that runs on a hadoop cluster ?

1.JobTracker

2.DataNode

3.TaskTracker

4.TaskNode

Q108-As companies move past the experimental phase with Hadoop, many cite the need for additional capabilities, including:

Page 29: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1.Improved data storage and information retrieval

2.Improved extract, transform and load features for data integration

3.Improved data warehousing functionality

4.Improved security, workload management and SQL support

Q109-Point out the correct statement :

1.Hadoop do need specialized hardware to process the data

2.Hadoop 2.0 allows live stream processing of real time data

3.In Hadoop programming framework output files are divided in to lines or records

4.None of the mentioned

Q110-. According to analysts, for what can traditional IT systems provide a foundation when they’re integrated with big data technologies like Hadoop ?

1.Big data management and data mining

2. Data warehousing and business intelligence

3.Management of Hadoop clusters

4.Collecting and storing unstructured data

Page 30: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Q111- Point out the wrong statement :

1.Hardtop’s processing capabilities are huge and its real advantage lies in the ability to process terabytes & petabytes of data

2.Hadoop uses a programming model called “MapReduce”, all the programs should confirms to this model in order to work on Hadoop platform

3.The programming model, MapReduce, used by Hadoop is difficult to write and test

4.All of the mentioned

Q112- What was Hadoop named after?

1. Creator Doug Cutting’s favorite circus act

2.Cutting’s high school rock band

3.The toy elephant of Cutting’s son

4.A sound Cutting’s laptop made during Hadoop’s development

Q113- All of the following accurately describe Hadoop, EXCEPT:

1.Open source

2. Real-time

3.Java-based

4. Distributed computing approach

Page 31: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Q114- __________ can best be described as a programming model used to develop Hadoop-based applications that can process massive amounts of data.

1.MapReduce

2.Mahout

3.Oozie

4.All of the mentioned

Q115- __________ has the world’s largest Hadoop cluster.

1.Apple

2. Datamatics

3.Facebook

4.None of the mentioned

Q116- Facebook Tackles Big Data With _______ based on Hadoop.

1.‘Project Prism’

2.‘Prism’

3.‘Project Big’

4. ‘Project Data’

Page 32: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Q 117- What is the main problem faced while reading and writing data in parallel from multiple disks?

1.Processing high volume of data faster.

2. Combining data from multiple disks.

3. The software required to do this task is extremely costly.

4. The hardware required to do this task is extremely costly.

Q118 - Under Hadoop High Availability, Fencing means

1.Preventing a previously active namenode from start running again.

2. Preventing the start of a failover in the event of network failure with the active namenode.

Page 33: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

3. Preventing the power down to the previously active namenode.

4. Preventing a previously active namenode from writing to the edit log.

Q119 - The default replication factor for HDFS file system in hadoop is

1.1

2. 2

3. 3

4. 4

Q120 - The hadfs command put is used to

1.Copy files from local file system to HDFS.

Page 34: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

2. Copy files or directories from local file system to HDFS.

3. Copy files from from HDFS to local filesystem.

4. Copy files or directories from HDFS to local filesystem.

Q121 - The namenode knows that the datanode is active using a mechanism known as

1.heartbeats

2. datapulse

3. h-signal

4. Active-pulse

Q122 - When a machine is declared as a datanode, the disk space in it

Page 35: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1.Can be used only for HDFS storage

2. Can be used for both HDFS and non-HDFs storage

3. Cannot be accessed by non-hadoop commands

4. cannot store text files.

Q123 - The data from a remote hadoop cluster can

1. not be read by another hadoop cluster

2. be read using http

3. be read using hhtp

4. be read suing hftp

Q124 - Which one is not one of the big data feature?

Page 36: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. Velocity

2. Veracity

3. volume

4. variety

Q125 - What is HBASE?

1. Hbase is separate set of the Java API for Hadoop cluster.

2. Hbase is a part of the Apache Hadoop project that provides interface for scanning large amount of data using Hadoop infrastructure.

3. Hbase is a "database" like interface to Hadoop cluster data.

4. HBase is a part of the Apache Hadoop project that provides a SQL like interface for data processing.

Page 37: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Q125 - Which of the following is false about RawComparator ?

1.Compare the keys by byte.

2. Performance can be improved in sort and suffle phase by using RawComparator.

3. Intermediary keys are deserialized to perform a comparison.

Q 126 - Zookeeper ensures that

1. All the namenodes are actively serving the client requests

2. Only one namenode is actively serving the client requests

3. A failover is triggered when any of the datanode fails.

Page 38: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. A failover can not be started by hadoop administrator.

Q 127 - Which scenario demands highest bandwidth for data transfer between nodes in Hadoop?

1. Different nodes on the same rack

2. Nodes on different racks in the same data center.

3. Nodes in different data centers

4. Data on the same node.

Q128 - The hadoop frame work is written in

1. C++

2. Python

3. Java

Page 39: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. GO

Q129 - When a client contacts the namenode for accessing a file, the namenode responds with

1. Size of the file requested.

2. Block ID of the file requested.

3. Block ID and hostname of any one of the data nodes containing that block.

4. Block ID and hostname of all the data nodes containing that block.

Q130 - Which of the following is not a goal of HDFS?

1. Fault detection and recovery

Page 40: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

2. Handle huge dataset

3. Prevent deletion of data

4. Provide high network bandwidth for data movement

Q 131 - In HDFS the files cannot be

1. read

2. deleted

3. executed

4. Archived

Q132 - The number of tasks a task tracker can accept depends on

Page 41: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. Maximum memory available in the node

2. Not limited

3. Number of slots configured in it

4. As decided by the jobTracker

Q133 - When using HDFS, what occurs when a file is deleted from the command line?

1. It is permanently deleted if trash is enabled.

2. It is placed into a trash directory common to all users for that cluster.

3. It is permanently deleted and the file attributes are recorded in a log file.

4. It is moved into the trash directory of the user who deleted it if trash is enabled.

Page 42: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Q134 - The org.apache.hadoop.io.Writable interface declares which two methods? (Choose 2 answers.)

public void readFields(DataInput).

public void read(DataInput).

public void writeFields(DataOutput).

public void write(DataOutput).

1. 1 & 4

2. 2 & 3

3. 3 & 4

4. 2 & 4

Page 43: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question135. Mapreduce has undergone a complete overhaul in hadoop?

1. 0.212. 0.233. 0.244. 0.26

Question136. __________ is the slave/worker node and holds the user data in the form of data blocks

1. Data node2. Name node3. Data block4. Replication

Qyestion137. Spark is engineered from the bottom up for performance running __________

1. 100x2. 150x3. 200x4. None of the above

Question138. _________nodes are the mechanism by which a workflow triggers the execution of a computation/processing task

Page 44: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. Server2. Client3. Mechanism4. Action

Question139. __________ maps input key/value pairs to a set of intermediate key/value pairs

1. Mapper2. Reducer3. Mapper and reducer4. None of the above

Question140. Zookeeper keep track of the cluster state such as the ____________- table location

1. Domain2. Node3. Root4. All of the above

Question141. When __________ contents exceed a configurable threshold, the memtable data, which includes indexes, is put in a queue to be flushed to disk

1. Subtable2. Memtable3. Intable

Page 45: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. Memorytable

Question142. Apache knox accesses hadoop cluster over

1. HTTP2. TCT3. ICMP4. None of the above

Question143. ___________ supports a new command shell beeline that works with hiveserver2.

1. Hiveserver22. Hiveserver33. Hiveserver44. Hiveserver5

Question144. ________ sink can be a text file, the console display, a simple HDFC path or a null bucket where the data is simply deleted

1. Collector tier event2. Agent tier event3. Basic4. None of the above

Question145. __________ name node is used when the primary name node goes down

Page 46: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. Rack2. Data3. Secondary4. None of the above

Question146. Data transfer between web-console and clients are protected by using

1. SSL2. Kerberos3. SSH4. None of the above

Question147. Which of the following is one of the possible state for a workflow jobs?

1. PREP2. START3. RESUME4. END

Question148. Stratus will be a polygot _______ framework

1. Daas2. Paas3. Saas4. Raas

Page 47: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question149. All file access user java’s _______ APIs which give lucen stronger index safety

1. NIO.22. NIO.33. NIO.4

4. NIO.5Question150. Which of the following is a standard compliment XML Query processor?1. Whirr2. VXQuery3. Knife4. Lens

Question151. ______ is a query processing and optimization system for large-scale

1. MRQL2. Nifi3. Openaz4. ODF toolkit

Question152. Reduce progress () gets the progress of the jobs reduce tasks as a float between

1. 0.0-1.02. 1.0-2.03. 2.0-3.0

Page 48: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. 3.0-4.0

Question153. _____- is a framework for building java server applications GUIs

1. My faces2. Muse3. Flume4. Big top

Question154. Apache fkune 1.3.0 is the fourth release under the auspices of apache of the so-called _____ codeline

1. NG2. ND3. NF4. NR

Question155. Starting in hive ______ the avro scheme can be inferred from the hive table scheme

1. 0.142. 0.123. 0.134. 0.11

Question156. A workflow definition is a ___ with control flow nodes or action nodes

Page 49: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. CAG2. DAG3. BAG4. None of the above

Question157. Lucene provides scalable high-performance indexing over ______ per hour on modern hardware

1. 1TB2. 150GB3. 10GB4. 200 GB

Question158. The right level of parallelism for maps seems to be around _____ maps pernode

1. 1to 102. 10 to 1003. 100 to 1504. 150 to 200

Question159. The LZO compression format is composed of approximately ______ blocks of compressed data

1. 128k2. 256k

Page 50: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

3. 24k4. 36k

Question160. ___________ is the software development collaboration tool.

1. Buildr2. Cassandra3. Bloodhound4. All of the above

Question161. A ________ is an operation on the stream that can transform the stream

1. Decorator2. Source3. Sinks4. All of the above

Question162. _________ has the worlds largest hadoop cluster

1. Apple2. Datamatics3. Facebook4. None of the above

Page 51: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question163. When a ___________ is triggered the client receives a packet saying that the znode has changed

1. Event’2. Watch3. Row4. Value

Question164. Ambary leverages __________ for system altering and will send emails when your attention is needed

1. Nagios2. Nagaond3. Ganglia4. None of the above

Question165. ____ is a software distribution framework based on OSGi

1. ACE2. Abdera3. Zeppelin4. Accumulo

Question166. Which of the following is content management and punlishing system based on cocoon?

Page 52: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. Lib cloud2. Kafka3. Lenya4. All of the above

Question167. If the failure is of ________ nature oozie will suspend the workflow job

1. Transient2. Non transient3. Permanent4. Non permanent

Question168. _________- node distributes code across the cluster

1. Zookeeper2. Nimbus3. Supervisor4. Non of the above

Question169. A workflow definition must have one _____ node

1. Start2. Resume3. Finish4. Non of the above

Page 53: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question170. _________ is a rest API for HCatalog

1. Webhcat2. WbhCAT3. InpJcat4. None of the above

Question171. Which of the following fike contains user defined functions (UDGCs)

1. Script2-local.pig2. Pig.jar3. Tutorial.jar4. Excite.log.bz2

Question172. Helprace is using zookeeper on a ______ cluster in conjugation with hadoop and hBase

1. 3 node2. 4 node3. 5 node4. 6 node

Question173. ____________ represents the logical computations of your crunch pipelines

1. Do Fns2. Three Fns3. Do fn

Page 54: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

4. None of the above

Question174. _____________ has stronger ordering guarantees than a traditional messaging system

1. Kafka2. Slider3. Suz4. None of the above

Question175. HBase is _________ defines only column families

1. Row oriented2. Scheme less3. Fixed scheme4. All of the above

Question176. An input ___________ is a chunk of the input that is processed by a single map

1. Textformat2. Split3. Datanode4. All of the above

Question177. ___________ permits data written by one system to be efficiency sorted by another system

1. Complex data type

Page 55: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

2. Order3. Sort order4. All of the above

Question178. __________ text is appropriate for most non binary data types

1. Character2. Binary3. Delimited4. None of the above

Question179. __________ is an open source set of libraries tools examples and documentation engineered

1. Kite2. Kize3. Ookie4. All of the above

Question180. Map output larger than_______ percent of the memory allocated to copying map outputs

1. 102. 153. 254. 35

Page 56: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question181. Cassandra creates a __________for each table which allows you to symlink a table to a chosen physical drive or data volume

1. Directory2. Subdirectory3. Domain4. Path

Question182. Use ________ and embedded the scheme in the create statement

1. Scheme.literal2. Scheme.lit3. Row.literal4. All of the above

Question183. Which of the following can be used to launch spark jobs inside map reduce?

1, SIM2. SIMR

3. SIR

4. RIS

Question184. HDFS works in a __________ fashion

1. Master worker

Page 57: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

2. Master slave3. Worker/slave4. All of the above

Question185. HDFS by default replicates each data block ____ times on different nodes and on at least ____ racks

1. 3,22. 1,23. 2,34. 1,3

Question186. You can run pig in batch mode using __________

1. Pig shell command2. Pig scripts3. Pig options4. All of the above

Question187. Which of the following is the primitive data type in Avro?

1. Null2. Boolean3. Float4. All of the above

Page 58: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question188. ___________ name node is used when the primary name node goes down

1. Rack2. Data3. Secondary4. None of the abive

Question189. Which command is used to disable all the tables matching the given regex?

1. Remove all2. Drop all3. Disable all4. All of the above

Question190. Ambari ___________ deliver a template approach to cluster deployment

1. View2. Stack advisor3. Blueprints4. All of the above

Question191. Cassandra uses a protocol called __________ to discover location and state information

1. Gossip2. Intergos

Page 59: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

3. Goss4. All of the above

Question192. Gzip (short for GNU zip) generates compressed files that have a ________extension

1. .gzip2. .gz3. .gzp4. .g

Question193. Falcon provides _________ workflow for copying data from source to target.

1. Recurring2. Investment3. Data4. None of the above

Question194. ___________- is the node responsible for all reads amd writes for the given partition

1. Replicas2. Leader3. Follower4. Isr

Question195. The compression offset map grows to _______ gb per terabyte compressed

Page 60: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

1. 1-32. 426593. 20-224. 0-1

Question196. Drill also provides intuitive extensions to SQL to work with ____________ data types

1. Simple2. Nested3. Int4. All of the above

Question197. Hive uses _________- for logging

1. Logj42. Log4l3. Log4i4. Log4j

Question198. Spark SQL provides a domain specific language to manipulate _______________---- in scala, java or python

1. Spark streaming2. Spark SQL3. RDDs4. All of the above

Page 61: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question199. HBase is a distributed _________ database built on top of the hadoop file system

1. Column oriented2. Row oriented3. Tuple oriented4. None of the above

Question200. Which of the following has method to deal with metadata?

1. Load push down2. Load metadata3. Load caster4. All of the above

Question201. Which of the following is a collaborative data analytics and visualization tool?

1. ACE2. Abdera3. Zeppelin4. Accumulo

Question202. Ignite is a unified _______ data fabric providing high performance, distributed im memory data management

1. Column

Page 62: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

2. In memory3. Row oriented4. Column oriented

Question203. Avro messages are framed as a list of __________

1. Buffers2. Frames3. Rows4. Column

Question204. ___________is a distributed and scalable OLAP engine built on hadoop to support extremely large data sets

1. Kylin2. Lens3. Log4cxx24. MRQL

Question205. Sqoop is an open source tool written at___________

1. Cloudera2. IBM3. Microsoft4. All of the above

Page 63: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

Question206. Zookeeper essentially mirrors the _______ functionality exposed in the linux kernel

1. Iread2. Inotify3. Iwrite4. Icount

Question207. Apache bigtop uses _________ for continuous integration testing

1. Jenkinstop2. Jerry3. Jenkins4. None of the above

Question208. Which of the following command is used to show values to key used in pig?

1. Set2. Declare3. Display4. All of the above

Question209. For apache ________ users storm utilizes the same ODBC interfaces

1. C takers2. Hive

Page 64: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster

3. Pig4. Oozie

Question210. The tokens are passed through a lucene ___________ to produce NGrams of the desired length

1. Shnglefil2. Shingle filter3. Single filter4. Collfilter

Page 65: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 66: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 67: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 68: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 69: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 70: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 71: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 72: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 73: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 74: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 75: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 76: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 77: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 78: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 79: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 80: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 81: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 82: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 83: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster
Page 84: bigdatathon.avwebtech.combigdatathon.avwebtech.com/quizBigData.docx  · Web viewPipe. Decision. Flag. None of the above. ... Q115- _____ has the world’s largest Hadoop cluster