Achieve New Updated (September) Cloudera CCA-505 Examination Questions 11-20

Ensurepass

 

QUESTION 11

You have a Hadoop cluster running HDFS, and a gateway machine external to the cluster from which clients submit jobs. What do you need to do in order to run on the cluster and

 

 

 

 

submit jobs from the command line of the gateway machine?

 

A.

Install the impslad daemon, statestored daemon, and catalogd daemon on each machine in the cluster and on the gateway node

B.

Install the impalad daemon on each machine in the cluster, the statestored daemon and catalogd daemon on one machine in the cluster, and the impala shell on your gateway machine

C.

Install the impalad daemon and the impala shell on your gateway machine, and the statestored daemon and catalog daemon on one of the nodes in the cluster

D.

Install the impalad daemon, the statestored daemon, the catalogd daemon, and the impala shell on your gateway machine

E.

Install the impalad daemon, statestored daemon, and catalogd daemon on each machine in the cluster, and the impala shell on your gateway machine

 

Answer: B

 

 

QUESTION 12

Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the result when you execute: hadoop jar samplejar.jar MyClass on a client machine?

 

A.

SampleJar.jar is sent to the ApplicationMaster which allocation a container for Sample.jar

B.

SampleJar.Jar is serialized into an XML file which is submitted to the ApplicationMaster

C.

SampleJar.Jar is sent directly to the ResourceManager

D.

SampleJar.Jar is placed in a temporary directly in HDFS

 

Answer: A

 

 

QUESTION 13

Your Hadoop cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. Can you configure a worker node to run a NodeManager daemon but not a DataNode daemon and still have a function cluster?

 

A.

Yes. The daemon will receive data from the NameNode to run Map tasks

B.

Yes. The daemon will get data from another (non-local) DataNode to run Map tasks

C.

Yes. The daemon will receive Reduce tasks only

 

 

 

 

 

Answer: A

 

 

QUESTION 14

Your cluster has the following characteristics:

 

A rack aware topology is configured and on

Replication is not set to 3

Cluster block size is set to 64 MB

 

Which describes the file read process when a client application connects into the cluster and requests a 50MB file?

 

A.

The client queries the NameNode which retrieves the block from the nearest DataNode to the client and then passes that block back to the client.

B.

The client queries the NameNode for the locations of the block, and reads from a random location in the list it retrieves to eliminate netw

ork I/O leads by balancing which nodes it retrieves data from at any given time.

C.

The client queries the NameNode for the locations of the block, and reads all three copies. The first copy to complete transfer to the client is the one the client reads as part of Hadoop’s speculative execution framework.

D.

The client queries the NameNode for the locations of the block, and reads from the first location in the list it receives.

 

Answer: A

 

 

QUESTION 15

In CDH4 and later, which file contains a serialized form of all the directory and files inodes in the filesystem, giving the NameNode a persistent checkpoint of the filesystem metadata?

 

A.

fstime

B.

VERSION

C.

Fsimage_N (Where N reflects all transactions up to transaction ID N)

D.

Edits_N-M (Where N-M specifies transactions between transactions ID N and transaction ID N)

 

Answer: C

Reference: http://mikepluta.com/tag/namenode/

 

 

 

 

 

 

QUESTION 16

You have a 20 node Hadoop cluster, with 18 slave nodes and 2 master nodes running HDFS High Availability (HA). You want to minimize the chance of data loss in you cluster.

What should you do?

 

A.

Add another master node

to increase the number of nodes running the JournalNode which increases the number of machines available to HA to create a quorum

B.

Configure the cluster’s disk drives with an appropriate fault tolerant RAID level

C.

Run the ResourceManager on a different master from the NameNode in the order to load share HDFS metadata processing

D.

Run a Secondary NameNode on a different master from the NameNode in order to load provide automatic recovery from a NameNode failure

E.

Set an HDFS replication factor that provides data redundancy, protecting against failure

 

Answer: C

 

 

QUESTION 17

You have a cluster running with the Fair Scheduler enabled. There are currently no jobs running on the cluster, and you submit a job A, so that only job A is running on the cluster. A while later, you submit Job

B.now job A and Job B are running on the cluster at the same time. How will the Fair Scheduler handle these two jobs?

 

A.

When job A gets submitted, it consumes all the tasks slots.

B.

When job A gets submitted, it doesn’t consume all the task slots

C.

When job B gets submitted, Job A has to finish first, before job B can scheduled

D.

When job B gets submitted, it will get assigned tasks, while Job A continue to run with fewer tasks.

 

Answer: C

 

 

QUESTION 18

You want a node to only swap Hadoop daemon data from RAM to disk when absolutely necessary. What should you do?

 

 

 

 

 

A.

Delete the /swapfile file on the node

B.

Set vm.swappiness to o in /etc/sysctl.conf

C.

Set the ram.swap parameter to o in core-site.xml

D.

Delete the /etc/swap file on the node

E.

Delete the /dev/vmswap file on the node

 

Answer: B

 

 

QUESTION 19

You are configuring your cluster to run HDFS and MapReduce v2 (MRv2) on YARN. Which daemons need to be installed on your clusters master nodes? (Choose Two)

 

A.

ResourceManager

B.

DataNode

C.

NameNode

D.

JobTracker

E.

TaskTracker

F.

HMaster

 

Answer: AC

 

 

QUESTION 20

You observe that the number of spilled records from Map tasks far exceeds the number of map output records. Your child heap size is 1GB and your io.sort.mb value is set to 100 MB. How would you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?

 

A.

Decrease the io.sort.mb value to 0

B.

Increase the io.sort.mb to 1GB

C.

For 1GB child heap size an io.sort.mb of 128 MB will always maximize memory to disk I/O

D.

Tune the io.sort.mb value until you observe that the number of spilled records equals (or is as close to equals) the number of map output records

 

Answer: D

Free VCE & PDF File for Cloudera CCA-505 Real Exam

Instant Access to Free VCE Files: CompTIA | VMware | SAP …
Instant Access to Free PDF Files: CompTIA | VMware | SAP …

 

This entry was posted in CCA-505 Examination questions (September) and tagged , , , , , , . Bookmark the permalink.