Get all latest (August) Cloudera CCA-500 Actual Test 1-10

Ensurepass

QUESTION 1

You are migrating a cluster from MApReduce version 1 (MRv1) to MapReduce version 2 (MRv2) on YARN. You want to maintain your MRv1 TaskTracker slot capacities when you migrate. What should you do?

 

A.

Configure yarn.applicationmaster.resource.memory-mb and yarn.applicationmaster.resource.cpu-vcores so that ApplicationMaster container allocations match the capacity you require.

B.

You don’t need to configure or balance these properties in YARN as YARN dynamically balances resource management capabilities on your cluster.

C.

Configure mapred.tasktracker.map.tasks.maximum and mapred.tasktracker.reduce.tasks.maximum ub yarn-site.xml to match your cluster’s capacity set by the yarn-scheduler.minimum-allocation.

D.

Configure yarn.nodemanager.resource.memory-mb and yarn.nodemanager.resource.cpu-vcores to match the capacity you require under YARN for each NodeManager.

 

Correct Answer: D

 

 

QUESTION 2

Given:

 

clip_image002

 

You want to clean up this list by removing jobs where the State is KILLED. What command you enter?

 

A.

Yarn application -refreshJobHistory

B.

Yarn application -kill application_1374638600275_0109

C.

Yarn rmadmin -refreshQueue

D.

Yarn rmadmin -kill application_1374638600275_0109

 

Correct Answer: B

Explanation:

http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.1-latest/bk_using-apache-hadoop/content/common_mrv2_commands.html

 

 

QUESTION 3

You have a Hadoop cluster HDFS, and a gateway machine external to the cluster from which clients submit jobs. What do you need to do in order to run Impala on the cluster and submit jobs from the command line of the gateway machine?

 

A.

Install the impalad daemon statestored daemon, and daemon on each machine in the cluster, and the impala shell on your gateway machine.

B.

Install the impalad daemon, the statestored daemon, the catalogd daemon, and the impala shell on your gateway machine.

C.

Install the impalad daemon and the impala shell on your gateway machine, and the statestored daemon and catalogd daemon on one of the nodes in the cluster.

D.

Install the impalad daemon on each machine in the cluster, the statestored daemon and catalogd daemon on one machine in the cluster, and the impala shell on your gateway machine.

E.

Install the impalad daemon, statestored daemon, and catalogd daemon on each machine in the cluster and on the gateway node.

 

Correct Answer: D

 

 

QUESTION 4

You are running a Hadoop cluster with MapReduce version 2 (MRv2) on YARN. You consistently see that MapReduce map tasks on your cluster are running slowly because of excessive garbage collection of JVM, how do you increase JVM heap size property to 3GB to optimize performance?

 

A.

yarn.application.child.java.opts=-Xsx3072m

B.

yarn.application.child.java.opts=-Xmx3072m

C.

mapreduce.map.java.opts=-Xms3072m

D.

mapreduce.map.java.opts=-Xmx3072m

 

Correct Answer: C

Explanation:

http://hortonworks.com/blog/how-to-plan-and-configure-yarn-in-hdp-2-0/

 

 

QUESTION 5

You observed that the number of spilled records from Map tasks far exceeds the number of map output records. Your child heap size is 1GB and your io.sort.mb value is set to 1000MB. How would you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?

 

A.

For a 1GB child heap size an io.sort.mb of 128 MB will always maximize memory to disk I/O

B.

Increase the io.sort.mb to 1GB

C.

Decrease the io.sort.mb value to 0

D.

Tune the io.sort.mb value until you observe that the number of spilled records equals (or is as close to equals) the number of map output records.

 

Correct Answer: D

 

 

QUESTION 6

You are planning a Hadoop cluster and considering implementing 10 Gigabit Ethernet as the network fabric. Which workloads benefit the most from faster network fabric?

 

A.

When your workload generates a large amount of output data, significantly larger than the amount of intermediate data.

B.

When your workload consumes a large amount of input data, relative to the entire capacity if HDFS.

C.

When your workload consists of processor-intensive tasks.

D.

When your workload generates a large amount of intermediate data, on the order of the input data itself.

 

Correct Answer: A

 

 

 

 

 

 

QUESTION 7

You have just run a MapReduce job to filter user messages to only those of a selected geographical region. The output for this job is in a directory named westUsers, located just below your home directory in HDFS. Which command gathers these into a single file on your local file system?

 

A.

Hadoop fs -getmerge -R westUsers.txt

B.

Hadoop fs -getemerge westUsers westUsers.txt

C.

Hadoop fs -cp westUsers/* westUsers.txt

D.

Hadoop fs -get westUsers westUsers.txt

 

Correct Answer: B

 

 

QUESTION 8

Which three basic configuration parameters must you set to migrate your cluster from MapReduce 1 (MRv1) to MapReduce V2 (MRv2)? (Choose three)

 

A.

Configure the NodeManager to enable MapReduce services on YARN by setting the following property in yarn-site.xml:

<name>yarn.nodemanager.hostname</name>

<value>your_nodeManager_shuffle</value>

B.

Configure the NodeManager hostname and enable node services on YARN by setting the following property in yarn-site.xml:

<name>yarn.nodemanager.hostname</name>

<value>your_nodeManager_hostname</value>

C.

Configure a default scheduler to run on YARN by setting the following property in mapred-site.xml:

<name>mapreduce.jobtracker.taskScheduler</name>

<Value>org.apache.hadoop.mapred.JobQueueTaskScheduler</value>

D.

Configure the number of map tasks per jon YARN by setting the following property in mapred:

<name>mapreduce.job.maps</name>

<value>2</value>

E.

Configure the ResourceManager hostname and enable node services on YARN by setting the following property in yarn-site.xml:

<name>yarn.resourcemanager.hostname</name>

<value>your_resourceManager_hostname</value>

F.

Configure MapReduce as a Framework running on YARN by setting the following property in mapred-site.xml:

<name>mapreduce.framework.name</name>

<value>yarn</value>

 

Correct Answer: AEF

 

 

QUESTION 9

A slave node in your cluster has 4 TB hard drives installed (4 x 2TB). The DataNode is configured to store HDFS blocks on all disks. You set the value of the dfs.datanode.du.reserved parameter to 100 GB. How does this alter HDFS block storage?

 

A.

25GB on each hard drive may not be used to store HDFS blocks

B.

100GB on each hard drive may not be used to store HDFS blocks

C.

All hard drives may be used to store HDFS blocks as long as at least 100 GB in total is available on the node

D.

A maximum if 100 GB on each hard drive may be used to store HDFS blocks

 

Correct Answer: B

 

 

QUESTION 10

You are running a Hadoop cluster with a NameNode on host mynamenode, a secondary NameNode on host mysecondarynamenode and several DataNodes. Which best describes how you determine when the last checkpoint happened?

 

A.

Execute hdfs namenode -report on the command line and look at the Last Checkpoint information

B.

Execute hdfs dfsadmin -saveNamespace on the command line which returns to you the last checkpoint value in fstime file

C.

Connect to the web UI of the Secondary NameNode (http://mysecondary:50090/) and look at the “Last Checkpoint” information

D.

Connect to the web UI of the NameNode (http://mynamenode:50070) and look at the “Last Checkpoint” information

 

Correct Answer: C

Explanation:

https://www.inkling.com/read/hadoop-definitive-guide-tom-white-3rd/chapter-10/hdfs

 

Free VCE & PDF File for Cloudera CCA-500 Real Exam

Instant Access to Free VCE Files: CompTIA | VMware | SAP …
Instant Access to Free PDF Files: CompTIA | VMware | SAP …

This entry was posted in CCA-500 Real Exam (August) and tagged , , , , , , . Bookmark the permalink.