option
Cuestiones
ayuda
daypo
buscar.php

cdp o p

COMENTARIOS ESTADÍSTICAS RÉCORDS
REALIZAR TEST
Título del Test:
cdp o p

Descripción:
preguntas tipo examen

Fecha de Creación: 2025/07/28

Categoría: Otros

Número Preguntas: 85

Valoración:(0)
COMPARTE EL TEST
Nuevo ComentarioNuevo Comentario
Comentarios
NO HAY REGISTROS
Temario:

Cloudera Support Matrix: It is important to align the data platform onto a supported version stack. Cloudera intensely tests supported version stacks. On the matrix page, Click on product version to see all supported versions. Scroll down to see the supported Operating Systems, Databases, Browsers, and JDKs. All of them.

Which Hadoop component is primarily responsible for distributing data blocks across worker nodes and ensuring data locality?. YARN (Yet Another Resource Negotiator). MapReduce. HDFS (Hadoop Distributed File System). ZooKeeper.

Which HDFS daemon maintains in-memory lookup tables that tell clients which DataNodes hold the blocks of a file?. DataNode. SecondaryNameNode. NameNode. JournalNode.

Which HDFS daemon is responsible for reading and writing data blocks to the local disks on worker nodes?. NameNode. DataNode. SecondaryNameNode. JournalNode.

Which Hadoop component is responsible for allocating cluster resources and scheduling compute jobs to run in parallel?. HDFS (Hadoop Distributed File System). YARN (Yet Another Resource Negotiator). MapReduce. ZooKeeper.

Which YARN component maintains a global view of available CPU and memory resources across the cluster and is responsible for scheduling containers on worker nodes?. NodeManager. ResourceManager. ApplicationMaster. JobHistoryServer.

Which YARN daemon is responsible for managing the CPU and memory on each worker node, launching and monitoring containers, and reporting their resource usage back to the ResourceManager?. ResourceManager. ApplicationMaster. NodeManager. JobHistoryServer.

Which of the following relational databases is supported by Cloudera?. CouchDB. MariaDB. MongoDB. Cassandra.

What is the recommended practice for provisioning the relational databases required by Cloudera Manager (CM) and CDP?. Manually create each database through the database CLI. Use the Cloudera Manager GUI to provision databases automatically. Run a script that creates all required databases in one go. Allow the CM installer to auto-create the databases during installation.

Where does Cloudera Manager place the Cloudera Runtime parcel on each host node?. /usr/local/cloudera/parcels/CDH. /opt/cloudera/parcels/CDH. /var/lib/cloudera/parcels/CDH. /etc/cloudera/parcels/CDH.

Which statement best describes the hardware requirements for Cloudera Data Platform (CDP)?. Requires custom, vendor‑specific appliances. Needs specialized high‑performance storage arrays. Runs on standard data center no custom hardware. Only supported on certified cloud instances.

Which Hadoop node sits at the perimeter of the cluster, is used to create and submit compute jobs, but does not run any master services or store HDFS data blocks?. DataNode. Edge Node. NameNode. SecondaryNameNode.

Which Hadoop node receives compute jobs from gateway (edge) nodes and orchestrates their execution across the worker nodes?. DataNode. Worker Node. Master Node. SecondaryNameNode.

Which Hadoop node is responsible for both storing HDFS data blocks and running the data processing tasks?. NameNode. Edge (Gateway) Node. Master Node. Worker Node.

In a Hadoop cluster secured with Kerberos and synchronized with LDAP, which service is responsible for issuing Kerberos tickets for user authentication?. NameNode. LDAP. KDC (Key Distribution Center). ZooKeeper.

Which of the following operating systems natively implements Pluggable Authentication Modules (PAM)?. Microsoft Windows Server. Red Hat Enterprise Linux (RHEL). FreeBSD. macOS.

What is a key benefit of using Pluggable Authentication Modules (PAM) in a Hadoop environment with services like Cloudera Manager, Ranger, Hue, and Zeppelin?. It accelerates HDFS data block replication. It removes the need to individually configure LDAP for each runtime service. It provides high‑availability failover for NameNodes. It enables YARN to schedule containers based on Kerberos tickets.

Which term describes an architecture where critical components are duplicated so that, upon failure of one, operations seamlessly switch to the other?. Sharding. Duplexing. Load Balancing. Partitioning.

In a three‑master high‑availability (HA) setup, what is the primary role of the third master node?. Active Master – handles all client requests. Standby Master – takes over immediately if the active fails. Quorum/Fallback Master – participates in quorum and acts as the HA failover node. Observer Master – monitors cluster health without voting rights.

Which HDFS component is responsible for orchestrating the movement of data blocks around the cluster (for replication, rebalancing, failure recovery, or decommissioning), even though the actual block copies and deletions are handled by the DataNodes?. SecondaryNameNode. DataNode. NameNode. ResourceManager.

Which Hadoop utility is designed for parallel, large‑scale copying of data between HDFS clusters?. hadoop fs -cp. distcp. rsync. scp.

Which Hadoop utility performs parallel copying of HDFS data by running a map-only MapReduce job (using only mappers and no reducers)?. distcp. hadoop fs -cp. rsync. scp.

Hadoop’s default dfs.permissions.superusergroup is set to a group that typically doesn’t exist on the OS or in LDAP by default. Which group name must you manually create (in /etc/group or in LDAP) to satisfy this default configuration?. hadoop. supergroup. admin. wheel.

In Cloudera Manager’s architecture, which component maintains the desired state—what services should run, on which hosts, and with what configurations—and is updated whenever configurations change?. Actual State. Host Inspector. Model. Cloudera Agent.

In Cloudera Manager’s architecture, which component captures the actual running processes—what commands are executing, where they’re running, and the exact configuration files in use?. Model. Runtime. Host Inspector. Actual State.

After modifying a service configuration in Cloudera Manager, you’ll see a warning icon next to the affected service. Clicking this icon takes you to which page for resolving outdated settings?. Service Health page. Stale Configurations page. Configuration Summary page. Role Configurations page.

Which Hadoop configuration file is created by the service developer and packaged inside a JAR, providing the out‑of‑the‑box default settings?. *-site.xml. *-default.xml. *-env.sh. *.properties.

Which Hadoop configuration file is generated by Cloudera Manager (or an administrator) and stored per cluster to override default settings?. *-default.xml. *-site.xml. *-env.sh. *.properties.

Which configuration file contains environment settings for the JVM (such as heap sizes and garbage‑collection options) used by Hadoop services?. *-default.xml. *-site.xml. *-env.sh. *.properties.

Which file type is used for various service‑specific configurations, such as Log4j logging settings?. *-default.xml. *-site.xml. *-env.sh. *.properties.

Which feature in Cloudera Manager allows you to override configuration properties that aren’t exposed in the UI and to inject additional custom settings?. Host Templates. Role Groups. Snippets. Service Autoconfiguration.

Which of the following is NOT a characteristic of Maintenance Mode in Cloudera Manager?. Suppresses alerts, warnings, and status‑change indicators for the affected object. Exempts the object from host‑level or service‑level bulk operations. Prevents any manual (explicit) operations on the object. Can be applied to both services and service components.

Where would you find the NameNode daemon’s log file on a typical Hadoop cluster node?. /opt/cloudera/parcels/CDH/logs/hdfs/namenode.log. /etc/hadoop/conf/hadoop-hdfs-namenode.log. /var/log/hadoop-hdfs/hadoop-cmf-hdfs-namenode-master-1.example.log. /usr/local/hadoop/logs/hdfs-namenode.log.

Which type of logs encompasses system logs, kernel messages, hardware-related logs, and other operating-system–level records on each cluster host?. Daemon Logs. Host Logs. Audit Logs. Application Logs.

Which characteristic correctly describes HDFS data blocks?. They are mutable and can be overwritten in place. They are immutable once written. They support in‑place byte‑level updates. They auto‑shrink when data is deleted.

By convention, where should you create the home directory for an HDFS user named “alice” so that most Hadoop tools and jobs will locate it automatically?. /home/alice. /user/alice. /hadoop/alice. /data/alice.

Which statement best describes the difference between a Cloudera parcel and an OS‑level package (RPM/DEB)?. Parcels are installed and managed via yum/apt, while packages are downloaded, distributed, and activated by Cloudera Manager.. Parcels are downloaded, distributed, and activated by Cloudera Manager—supporting side‑by‑side versions and easy rollback—whereas packages install a single version via the OS’s package manager and require manual upgrades or rollbacks. Both parcels and packages use the OS’s native package manager, but only packages support rollback. Parcels and packages are functionally identical; the difference is only in terminology.

Who is responsible for installing parcels in a cluster managed by Cloudera Manager?. The administrator via yum/apt. The HDFS NameNode. Cloudera Manager (Server y Agents). The YARN ResourceManager.

Which mechanism is used to encrypt data at rest (on disk) in Cloudera Data Platform?. Kerberos. Ranger KMS. TLS/SSL. HDFS NameNode.

Which mechanism encrypts data in transit (over the network) between clients and services in CDP?. Kerberos. Ranger KMS. TLS/SSL. SSH.

Which of the following best describes the role of Kerberos in a Hadoop cluster?. Encrypting HDFS data at rest. Managing encryption keys for HDFS zones. Proving user identity via tickets. Encrypting data in transit between services.

Which component is responsible for evaluating cluster health tests and metrics and triggering the configured alerts in Cloudera Manager?. Cloudera Agent. ResourceManager. Cloudera Manager. NodeManager.

In Cloudera Manager, what are Advanced Configuration Snippets (also known as “safety valves”)?. Predefined configuration templates for services. XML/property blocks automatically inserted into *-default.xml by CM. Raw configuration fragments you add to override or inject settings not exposed in the UI. Advanced monitoring modules for custom alerts.

What is the key characteristic of Auto‑TLS in Cloudera Manager?. It requires the administrator to create and upload external CA‑signed certificates. Cloudera Manager generates an internal CA, issues certificates and keystores/truststores, and automatically deploys, renews, and rotates them for all services. It only encrypts traffic between NameNode and DataNodes. It uses Ranger KMS to manage TLS keys.

Which component is essential to enable Kerberos authentication in a Hadoop cluster?. A KDC (Key Distribution Center) with admin credentials. A Ranger KMS server for key management. A TLS service for encrypting data in transit. A ZooKeeper quorum for coordination.

Which component is responsible for encrypting data at rest (on disk) in Cloudera Data Platform?. Kerberos. Ranger KMS. TLS/SSL. HDFS NameNode.

Which mechanism encrypts data in transit between clients and services in CDP?. Ranger KMS. HDFS Transparent Encryption. TLS/SSL. SSH.

Which entity is responsible for verifying identities and issuing (signing) digital certificates in a PKI?. Key Distribution Center (KDC). Certificate Authority (CA). Kerberos Realm. Truststore.

Which operating system family is officially supported for Cloudera Data Platform Private Cloud Base?. Ubuntu/Debian. RHEL, CentOS, Oracle Linux (64‑bit). SUSE Linux Enterprise Server. Windows Server.

Which statement best describes the availability of Pluggable Authentication Modules (PAM) on Linux distributions?. PAM is available only on CentOS and RHEL. PAM is a standard authentication framework supported across most Linux distributions. PAM is exclusive to Debian‑based systems. PAM is deprecated and no longer used.

In Cloudera Manager, which entity is responsible for defining a set of permissions such as “Cluster Admin,” “Operator,” or “Auditor”?. Users. Groups. Roles. Clusters.

If you want every member of an LDAP group to inherit the same CM permissions, you should assign roles to which of the following?. Individual Users. Local CM Database. Groups. Clusters.

A user’s effective permissions in Cloudera Manager are computed as: The intersection of roles assigned directly to the user and to their groups. The union of roles assigned directly to the user and to their groups. Only the roles assigned directly to the user. Only the roles assigned to the user’s primary group.

Where in Cloudera Manager’s UI do you configure the authentication backend (e.g., Database, LDAP/AD, or SAML)?. Hosts → Configuration. Administration → Authentication. Clusters → Security. Management Services → Activity Monitor.

¿De qué se ocupa principalmente el ResourceManager en YARN?. Gestionar la replicación de bloques en HDFS. Programar y asignar contenedores en el clúster según políticas de colas. Ejecutar las tareas del usuario dentro de los contenedores. Autenticar usuarios mediante Kerberos.

Are HDFS data blocks ever moved around the cluster after being written?. No – once written, blocks remain permanently on their original DataNode. Yes – DataNodes autonomously relocate blocks as they see fit. Yes – the NameNode orchestrates block movement for replication, rebalancing, failure recovery, or decommissioning. Only when a block is first created during the initial write.

Which of the following best describes the “cloud storage” backup strategy?. Use distcp to copy data between two on‑premise clusters. Push copies of data to S3, ADLS, or another cloud storage service for off‑site durability. Re‑run ingestion flows (NiFi/Kafka) to regenerate the data. Take local HDFS snapshots and store them under /backup.

Which mechanism is typically used in the “cluster replication” strategy to duplicate HDFS or Hive data to another cluster?. Export metadata with hive export. Use distcp or Cloudera Manager Replication Policies. Reprocess data with a MapReduce job. Run hdfs dfs -cp between the same NameNodes.

What does the “parallel ingest” backup strategy involve?. Creating HDFS snapshots and mounting them in parallel. Manually importing data from external storage devices. Re‑running the same ingestion pipelines (e.g., NiFi or Kafka) in parallel against another destination to rebuild the data. Configuring synchronous disk‑level replication.

What is the first step when adding a new worker node to a Hadoop cluster?. Run the HDFS balancer. Provision the server into the correct rack with networking and power. Apply the Worker Host Template in Cloudera Manager. Install Cloudera Manager Agents on the host.

Which Cloudera Manager feature automatically assigns the DataNode and NodeManager roles to a newly added host?. Add Hosts Wizard. Host Templates. Service Monitor. Safety Valves.

Why is it necessary to run the HDFS balancer after adding a new DataNode?. To validate the new host’s DNS entry. To redistribute existing blocks so the new disk/node receives its fair share of data. To restart the NameNode in high‑availability mode. To upgrade all DataNode software to the latest parcel version.

What is the correct order of steps when adding a new worker node to a Hadoop cluster?. OS/JVM → Provision → Wizard → Rebalance → Template. Provision → OS/JVM → Wizard → Template → Rebalance. Wizard → Provision → OS/JVM → Template → Rebalance. Provision → Wizard → Rebalance → OS/JVM → Template.

What is a Worker Host Template in Cloudera Manager?. A script that automates HDFS rebalancing. A predefined list of services to install on master hosts. A saved set of roles (e.g., DataNode, NodeManager) and configurations that can be applied to new worker hosts in one click. A backup policy for HDFS data.

Which of the following is NOT a benefit of using a Worker Host Template?. Adding many workers quickly and consistently. Avoiding manual, role‑by‑role assignment on each host. Ensuring uniform role configurations across the cluster. Automatically rebalancing HDFS after new hosts are added.

In the YARN Capacity Scheduler, what is the root queue?. A leaf queue where applications run. The top‑level parent queue under which all other queues are organized. A dynamic queue created per application. A reserved queue for system services.

Which queues are eligible for application execution in YARN’s Capacity Scheduler?. Only the root queue. Any queue, regardless of children. Only leaf queues (queues with no children). Only queues explicitly named “default”.

How are capacities and ACLs defined in the YARN Capacity Scheduler?. At the root queue level only. Independently for each leaf queue, without hierarchy. Hierarchically, starting from the root queue down through its children. Via external configuration in ZooKeeper.

When should you enable Maintenance Mode in Cloudera Manager?. When you want to suppress alerts and health checks during planned cluster changes. When you want to permanently disable all monitoring for a service. When you’re scaling up YARN queues for production jobs. When you need to grant users additional permissions.

Which of the following activities does NOT require enabling Maintenance Mode first?. Adding new DataNode roles to multiple hosts. Performing a rolling upgrade of Cloudera Runtime. Running a long-running diagnostic script that causes health warnings. Assigning a new YARN queue to a leaf queue.

Which language is used to create dashboards in Cloudera Manager?. SQL. xSQL. SparkSQL. tsquery.

Which type of Hadoop logs would you find under /var/log/hadoop-yarn/containers/<appId>/. Daemon Logs. Job/Application Logs. Host Logs. Audit Logs.

What is the HDFS fsimage file?. A log of all file system edits since the last restart. A binary snapshot of the HDFS namespace (including directories, metadata, permissions, and block lists). A textual configuration file for HDFS settings. A list of active DataNode block reports.

How does the NameNode keep its fsimage up to date?. It rewrites the entire fsimage on every file operation. It merges the edits log into the fsimage during periodic checkpoints. DataNodes push block changes directly into the fsimage file. The ResourceManager updates the fsimage at job completion.

By default, how often does the NameNode perform a checkpoint to merge the edits log into the fsimage?. Every 10 minutes or every 100,000 transactions. Every hour or every 1,000,000 transactions. Once per day at midnight. Only when the NameNode restarts.

Which three are Cloudera installation packages? (Choose 3). cloudera-manager-daemons. cloudera-manager-agent. cloudera-hadoop-services. cloudera-manager-server.

What service provides key management for HDFS encryption? (Choose 1). DogTag Manager. Ranger KMS. Cloudera Key Vault. Solr KeyManager (Solr search engine).

Apache Atlas serves as a common metadata store that is designed to exchange metadata both inside and outside of the Hadoop stack. Which three of the following are capabilities of Apache Atlas? (Choose 3). Hive Metadata Management. Attribute Based access control. View lineage of data as it moves through various processes. Entity search using model attributes and classifications. Domain Specific Language (DSL) to search entities.

What are three filters available to search in the Ranger Audit window? (Choose 3). User. Exclude Application. Application. Exclude User (Exclude Service User).

Besides auditing access to services, which three other components does Apache Ranger allow you to aude those (choose 3). Plugin Status. Login sessions. Failed SQL queries. Broken Hosts. Administrative events.

Your company follows GDPR compliances for security and data governance. Due to concerns as per personally identifiable information (Pii) its that credit card number is not displayed as a result of cluster log, sql query or audit data. How will you impose this on data stored in HDFS?. Store data in parquet file format. Set an Alert policy to trigger event where Pil is not followed. Set a Log and Redaction policy. Set HDFS level encryption of Data.

Which of the following is a definition of authentication?. Checking an access control list. Confirming the identity of the participant. Determining whether a participant is allowed to perform an action. Ensuring only verified users can access data.

Which is true about Kerberos?. It is a widely used protocol for network authentication. It is not part of CDP. It is part of Microsoft Active Directory. All of the above.

Which of the following is not true about encryption?. Encryption ensures that only authorized users can access, modify, or delete a dataset. Cloudera provides encryption mechanisms to protect data at rest. Encryption uses digital keys to encode various components. Encryption requires use of Active Directory.

Which is true about Apache Ranger?. It helps manage policies for access to files, folders, databases, tables or columns. It provides a central audit location. It tracks all access requests in real time. All of the above.

Denunciar Test