CDP company questions

Test title:
CDP company questions

Description:
Old exam

Creation date: 2025/07/29

Category: Other

Number of questions: 87

Contents:

How can you provide a full view of your cluster to Cloudera Support? (Choose 1). Send Cloudera Management Service logs to support. Send a support bundle using the CM UI. Send a CSR report using the CM UI. Generate image files and upload them to the support site.

How do you create a diagnostic bundle when requested by support? (Choose 1). In Cloudera Manager, Administration > Support > Send Bundle. Take a screenshot of the log in Cloudera Manager and add that to the support case. In Cloudera Manager, Support > Send Diagnostic Data. In Cloudera Manager, Clusters > Cluster Name > Action > Send Diagnostic Data.

How can you automate taking snapshots? (Choose 2). Hue Snapshot Manager. Shell scripts. Replication policies in Cloudera Manager. Snapshot policies in Cloudera Manager.

What is the Cloudera Navigator Key HSM service? (Choose 1). It's a hardware encryption module that could be ordered by Cloudera Inc. for securing data in the Hadoop cluster. It encrypts Cloudera Navigator audit data if that level of security is needed. It manages the cryptographic keys rotation and ACLs through HSM (Cloudera High Security Module). It enables Key Trustee Server to use an HSM (hardware security module) as a root of trust for cryptographic keys.

After installing a brand new cluster, how do you verify that the cluster is working with acceptable performance? (Choose 1). Cluster baseline performance testing using Teragen and Terasort. Use Cloudera Manager to create performance charts and dashboards for future use. Log in to Workload XM and review its performance analysis for Apache YARN applications. Log in to Cloudera Manager and check the services are green.

Which three are Cloudera installation packages? (Choose 3). cloudera-manager-daemons. cloudera-manager-agent. cloudera-hadoop-services. cloudera-manager-server.

How can you provide Ranger KMS HA? (Choose 1). With an active-active node installation with a load balancer. With an active-passive node installation. Ranger KMS doesn't support HA. With an active-active node installation without a load balancer.

Using Cloudera Manager, what three types of logs could you view to identify service start failures? (Choose 3). Standard output log. Standard error log. Role Log. JVM trace log.

What data can you back up using Replication Manager? (Choose 2). HDFS. Zookeeper. Hive. Kudu.

How do you add a service to Apache Ranger? (Choose 1). Add service to Service Manager in Apache Ranger. Add service via Ranger API. Select the Configuration of the service and check Enable Ranger Authorization. Restart the service.

Which two of the following statements are TRUE about snapshots? (Choose 2). You can create snapshots of HBase and HDFS. An HBase snapshot creates another copy of tables in HDFS. A snapshot, once created, cannot be deleted. A snapshot is a read-only copy of an HDFS directory at a point in time.

Which two are possible events that trigger an alert? (Choose 2). Service failure. Saturated HDFS capacity. HDFS balancing. Apache YARN application submitted.

Which two of the following are true for Cloudera Workload XM (WXM)? (Choose 2). It provides SQL performance analysis. It automatically optimizes SQL applications. It automatically optimizes Spark applications. It provides baseline monitoring.

Which of the following is the default authentication method for accessing the Ranger Admin User Interface? OLDAR. Linux. PAM. Knox SSO.

Cloudera Manager provides many built-in dashboards and charts. Which three abilities are you provided with? (Choose 3). You can create your own dashboard by assembling your own collection of charts. You can modify the look and feel of existing charts. You can move charts from one dashboard to another dashboard. You can export the dashboards as PDF documents.

What service provides key management for HDFS encryption? (Choose 1). DogTag Manager. Ranger KMS. Cloudera Key Vault. Solr KeyManager.

What are three reasons to invoke a rebalance of the cluster? (Choose 3). When you add more CPU cores to a node. When you move disks from one node to another. When a new host is added to the cluster. When some nodes have much more data on them than others.

Apache Atlas serves as a common metadata store that is designed to exchange metadata both inside and outside of the Hadoop stack. Which three of the following are capabilities of Apache Atlas? (Choose 3). Hive Metadata Management. Attribute-based access control. View lineage of data as it moves through various processes. Entity search using model attributes and classifications. Domain Specific Language (DSL) to search entities.

Which two features does Apache Ranger provide you with, when creating access policies? (Choose 2). Row Filtering. Column Masking. Data Expiration. Query Delay.

What are good examples of controlling data access using classifications with Apache Atlas? (Choose 2). Validity period or expiration date. Setting HDFS ACL on directories. Segregating access privileges by department or region. Sensitive data masking.

Which four of the following can be used to check currently running Apache YARN application status? (Choose 4). Cloudera Manager. Apache YARN command line. Apache YARN Web UI (ResourceManager Web UI). Hue. Openshift state metrics agent.

What are two courses of action when the memory of one of your master hosts is overcommitted? (Choose 2). Lower memory settings on services deployed on that host. Enable swapping. Suppress the warning. Relocate services.

What are three filters available to search in the Ranger Audit window? (Choose 3). User. Exclude Application. Application. Exclude User (Exclude Service User).

Where are Cloudera Manager database parameters stored for the CM server? (Choose 1). In postgresql.conf if the database is PostgreSQL. In /etc/my.cnf if the database is MySQL or MariaDB. They are not stored; you must write them down and seal them in a secure place. In the db.properties file under the /etc/cloudera-scm-server directory.

You want to find important records in the cluster log files. Which three filters can you apply to find relevant information? (Choose 3). Day of Week Filter. Service Filters. Disk Capacity Filter. Log Level Filter. Host Filter.

What is the value used for the heap overhead of role instances? (Choose 1). 0.15. 0.2. 0.3. 0.05.

Which two features in Cloudera Manager allow you to verify the integrity of your cluster? (Choose 2). Runtime Inspector. Java Inspector. Security Inspector. Host Inspector. Maintenance Inspector.

Besides auditing access to services, which three other components does Apache Ranger allow you to audit? (Choose 3). Plugin Status. Failed SQL queries. Broken Hosts. Administrative events. Login sessions.

What is Diagnostic Data Collection? (Choose 1). Cloudera Manager Agents collect diagnostic data on a regular schedule and automatically log it to disk for debugging purposes. HDFS collects business data on a regular schedule, and automatically sends it to Cloudera for backup purposes. Cloudera Manager collects diagnostic data on a regular schedule, and automatically sends it to Cloudera. Apache YARN collects diagnostic data on a regular schedule, and automatically sends it to Cloudera for usage charging.

What are two general resources to identify the root causes of a problem? (Choose 2). Log Files. Diagnostic Bundle. Charts. API documentation.

Which configuration file is NOT used by CDP? (Choose 1). hdfs-site.xml. hdfs-default.xml. hdfs-ops.xml. hdfs-env.sh.

Which two of the following does the Cluster Utilization Report contain? (Choose 2). Cluster CPU utilization. Cloudera Manager Cluster utilization. Cluster Network utilization. Cluster Memory utilization.

You want to implement a Python script that checks for certain service and cluster details. SOAP API and Avro messages. RPC and Protocol Buffers. Thrift API and XML messages. REST API and JSON messages.
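As a rough illustration of the REST-and-JSON option listed above, a Python script might build a Cloudera Manager API URL and parse a JSON reply. This is a minimal sketch rather than a definitive client: the host name, port 7183, API version `v41`, and the sample payload are assumptions made for illustration, and no network request is issued.

```python
import json


def clusters_url(base_url, api_version):
    """Build the URL of the cluster list resource (the version is an assumption)."""
    return f"{base_url.rstrip('/')}/api/{api_version}/clusters"


def summarize_clusters(payload_text):
    """Return (name, status) pairs from a JSON cluster list."""
    payload = json.loads(payload_text)
    return [
        (item["name"], item.get("entityStatus", "UNKNOWN"))
        for item in payload.get("items", [])
    ]


# Hypothetical reply shaped like a cluster list; not captured from a real server.
SAMPLE_REPLY = '{"items": [{"name": "cluster1", "entityStatus": "GOOD_HEALTH"}]}'

if __name__ == "__main__":
    # Hypothetical host, port, and API version.
    print(clusters_url("https://cm.example.com:7183", "v41"))
    print(summarize_clusters(SAMPLE_REPLY))
```

In a real script the reply text would come from an authenticated HTTPS request; that part is left out here because it depends on how the cluster is secured.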

Which three of the following are Log4j logging levels? (Choose 3). ATOMIC. ERROR. WARN. INFO.
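For reference, the standard Log4j levels are ordered by severity, and a log level filter keeps everything at or above a chosen level. The sketch below shows that idea in Python; the log line layout and the sample lines are hypothetical, chosen only to make the example runnable.

```python
# Standard Log4j levels, ordered from least to most severe.
LEVELS = ["TRACE", "DEBUG", "INFO", "WARN", "ERROR", "FATAL"]


def level_of(line):
    """Return the first Log4j level name found as a whole token, or None."""
    for token in line.split():
        if token in LEVELS:
            return token
    return None


def filter_by_level(lines, minimum):
    """Keep lines whose level is at least as severe as `minimum`."""
    threshold = LEVELS.index(minimum)
    kept = []
    for line in lines:
        level = level_of(line)
        if level is not None and LEVELS.index(level) >= threshold:
            kept.append(line)
    return kept


# Hypothetical log lines in a "date time LEVEL message" layout.
SAMPLE = [
    "2025-07-29 10:00:01 INFO Starting role",
    "2025-07-29 10:00:02 WARN Disk usage high",
    "2025-07-29 10:00:03 ERROR Role failed to start",
]

if __name__ == "__main__":
    for line in filter_by_level(SAMPLE, "WARN"):
        print(line)
```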

Some users are reporting that their Impala queries are performing poorly. Which two of the following tools or interfaces would you use to diagnose Impala query performance? (Choose 2). The Impala daemons provide built-in web services to examine query perform. The SQL Command SHOW PERFORMANCE in Impala displays performance. Hue has a performance dashboard for the latest Impala queries. The Queries Tab inside the Impala services allows you to browse Impala queries.

Cloudera Manager is installed using package management tools such as yum for RHEL-compatible systems. The host that you are installing on is not connected to the Internet. What are three options to install Cloudera Manager? (Choose 3). You can manually copy the repository files to the Cloudera Manager Server host for distribution to the other hosts. You cannot install Cloudera Manager on a host that is not connected to the Internet. You can create your own internal repository for hosts that do not have Internet access. Cloudera maintains Internet-accessible repositories for Runtime and Cloudera Manager installation files.

How can you manage/rotate certificates automatically for CDP servers/services? (Choose 1). With the built-in Dogtag server. Automatic certificate management/rotation is not available in CDP. With the Auto-TLS feature turned on. With Active Directory Group Policies.

What must you do after installing Cloudera Navigator Encrypt on a host? (Choose 2). Encrypt the HDFS directory. Register the host with Navigator Key Store. Restart the host. Register the host with Navigator Key Trustee Server.

Choose the three options that could be used to authenticate users to Cloudera Manager. SAML. LDAP/AD. httpaccess file. Linux PAM.

Your company follows GDPR compliance requirements for security and data governance. Because of concerns about personally identifiable information (PII), credit card numbers must not be displayed in cluster logs, SQL queries, or audit data. How will you enforce this for data stored in HDFS? Store data in Parquet file format. Set an alert policy to trigger an event where PII rules are not followed. Set a Log and Redaction policy. Set HDFS-level encryption of data.

Which three of the following statements are true of Ranger KMS with a Key Trustee Server? Must be installed on the Key Trustee Server nodes. Ranger must be installed. Must be installed on different hosts than the Key Trustee Servers. Kerberos must be enabled.

When the certificate is signed by your internal CA, which two parts must the certificate include? TLS Web Server Authentication and TLS Web Client Authentication. SAN DNS. Email address of the requester. IP address of the host.

If using CA-signed certificates with an intermediate CA, which certificate do you append the intermediate certificate to when using Auto-TLS? Append it to the root CA certificate. Append it to the host certificate. Append it to the keystore. There is no need to append it to any certificate.

You are examining some entries in the Ranger Audit window. What three details does every entry include? (Choose 3). Which policy, if any, has been applied. Number of nodes involved in the request. Whether the request was allowed or denied. The timestamp of the request.

Which three options are provided by Cloudera Manager to check cluster states? (Choose 3). Diagnostics (Logs, Events, Server Logs). APM (Application Performance Monitoring). Audits (service lifecycle events, security events). Charts (dashboards, charts).

When creating a user account in Cloudera Manager, how many roles can you assign to it? (Choose 1). Up to three per cluster. As many as needed. Exactly one. You don't assign roles to users, they get assigned by Kerberos automatically.

What is the Apache Knox service? (Choose 1). Built-in directory server for synchronizing external users and groups. Streaming data ingestion designer UI. HDFS data encryption service. Secure API gateway and reverse proxy with SSO capability.

Access to Cloudera Manager requires you to log in with a user name and a password. Which of the following statements are true? Cloudera Manager has exactly one account, called admin. For each managed cluster, you have to create multiple user accounts. You can have exactly one account for each managed cluster. You can create as many accounts as you need.

What two options do you have to ensure data is spread evenly across a cluster? (Choose 2). Run hdfs distcp -autobalance command from a shell. Run the hdfs balancer command from a shell. Run rebalance from the Cloudera Manager HDFS service action menu. From the Cloudera Manager HDFS server action menu set HDFS autobalance to true.
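To make "spread evenly" concrete: the HDFS balancer treats a DataNode as balanced when its utilization is within a threshold (10 percent by default) of the overall cluster utilization. The Python sketch below applies that check to a made-up set of nodes; the node names and capacity figures are hypothetical, and the real balancer also plans and performs block moves, which this example does not attempt.

```python
def utilization(used, capacity):
    """Percentage of capacity in use."""
    return 100.0 * used / capacity


def nodes_outside_threshold(nodes, threshold=10.0):
    """Return names of nodes whose utilization differs from the cluster
    utilization by more than `threshold` percentage points.

    `nodes` maps a node name to a (used, capacity) pair in the same unit.
    """
    total_used = sum(used for used, _ in nodes.values())
    total_capacity = sum(capacity for _, capacity in nodes.values())
    cluster = utilization(total_used, total_capacity)
    return sorted(
        name
        for name, (used, capacity) in nodes.items()
        if abs(utilization(used, capacity) - cluster) > threshold
    )


# Hypothetical DataNodes: (used, capacity) in terabytes.
SAMPLE_NODES = {"dn1": (80, 100), "dn2": (50, 100), "dn3": (20, 100)}

if __name__ == "__main__":
    print(nodes_outside_threshold(SAMPLE_NODES))
```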

What information does the host inspector gather? (Choose 3). Networking. Under replicated blocks. Component Versions. Graphics card resolution. System Time.

You want to receive automatic alerts on critical events and situations. Which two features does Cloudera Manager support? You can set up to 5 alerts per service. You can decide which events and situations should create an alert. Some alerts allow you to specify thresholds on when to send alerts. All events and situations generate an alert.

You want to set a quota on a specific HDFS directory to limit the maximum size of data to 4 GB. Which two of the following are options that you can use to achieve this? (Choose 2). hdfs dfsadmin -setSpaceQuota command. Create a new classification in Apache Atlas. HDFS file browser in Cloudera Manager. HDFS file browser in Hue. HDFS set quota in Apache Ranger.
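The arithmetic behind a 4 GB space quota can be sketched in Python. The example assumes "4 GB" means 4 × 1024³ bytes and uses a hypothetical directory `/data/project`; it only builds the argument list for `hdfs dfsadmin -setSpaceQuota` and does not run it. Because an HDFS space quota counts every replica of a block, the amount of user data that fits is smaller than the quota itself.

```python
GIB = 1024 ** 3  # bytes in one binary gigabyte


def space_quota_bytes(gib):
    """Convert a size in binary gigabytes to bytes."""
    return gib * GIB


def set_space_quota_command(directory, quota_bytes):
    """Argument list for setting a space quota on one directory."""
    return ["hdfs", "dfsadmin", "-setSpaceQuota", str(quota_bytes), directory]


def max_user_data_bytes(quota_bytes, replication):
    """Largest amount of user data that fits, since each replica is charged."""
    return quota_bytes // replication


if __name__ == "__main__":
    quota = space_quota_bytes(4)
    # "/data/project" is a hypothetical path used only for illustration.
    print(" ".join(set_space_quota_command("/data/project", quota)))
    print(max_user_data_bytes(quota, 3))
```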

You want to access the Cloudera Manager packages and Cloudera Runtime parcels. Where do you get them from? (Choose 1). You let Apache Maven download them. You get them from a Cloudera website, with a license key. Cloudera's sales team will send the files to you by email. You fetch them from Cloudera's GitHub repository.

In Cloudera Manager, what is the most convenient source of information to troubleshoot overcommitted memory problems on a host? The host Resources page. The host Audits page. The host Status page. The Host Inspector.

Number of volumes that can fail before the DataNode disconnects (Choose 1). dfs.datanode.failed.volumes.tolerated. dfs.datanode.max.locked.memory. Another property that is clearly not the answer.

How can you replicate the data? Erasure Coding. Replication Factor. All of them.

About snapshots (Choose 2). They can be deleted. You cannot take a snapshot of the entire file system. They are automatic. They use duplicate space.

Which components are entities? (Choose 3) (clusters, services, roles and role instances, hosts). Hosts. Services. Role instances (Role). Clusters. Configuration Versions. Role Groups.

Sending alerts (choose 2). Email. SNMP. SMS. Slack.

What is stored in the keytab? (Choose 1). Principals and secret keys. Only the username. TLS certificates. Temporary tokens.

What type of security can we implement in Atlas? (Choose 1). Local classifications. Labels. Tags. Internal policies.

For creating scripts in Ansible, what information is necessary to know about the nodes to be managed? (Choose 2). The IP address of the hosts. The DNS name of the hosts. The kernel version of the hosts. The machine’s UUID.

What should you do to automate the creation of snapshots in HDFS? (Choose 1). Enable it from Cloudera Manager > HDFS > File Browser > Enable Snapshots. Create a tag in Atlas with the label “snapshotable”. Enable snapshots from Hue. Apply a retention policy in Ranger.

When is it necessary to rebalance HDFS? When we add a disk. When we add a host. When we move a disk from one host to another host. When we restart the NameNode.

From where can YARN applications be viewed? (Choose 2). From the command line (CLI) using commands like yarn application -list. From Cloudera Manager > Services > YARN > Applications. From Apache Atlas > Applications. From Apache Ranger > Services > YARN.

If there are issues with Impala queries, from where can they be analyzed? (Choose 2). From Cloudera Manager > Impala > Queries. From the Web UI of each Impala daemon (for example, http://<impalad>:25000). From Apache Atlas Web UI. From Apache Ranger > Queries.

What happens when we create a snapshot in HDFS? (Choose 1). A copy of HBase and its tables is created. They cannot be deleted. A read-only copy of the blocks is created. All data is replicated into a new directory.

From where can data replication be configured in Cloudera? (Choose 1). From Cloudera Manager > Replication > Replication Policies. From Apache Atlas. From Apache Ranger. From the HDFS tab in Hue.

How do you obtain a copy of the NameNode image (fsimage) from a node in the cluster? (Choose 1). hdfs dfsadmin -fetchImage <destination_directory>. hdfs dfsadmin -saveNamespace. hdfs namenode -format. hdfs dfs -copyToLocal /fsimage /tmp.

Which of the following statements are true about tags in the context of Apache Atlas? (Choose 2). They are called classifications in Atlas. They can be created from Atlas. They are applied automatically without user intervention. They can only be applied from Apache Ranger.

In the RS-6-3-1024k erasure coding policy, how many parity blocks per stripe are used? 2. 3. 6. 9.
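HDFS erasure coding policy names follow the pattern codec, number of data units, number of parity units, cell size. A small Python sketch that splits a name such as the one in the question into those parts, and derives the resulting storage overhead, might look like this; the function names are illustrative and not part of any HDFS tooling.

```python
import re

# codec, data units, parity units, cell size in kilobytes (e.g. RS-6-3-1024k)
POLICY_PATTERN = re.compile(r"(.+?)-(\d+)-(\d+)-(\d+)k")


def parse_ec_policy(name):
    """Split an erasure coding policy name into its components."""
    match = POLICY_PATTERN.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognized policy name: {name!r}")
    codec, data, parity, cell_kb = match.groups()
    return {
        "codec": codec,
        "data_units": int(data),
        "parity_units": int(parity),
        "cell_size_bytes": int(cell_kb) * 1024,
    }


def storage_overhead(data_units, parity_units):
    """Raw bytes stored per byte of user data."""
    return (data_units + parity_units) / data_units


if __name__ == "__main__":
    policy = parse_ec_policy("RS-6-3-1024k")
    print(policy)
    print(storage_overhead(policy["data_units"], policy["parity_units"]))
```

For comparison, plain three-way replication stores three raw bytes per byte of user data.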

Which erasure codec is currently supported by CDP? LRC (Local Reconstruction Codes). Turbo Codes. Reed-Solomon (RS). Hamming Code.

Which Kerberos daemon is responsible for authenticating users and granting them initial credentials? kadmind. sshproxy. kinit. krb5kdc.

What is the main function of the kadmind Kerberos daemon? Issues initial tickets to users during login. Manages Kerberos database administration requests. Lists all Kerberos tickets for a user. Starts the Kerberos client authentication process.

What is the primary role of Apache ZooKeeper in a CDP cluster? Scheduling YARN resource allocations. Handling Hive metastore queries. Coordinating distributed services and maintaining configuration data. Managing HDFS file replication.

What does the INVALIDATE METADATA command do in Impala? Clears all cached query results from memory. Reloads metadata for a single table from the Hive Metastore. Removes cached metadata for all objects and repopulates it from the Hive Metastore on the next access. Forces a manual statistics recomputation for a table.

Which file format works best with Impala for fast analytical queries? Text. CSV. Parquet. ORC.

Which file format is highly optimized for reading, writing, and processing data in Hive? Avro. ORC. Parquet. JSON.

Which compression codec is best suited for cold data that is accessed infrequently? Snappy. Gzip. LZO. Bzip2.

Which compression codec is best suited for hot data that is accessed frequently? Snappy. Gzip. Bzip2. enzip.

Which file formats are supported by Apache Sqoop for importing/exporting data? ORC, Parquet, JSON. CSV, Avro, Parquet. CSV, ORC, Avro. Avro, JSON, SequenceFile.

Which ACID property ensures that a transaction is “all-or-nothing,” meaning either all its operations succeed or none are applied? Consistency. Atomicity. Durability. Isolation.
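The all-or-nothing behaviour described in the question above can be demonstrated with Python's built-in `sqlite3` module. This is only an illustrative sketch with a hypothetical `accounts` table, not something taken from CDP: a transfer that fails halfway is rolled back, so neither of its updates remains visible.

```python
import sqlite3


def transfer(conn, src, dst, amount):
    """Move `amount` between two accounts inside one transaction."""
    with conn:  # commits on success, rolls back if an exception escapes
        conn.execute(
            "UPDATE accounts SET balance = balance - ? WHERE name = ?",
            (amount, src),
        )
        (balance,) = conn.execute(
            "SELECT balance FROM accounts WHERE name = ?", (src,)
        ).fetchone()
        if balance < 0:
            raise ValueError("insufficient funds")
        conn.execute(
            "UPDATE accounts SET balance = balance + ? WHERE name = ?",
            (amount, dst),
        )


def balances(conn):
    """Return the current balances as a name -> balance mapping."""
    return dict(conn.execute("SELECT name, balance FROM accounts ORDER BY name"))


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE accounts (name TEXT PRIMARY KEY, balance INTEGER)")
    conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("a", 100), ("b", 0)])
    conn.commit()
    transfer(conn, "a", "b", 30)       # succeeds and is committed
    try:
        transfer(conn, "a", "b", 500)  # fails after the first UPDATE
    except ValueError:
        pass                           # the partial debit was rolled back
    return balances(conn)


if __name__ == "__main__":
    print(demo())
```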

Which ACID property ensures that concurrent transactions do not affect each other and that intermediate states are invisible to other transactions? Atomicity. Durability. Consistency. Isolation.

Which ACID property guarantees that once a transaction is committed, its results remain even if the system crashes? Atomicity. Isolation. Durability. Consistency.

What is the correct sequence for decommissioning a worker node in a CDP cluster? Stop Roles → Maintenance Mode → Decommission NodeManager → Decommission DataNode → Modify or Repair. Maintenance Mode → Decommission NodeManager → Decommission DataNode → Stop Roles → Modify or Repair. Decommission NodeManager → Maintenance Mode → Decommission DataNode → Stop Roles → Modify or Repair. Maintenance Mode → Decommission DataNode → Decommission NodeManager → Stop Roles → Modify or Repair.

What is the correct sequence for recommissioning a worker node in a CDP cluster? Start Components → Recommission NodeManager → Recommission DataNode → Turn Off Maintenance Mode → Restore to Full Service. Recommission DataNode → Recommission NodeManager → Start Components → Turn Off Maintenance Mode → Restore to Full Service. Start Components → Recommission DataNode → Recommission NodeManager → Turn Off Maintenance Mode → Restore to Full Service. Start Components → Turn Off Maintenance Mode → Recommission DataNode → Recommission NodeManager → Restore to Full Service.

Which statement correctly describes the difference between “Remove From Cluster” and “Remove From Cloudera Manager”? “Remove From Cluster” decommissions roles and unregisters the host from Cloudera Manager, while “Remove From Cloudera Manager” only decommissions roles but keeps the host registered. “Remove From Cluster” decommissions roles but keeps the host registered in Cloudera Manager, while “Remove From Cloudera Manager” decommissions roles and unregisters the host completely. Both commands remove the host entirely from Cloudera Manager, but “Remove From Cluster” preserves data directories. “Remove From Cluster” is used for permanent removal, while “Remove From Cloudera Manager” is for temporary maintenance.
