Exam Code: C2090-102 (Practice Exam Latest Test Questions VCE PDF)
Exam Name: IBM Big Data Architect
Certification Provider: IBM
Free Today! Guaranteed Training- Pass C2090-102 Exam.
Online C2090-102 free questions and answers of New Version:
NEW QUESTION 1
Which of the following is a consideration in sizing an active archive Hadoop infrastructure?
- A. replication factor within Hadoop
- B. Reporting requirements
- C. velocity or rate at which data is being generated
- D. veracity or trustworthiness of the data
Answer: B
Explanation:
Reference:
http://www.ibm.com/developerworks/library/ba-augment-data-warehouse3/
NEW QUESTION 2
The YARN High Availability feature adds redundancy in the form of an Active/Standby. Which of the following will pair to remove this otherwise single point of failure?
- A. JobTracker
- B. Data Node
- C. Management Node
- D. Resource Manager
Answer: D
Explanation:
References:
http://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/ResourceManagerHA.html
NEW QUESTION 3
As you explore the data for a BigSheets workbook, you must run the workbook against the full data set to get the most current results for analysis. Which statement is TRUE regarding running and visualizing data in a workbook?
- A. You can create graphs for more than one sheet within the same workbook
- B. By default, the first sheet in your workbook is named the Results sheet
- C. When you save and run the workbook, the data in a Child Workbook is theoutput for that workbook
- D. When you add sheets to workbooks, saving the sheets runs the individual data for the sheet but not for the full workbook
Answer: C
Explanation:
Reference:
https://www- 01.ibm.com/support/knowledgecenter/SSPT3X_4.1.0/com.ibm.swg.im.infosphere.biginsight s.analyze.doc/doc/bigsheets_con_workbooks.html
NEW QUESTION 4
You are designing storage for a new Hadoop cluster. Which of the following statements is TRUE regarding the usage of SAN or NAS?
- A. SAN or NAS should not be used to set up HDFS
- B. SAN or NAS must be used, if available, to provide backup capabilities
- C. SAN or NAS can be used to support retention policies
- D. SAN or NAS cannot be used if your Hadoop cluster spans 2 sites
Answer: A
Explanation:
References:
http:// www-01.ibm.com/software/data/infosphere/hadoop/hdfs/
NEW QUESTION 5
Which architecture document is used to help organize projects, manage the complexity of the solution, and ensurethat all architecturerequirements have been addressed?
- A. Operational Model
- B. Component Model
- C. Connection Model
- D. API Model
Answer: B
NEW QUESTION 6
What term applies to the data elements in Infosphere Streams?
- A. Tuples
- B. Operators
- C. Sink adapters
- D. Composite operators
Answer: B
Explanation:
Reference:
http://www-01.ibm.com/support/knowledgecenter/SSCRJU_3.2.1/com.ibm.swg.im.infosphere. streams. glossary.doc/doc/glossary_Streams.html?lang=en
NEW QUESTION 7
A major telecommunication company has millions of customers. Most of their customers are prepaid. Being prepaid customers, they can very easily switch to other vendors. The last four to six months, this company has lost quite a good number of customers to competition. They intend to build a system that can provide them with insight into the customer’s social network (e.g. who is the influencer and who is the follower). They also want the ability to monitor the voice
and data usage patterns in real time and they want the system to be trained over time to predict possible dissatisfactions. Given this scenario, which one of the following would you recommend?
- A. Hadoop
- B. Spark
- C. Cloudant
- D. Netezza
Answer: B
NEW QUESTION 8
Which of the following statements regarding Big R is TRUE?
- A. The Big R API is completely identical to the R API
- B. Big R implements an interpreted computer language
- C. Big R users cannot access Big R capabilities from the standard RStudio client as RStudio is not Hadoopenabled
- D. Big R utilizes the big SQL query engine for processing
Answer: D
Explanation:
References: https://www.ibm.com/support/knowledgecenter/SSPT3X_3.0.0/com.ibm.swg.im.i nfosphere. biginsights.bigr.doc/ doc/intro.html
NEW QUESTION 9
The CAP Theorem states that it is not possible for a distributed computer system to guarantee all three of these?
- A. Consistency, Accuracy, and Partition tolerance
- B. Concurrency, Availability, and Parallel updates
- C. Concurrency, Accuracy, and Parallel updates
- D. Consistency, Availability, and Partition tolerance
Answer: B
NEW QUESTION 10
Which of the following is the section of the Component Model that details how the solution integrates?
- A. Component Relationship Diagram
- B. Component Interface Diagram
- C. Component Interaction Diagram
- D. Component Reaction Diagram
Answer: A
NEW QUESTION 11
By default, Parquet uses which of the following codecs?
- A. SNAPPY
- B. LZO
- C. GZIP
- D. BZIP2
Answer: A
NEW QUESTION 12
A media company wants to measure the effectiveness of their advertising campaign. Before they release a movie they prepare and run a campaign for promotion. Based on the response on Twitter and Facebook they want to decidewhetheror not they should continue a particular campaign. Which of the following should be selected to meet these requirements?
- A. Hadoop
- B. Streams
- C. Unica
- D. Pure Data for Analytics
Answer: C
NEW QUESTION 13
What is used to capture client requirements for software selection and to evaluate the initial functional “fit” of a vendor’s software solution to the business needs of the client?
- A. Operational Model
- B. Requirements Matrix
- C. Viability Assessment
- D. Use Case Model
Answer: B
NEW QUESTION 14
Data high availability is provided by SAN in traditional architectures by employing a level of RAID. What is the Hadoop equivalent?
- A. Data Node
- B. Replication
- C. Quorum Journal Manager
- D. Yarn Resource Manager
Answer: B
Explanation:
References:
http://www.computerweekly.com/feature/Big-data-storage-Hadoop-storage- basics
NEW QUESTION 15
Which of the following statements regarding Big R is TRUE?
- A. Missing data values must be handled by ETL processes prior to analyzing data with Big R
- B. A bigr.frame loads data in memory for optimal performance
- C. A Big R user is responsible for parallelizing the execution of the R functions being used in the R program
- D. Performing a mathematical operation on a Big R vector variable willautomatically loop through each item inthe vector
Answer: D
Explanation:
Reference:
http://www.computerworld.com/article/2497319/business-intelligence-beginner-s-guide-to-r-syntax-quirks-you-llwant-to-know.html
NEW QUESTION 16
The downside of cloud computing, relative to SLAs, is the difficulty in determining which of the following?
- A. Root cause for service interruptions
- B. Turn-Around-Time (TAT)
- C. Mean Time To Recover (MTTR)
- D. First Call Resolution (FCR)
Answer: A
Explanation:
References:
https://en.wikipedia.org/wiki/Service-level_agreement
NEW QUESTION 17
Defining your need to enrich existing customer data you realize that you need to process large quantities of Geospatial data and output to your data warehouse in a standard GeoJSON format. Which of the following would provide a business analyst with the desired output?
- A. Big SQL
- B. BigSheets
- C. Hive queries
- D. Text Analytics
Answer: C
NEW QUESTION 18
Which of the following is NOT a valid Service Level Agreement (SLA) metric?
- A. Mean time between failures
- B. Mean time to repair
- C. Identification to responsible party
- D. Identification of failing component
Answer: D
Explanation:
References:
https://en.wikipedia.org/wiki/Service-level_agreement
NEW QUESTION 19
A bank wants to build a system that tracks all ATM and online transactions in real- time. They want to build a personalized model of their customer’s financial activity by incorporating enterprise data as well as social media data. The system must be able to learn and adapt over a period of time. These personalized models will be used for real time promotions as well as for any fraud or crime detections. Given these requirements, which of the following would recommend?
- A. Spark
- B. Hadoop
- C. Cloudand
- D. Netezza
Answer: D
NEW QUESTION 20
Which of the following statements is TRUE regarding cloud based solutions?
- A. In a Platform as a Service Cloud deployment, the customer chooses the operating system they want to use
- B. Automated recovery from hardware or network failures is not possible in a public cloud implementation, onlyin a private clouds
- C. There are benefits to use the cloud even for small-scale applications
- D. Using firewalls to create network boundaries is sufficient for ensuring cloud security
Answer: C
Explanation:
References:
http://www.ibm.com/developerworks/cloud/library/cl-cloudappdevelop/
NEW QUESTION 21
......
P.S. Dumps-hub.com now are offering 100% pass ensure C2090-102 dumps! All C2090-102 exam questions have been updated with correct answers: https://www.dumps-hub.com/C2090-102-dumps.html (110 New Questions)
