READ Free Dumps For EMC- E20-532
Question ID 16208 | What would be considered "Big Data"? |
Option A | An OLAP Cube containing customer demographic information about 100, 000, 000 customers |
Option B | Daily Log files from a web server that receives 100, 000 hits per minute |
Option C | Aggregated statistical data stored in a relational database table |
Option D | Spreadsheets containing monthly sales data for a Global 100 corporation |
Correct Answer | B |
Question ID 16210 | You are given 10, 000, 000 user profile pages of an online dating site in XML files, and they |
Option A | Run MapReduce to transform the data, and find relevant key value pairs. |
Option B | Divide the data into sets of 1, 000 user profiles, and run K-means clustering in RHadoop iteratively. |
Option C | Run a Naive Bayes classification as a pre-processing step in HDFS. |
Option D | Partition the data by XML file size, and run K-means clustering in each partition. |
Correct Answer | A |