SAP-C02 Question #281

Question

A company is collecting a large amount of data from a fleet of IoT devices. Data is stored as Optimized Row Columnar (ORC) files in the Hadoop Distributed File System (HDFS) on a persistent Amazon EMR cluster. The company's data analytics team queries the data by using SQL in Apache Presto deployed on the same EMR cluster. Queries scan large amounts of data, always run for less than 15 minutes, and run only between 5 PM and 10 PM. The company is concerned about the high cost associated with the current solution. A solutions architect must propose the most cost-effective solution that will allow SQL data queries. Which solution will meet these requirements?
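
The cost concern in the question can be made concrete with a little arithmetic. The sketch below uses only the 5 PM to 10 PM query window stated in the question and assumes the persistent cluster runs 24 hours a day. The hourly rate is a hypothetical placeholder, not an actual AWS price, and the function names are illustrative only.

```python
# Rough sketch: how much of each day an always-on cluster sits idle
# when queries run only inside a fixed window.

HOURS_PER_DAY = 24

# Query window from the question: 5 PM (17:00) to 10 PM (22:00).
QUERY_WINDOW_START_HOUR = 17
QUERY_WINDOW_END_HOUR = 22


def idle_fraction(start_hour: int, end_hour: int) -> float:
    """Fraction of the day that falls outside the query window."""
    active_hours = end_hour - start_hour
    return (HOURS_PER_DAY - active_hours) / HOURS_PER_DAY


def daily_idle_cost(hourly_rate: float, start_hour: int, end_hour: int) -> float:
    """Daily spend attributable to hours in which no queries run."""
    return hourly_rate * HOURS_PER_DAY * idle_fraction(start_hour, end_hour)


if __name__ == "__main__":
    # ASSUMPTION: 10.0 per hour is a made-up cluster rate for illustration.
    hypothetical_hourly_rate = 10.0
    fraction = idle_fraction(QUERY_WINDOW_START_HOUR, QUERY_WINDOW_END_HOUR)
    cost = daily_idle_cost(
        hypothetical_hourly_rate, QUERY_WINDOW_START_HOUR, QUERY_WINDOW_END_HOUR
    )
    print(f"Idle fraction of the day: {fraction:.1%}")
    print(f"Daily idle cost at the assumed rate: {cost:.2f}")
```

With a 5-hour query window, 19 of every 24 hours fall outside it, so most of the spend on an always-on cluster buys capacity that no query uses.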

Accepted Answer

Correct answer: B
