Big Data Technologies
AWS offers an ideal environment for implementing Big Data on the public cloud. It provides a range of core and specialized services that help deliver such projects with a short time-to-market.
AWS ELASTIC MAPREDUCE (EMR)
Processes large volumes of data quickly and easily using frameworks such as Spark, Presto, HBase or Flink.
AWS LAMBDA
Lets you run code without provisioning or managing servers. Pay only for what you use and enjoy native integration with other AWS services.
AWS KINESIS
Like other AWS services, it is fully managed and easily scalable. It lets you collect, process and analyze data streams in real time.
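As an illustration of how the two services above can fit together, here is a minimal sketch of a Lambda function that consumes records from a Kinesis stream. When Kinesis is the event source, each record's payload arrives base64-encoded under `record["kinesis"]["data"]`. The function name `handler`, the JSON payload format and the `amount` field are assumptions made for this example only.

```python
import base64
import json


def handler(event, context):
    """Hypothetical Lambda entry point for a Kinesis event source."""
    total = 0.0
    count = 0
    for record in event.get("Records", []):
        # Kinesis delivers each payload base64-encoded.
        payload = base64.b64decode(record["kinesis"]["data"])
        # Assumption: producers write JSON objects to the stream.
        message = json.loads(payload)
        # "amount" is an illustrative field name, not part of any AWS API.
        total += float(message.get("amount", 0))
        count += 1
    return {"records": count, "total_amount": total}
```

No server is provisioned for this code: Lambda invokes the function with batches of records as they arrive on the stream.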
AWS REDSHIFT
Data warehouse specialized in storing and processing large quantities of information. For analytical queries it outperforms conventional databases thanks to its machine learning and massively parallel processing capabilities.
AWS SIMPLE STORAGE SERVICE
Object storage service that can be used as a Data Lake in different architectures, thanks to its durability, scalability and integration with other AWS services.
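One common way to organize objects in an S3 Data Lake is Hive-style partitioning, where `key=value` path segments let engines such as Hive, Spark and Athena skip data that a query does not need. The sketch below only builds such an object key. The helper name `partitioned_key` and the dataset and file names are hypothetical.

```python
from datetime import date


def partitioned_key(dataset: str, day: date, filename: str) -> str:
    """Build a Hive-style partitioned object key (key=value path segments)."""
    return (
        f"{dataset}/year={day.year:04d}/month={day.month:02d}/"
        f"day={day.day:02d}/{filename}"
    )


# Example with placeholder names:
# partitioned_key("sales", date(2024, 3, 7), "part-0000.parquet")
# returns "sales/year=2024/month=03/day=07/part-0000.parquet"
```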
AWS QUICKSIGHT
Managed service for visualizing data held in different storage systems. It scales from 10 to 10,000 users with no servers to manage.
AWS Cloud Benefits
Separation between computing (EMR) and storage (S3) in Data Lakes, allowing each to scale and be paid for independently instead of provisioning a single cluster sized for both.
Ability to provision any number of Hadoop clusters of any size on demand, or to use Redshift for interactive analytics.
Ability to use Apache Hadoop ecosystem tools (Hive, Pig, Spark, Impala, etc.) or SQL queries with Redshift and Athena.
The infrastructure offers nodes optimized for compute, memory or networking, so clusters can be matched to the performance profile of each workload.
Access to data and execution of data processes are controlled with IAM roles for requests and security groups for network access.
Data remain highly available thanks to services like S3 and DynamoDB, which are backed by service level agreements.
Components can recover from service interruptions, using features such as automatic retries of failed job attempts.
Managed services such as S3, Athena and QuickSight require no management by an operations team.
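To make the role-based access mentioned in the list above more concrete, the following sketch builds an IAM policy document that grants read-only access to a single Data Lake bucket. The policy structure (`Version`, `Statement`, `Effect`, `Action`, `Resource`) follows the standard IAM JSON format. The helper name `read_only_policy` and the bucket name are placeholders, and a real deployment would likely need further statements.

```python
import json


def read_only_policy(bucket: str) -> dict:
    """Return an IAM policy document granting read-only access to one bucket."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Listing applies to the bucket itself.
                "Effect": "Allow",
                "Action": ["s3:ListBucket"],
                "Resource": f"arn:aws:s3:::{bucket}",
            },
            {
                # Reading applies to the objects inside the bucket.
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": f"arn:aws:s3:::{bucket}/*",
            },
        ],
    }


if __name__ == "__main__":
    # "example-data-lake" is a placeholder bucket name.
    print(json.dumps(read_only_policy("example-data-lake"), indent=2))
```

A policy like this would be attached to the role assumed by the processing job, while security groups separately restrict which network endpoints the cluster can reach.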