Learning More about AWS (Part 1) - Notes for Certified Cloud Practicioner Exam
I intreact with AWS a good amount - I use s3 for image/audio/video/other data storage and I use the Relational Database Service for the database for this site - so I am going to try to learn more about how it all works. I have learned a lot about computer networking by having to look up what is what when implementing AWS / Google Cloud services, but I still don't look forward to setting up another service, so by learning more about AWS, I hope that the task begins to seem less daunting to me.
What I Intend to Learn
- I hope to learn enough to earn the AWS Certified Cloud Practitioner and AWS Certified Developer - Associate certifications.
Things that I have Done with AWS / Things that I Need to DO
- What I Have Done
- I have created s3 buckets to store / retrieve images, audio files, video files, and geojson objects.
- I have used CloudFront to create edge location for these s3 buckets to make access to their objects quicker.
- I have created Lambda Functions that run on interaction with the s3 buckets.
- I have created a Relational Database Service for PostgreSQL that I use for this website.
- I have limited access to each of these buckets based on IP address, using CORS headers, and by restricting access to certain AWS users.
- What I Need to Do
- I need to create a few different things relating to images stored in the s3 bucket
- A Lambda function that appropriately checks for whether or not the image contains inappropriate content.
- A way for the images to be resized (width / height) on the server and then sent to the client / save a new version of the image in the s3 bucket
- I need to validate videos / audio stored in AWS
- I need to generate captions for video / audio in AWS
- I need to generate thumbnail for video on AWS
- I need to change the audio / video process for streaming video / audio
- This replaces the ffmpeg process that I am currently performing on the server
AWS Certified Cloud Practitioner Notes
Getting Started
- Amazon Web Services
- AWS has 200+ services
- Provides most (200+) services
- Reliable, secure and cost-effective
- Exam tests your decision making capabilities
- Which service do you choose in which situation
- Benefits of the Cloud
- On demand resource provisioning (also called Elasticity) - only provision resources when you need them and release the resources when no longer needed
- Trade capital expense for variable expense
- Benefit from massive economies of scale
- Stop guessing capacity
- Increase speed and agility
- Stop spending money running and maintaining data centers
- Go global in minutes
Regions and Zones
- AWS provides 20+ regions around the world (expanding every year)
- AWS Regions - Advantages
- Low Latency
- Global Footprint
- Adhere to government regulations
- High availability
- Availability Zones (AZs)
- Each AWS region consists of multiple, isolated, and physically separate AZs
- Availability Zones in a Region are connected through low-latency links
- Each availability zone:
- Can have one or more discrete data centers
- has redundant power, networking, and connectivity
- Increase availability and fault tolerance of applications in the same region
- Achieve high availability and greater fault tolerance
EC2 Fundamentals
- Where do you deploy applications in AWS?
- EC2 instances - virtual servers in AWS
- EC2 service - Provision EC2 instances or virtual servers
- EC2 Features
- Create and manage the lifecycle of EC2 instances
- Attach storage (& network storage) to your ec2 instance
- Manage network connectivity to your EC2 instance
- Load balancing and auto scaling for multiple EC2 instances
Useful Commands:
sudo su
yum update -y
yum install httpd
systemctl start httpd
systemctl enable httpd
echo "Hello World 2" > /var/www/html/index.html
EC2 Concepts - Amazon Machine Image (AMI)
- What operating system and what software do you want on the instance?
- Three AMI sources:
- Provided by AWS
- AWS Market Place: Online store for customized AMIs. Per hour billing.
- Customized AMIs: Created for you
EC2 Concepts - Instance Families
- Optimized combination of compute (CPU, GPU), memory, disk (storage) and networking for specific workloads
- 270+ instances across 40+ types for different workloads
- m (m4, m5, m6) - General Purpose
- c (c4, c5, c5n) - Compute Optimized
- r (r4, r5, r5a, r5n) -Memory (RAM) optimized
- i (i3) - Storage (I/O) Optimized
- g (g3, g4) - GPU Optimize - Graphics Processing
EC2 Concepts - Security Groups
- Virtual firewall to control incoming and outgoing traffic to/from AWS resources (EC2 instances, databases, etc.)
- Provides additional layer of security - Defense in Depth
Security Group Rules
- Default deny - If there are no rules configured, no outbound/inbound traffic is allowed
- Allows allow rules ONLY
- Separate rules for inbound and outbound traffic
EC2 Security - Key Pairs
- EC2 uses public key cryptography for protecting login credentials
- Key pair - public key and private key
- Public key is stored in EC2 instance
- Private key is stored by the customer
EC2 IP Addresses
- Public IP addresses are internet addressable.
- Private IP addresses are internal to a corporate network.
- You CANNOT have two resources with same public IP address.
- HOWEVER, two different corporate networks CAN have resources with the same IP privileges
- All EC2 instances are assigned private IP addresses
- (Remember) When you stop an EC2 instance, public IP address is lost
Elastic IP Addresses
- How do you get a constant public IP address for an EC2 instance?
- Quick and dirty way is to use an Elastic IP
- Elastic IP can be switched to another EC2 instance within the same region.
- Elastic IP can be switched to another EC2 instance within the same region
- Elastic IP remains attached even if you stop the instance. You have to manually attach it.
- Remember: You are charged for an Elastic IP address when you are NOT using it. Make sure that you explicitly release an Elastic IP when you are not using it.
IAAS (Infrastructure as a Service)
- use only infrastructure from a cloud provider
- Computers (virtual or on dedicated hardware), data storage space and Networking features
- Also called "Lift and Shift"
- Cloud Provider is responsible for:
- Physical Infrastructure (Hardware, Networking)
- Virtualization Layer (Hypervisor, Host OS)
- Customer is responsible for:
- Guest OS upgrades and patches
- Application Code and Runtime
- Availability, Fault Tolerance, Scalability etc.
PAAS (Platform as a Service)
- Use a platform provided by cloud
- Cloud Provider is responsible for:
- OS (incl. upgrades and patches)
- Application Runtime
- Auto scaling, Availability & Load balancing etc...
- Customer is responsible for:
- Application code and/or
- Configuration
AWS Managed Service Offerings
- Elastic Load Balancing - Distribute incoming traffic across multiple targets
- AWS Elastic Beanstalk - Run and Manage Web Apps
- Amazon RDS - RDS - MySQL, Oracle, SQL Server, etc.
- And a lot more...
Elastic Load Balancer
- Distribute traffic across EC2 instances in one or more AZs in a single region
- Managed service - AWS ensures that it is highly available
- Auto scales to handle huge loads
- Load Balancers can be public or private
- Health checks - route traffic to healthy instances
Three Types of Elastic Load Balancers
- Classic Load Balancer (Layer 4 and Layer 7)
- Old generation supporting Layer 4 (TCP/TLS) and Layer 7(HTTP/HTTPS) protocols
- Not recommended by AWS
- Application Load Balancer (Layer 7)
- Most popular and frequently used ELB in AWS
- New generation and supporting from HTTP/HTTPS
- Supports advanced routing approaches (Headers, Query Params, Path and Host Based)
- Network Load Balancer (Layer 4)
- New generation supporting TCP/TLS and UDP
- Very high performance use cases
Availability
- Are the applications available when the users need them?
- Percentage of time an application provides the operations expected of it
- Example: 99.9% availability. Also called four 9's availability
Availability Basics - EC2 and ELB
- Deploy to multiple AZs
- Deploy to multiple regions
Scalability
- A system is handling 1000 transactions per second. Load is expected to increase 10 times in the next month
- Can we handle a growth in users, traffic, or data size without any drop in performance?
- Does ability to serve more growth increase proportionally with resources?
- Ability to adapt to changes in demand (users, data)
- What are the options that can be considered?
- Deploy to bigger instances with bigger CPU and more memory
- Increase the number of application instances and setup a load balancer
- and a lot more...
Vertical Scaling
- Deploying application / database to bigger instance:
- A larger hard drive
- A faster CPU
- More RAM, CPU, I/O. or networking capabilities
- There are limits to vertical scaling
Vertical Scaling for EC2
- Increasing EC2 instance size:
- t2.micro to t2.small or
- t2.small to t2.xlarge or
- ...
Horizontal Scaling
- Deploying multiple instances of application / database
- (Typically but not always) Horizontal Scaling is preferred to Vertical Scaling:
- Vertical Scaling has limits
- Vertical Scaling can be expensive
- Horizontal Scaling increases availability
- (BUT) Horizontal Scaling needs additional infrastructure
- Load Balancers (etc.)
Horizontal Scaling for EC2
- Distribute EC2 instances
- in a single AZ
- in multiple AZs in single region
- in multiple AZs in multiple regions
- Auto Scale: Auto Scaling Group
- Distribute Load: Elastic Load Balancer
EC2 Tenancy - Shared vs Dedicated
- Shared Tenancy (Default)
- Single host machine can have instances from multiple customers
- EC2 Dedicated Instances
- Virtualized instances on hardware dedicated to one customer
- You do NOT have visibility into the hardware of the underlying host
- EC2 Dedicated Hosts
- Physical severs dedicated to one customer
- You have visibility into the hardware of underlying host (sockets and physical cores)
- (Use cases) Regulatory needs or server bound software licenses like Windows Server, SQL server
EC2 Pricing Models Overview
Pricing Model | Description | Details |
---|---|---|
On demand | Request when you need it | Flexible and Most Expensive |
Spot | Quote the maximum price | Cheapest (up to 90% off) BUT NO Guarantees |
Reserved | Reserve ahead of time | Up to 75%off. 1 or 3 years reservation |
Savings Plan | Commit spending $X per hour on (EC2 or AWS Fargate or Lambda) | Up to 66% off. No restrictions. 1 or 3 years reservation |
EC2 On-Demand
- On demand resource and provisioning - Use And Throw!
- Highest cost and highest flexibility
- This is what we have been using until now in this course
- Ideal for:
- A web application which receives spiky traffic
- A batch program which has unpredictable runtime and cannot be interrupted
- A batch program being moved from on-premises to cloud for the first time
EC2 Spot Instances
- (Old Model) Bid a price. Highest bidder wins
- (New Model) Quote your maximum price. Prices decided by long term trends
- Up to 90% off (compared to On-Demand)
- Can be terminated with a 2 minute notice
- Ideal for Non time-critical workloads that can tolerate interruptions (fault-tolerant)
- A batch program that does not have a strict deadline AND can be stopped at short notice and re-started
EC2 Reserved Instances
- Reserve EC2 instances ahead of time!
- Get up to 75% OFF!
- Payment Models:
- No upfront - $0 upfront. Pay monthly installment
- Partial Upfront. $XYZ upfront. Pay monthly installment
- All Upfront - Full amount upfront. $0 monthly installment
- Cost wise: Earlier you pay, more the discount. All Upfront < Partial Upfront < No Upfront
- A difference up to 5%
EC2 Savings Plans
- EC2 Compute Savings Plans
- Commitment: I would spend X dollars per hour on AWS compute resources (Amazon EC2 instances, AWS Fargate, and/or AWS Lambda) for a 1 or 3 year period
- Up to 66% off (compares to on demand instances)
- Provides complete flexibility:
- You can change instance family, size, OS, tenancy or AWS Region of your Amazon EC2 instances
- You can switch between Amazon EC2, AWS Fargate and/or AWS Lambda
- EC2 Instance Savings Plans
- Commitment: I would spend X dollars per hour on Amazon EC2 instances of a specific instance family (General Purpose, for example) within a specific region (us-east-1, for example)
- Up to 72% off (compared to on demand instances)
- You can switch operating systems (Windows to Linux, for example)
EC2 Pricing Models Overview
Pricing Model | Use Cases |
---|---|
On Demand | Spiky Workloads |
Spot | Cost sensitive, Fault tolerant, Non immediate workloads |
Reserved | Constant workloads that run all the time |
Savings Plans | Constant workloads that run all the time and you want more flexibility |
AWS Elastic BeanStalk
- Next level of Platform as a Service
- Simplest way to deploy and scale your web applications in AWS
- Provides end-to-end web application management
- Supports JAVA, .NET, Node.js, PHP, Ruby, Python, Go, and Docker Applications
- No usage charges - Pay for AWS Resources provisioned
- Features:
- Automatic Load Balancing
- Auto scaling
- Managed platform updates
- Application health monitoring
AWS Elastic BeanStalk Concepts
- Application - A container for environments, versions and configuration
- Application Version - A Specific version of deployable code (stored in s3)
- Environment - An application version deployed to AWS resources. You can have multiple environments running different application versions for the same application
Auto Scaling Components
- Launch Configuration / Template (What?)
- EC2 instances size and Amazon Machine Image
- Auto Scaling Group (Where?)
- Min, max and desired size of ASG
- Health checks
- Auto Scaling Policies (When?)
- When and How to execute scaling
Dynamic Scaling Policy Types
Scaling Policies - Background
- Two Parts:
- CloudWatch alarm (Is CPU utilization >80%? or <60%)
- Scaling action (+5 EC2 instances or -3 EC2 instances)
Serverless
- What if we do not need to worry about servers and focus on building our application?
- Enter Serverless
- Remember: Serverless does not mean "No Servers"
- Serverless for me:
- You don't worry about infrastructure
- Flexible scaling
- Automated High Availability
- Pay for use:
- You don't have to provision servers or capacity
- You focus on code and the cloud managed service takes care of all that is needed to scale you code to serve millions of requests
AWS Lambda
- You don't worry about servers or scaling or availability
- You don't worry about your code
- You pay what you use
- Number of requests
- Duration of requests
- Memory consumed
AWS Lambda - Supported Languages
- Java
- Go
- Powershell
- Node.js
- C#
- Python
- Ruby
- and a lot more...
AWS Lambda Event Sources
- Amazon SPU Gateway
- AWS Cognito
- Amazon DynamoDB (event)
- Amazon CloudFront (Lambda@Edge)
- AWS Step Functions
- Amazon Kinesis (event)
- Amazon Simple Storage Service
- Amazon Simple Queue Service (event)
- Amazon Simple Notification Service
- The list is endless...
Other Compute Services
- Amazon Lightsail
- Use case 1: Pre-configured development stacks like LAMP, Nginx, MEAN, and Node.js
- Use case 2: Run websites on WordPress, Magneto, Plesk, and Joomla
- Low, predictable monthly price
- AWS Batch
- Use Case: Run batch computing workloads on AWS
- Use Amazon EC2 and Amazon EC2 Spot Instances
Storage
I am thinking about making it such that all my projects use the same s3 buckets for image / audio / video / font - Need to pay attention to bucket performance vs size
Amazon S3 (Simple Storage Service)
- Most popular, very flexible, and inexpensive storage service
- Store large objects using a key-value approach
- Also called Object Storage
- Provides REST API to access and modify objects
- Provides unlimited storage
- (S3 storage class) 99.99% availability & (11 9's - 99.999999999) durability
- Objects are replicated in a single region (across multiple AZs)
- Store all file types - text, binary, backup, and archives:
- Media files and archives
- Application packages and logs
- Backups of your databases or storage devices
- Staging data during on-premise to cloud database migration
Amazon S3 - Objects and Buckets
- Amazon S3 is a global service. Not associated with a region.
- However, a bucket is created in specific AWS region.
- Objects are stored in buckets
- Bucket names are globally unique
- Bucket names are used as part of object URLs => Can contain ONLY lower case letters, numbers hyphens and periods
- Unlimited objects in a bucket
- Each Object is identified by a key value pair
Amazon S3 Storage Classes - Introduction
- Different kinds of data can be stored in Amazon s3
- Media files and archives
- Application packages and logs
- Backups of your databases or storage devices
- Long term archives
- Huge variations in access patterns
- Trade-off between access time and cost
- S3 storage classes help to optimize your costs while meeting access time needs
Amazon S3 Storage Classes
Amazon S3 Storage Classes - Comparison
Amazon S3 Cost
- Important pricing elements:
- Cost of storage (per GB)
- (If Applicable) Retrieval Charge (per GB)
- Monthly tiering fee (Only for Intelligent Tiering)
- Data of Transfer fee
- FREE of Cost:
- Data transfer into S3
- Data transfer from Amazon S3 to Amazon CloudFront
- Data transfer from Amazon S3 to services in the same region
Amazon S3 Glacier
- In addition to existing as a S3 Storage Class, S3 Glacier is a separate AWS Service on its own
- Extremely low cost storage for archives and long-term backups:
- Old media ontent
- Archives to meet regulatory requirements (old patient records etc.)
- As a replacement for magnetic tapes
- High durability (11 9s 99.99999999%)
- High scalability
- High security (encrypted at rest and in transfer)
Amazon S3 vs S3 Glacier
Storage Types - Block Storage and File Storage
- What is the type of storage of your hard disk?
- Block storage
- You've created a file to share a set of files with your colleagues in an enterprise. What type of storage are you using?
- File storage
Block Storage
- Use case: Hard -disks attached to you computers
- Typically, ONE block storage device can be connected to ONE virtual server
File Storage
- Media workflows need huge shared storage for supporting processes like video editing
- Enterprise users need a quick way to share files in a secure and organized way
- These file shared are shared by several virtual servers
AWS Block Storage and File Storage
- Block Storage
- Amazon Elastic Block Store (EBS)
- Instance Store
- File Storage
- Amazon EFS (for Linux Instances)
- Amazon FSx Windows File Servers
- Amazon FSx for Lustre (high performance use cases)
EC2- Block Storage
- Two popular types of block storage can be attached to EC2 instances:
- Elastic Block Store (EBS)
- Instance Store)
- Instance Stores are physically attached to the EC2 instance
- Temporary data
- Lifecycle ties to EC2 instance
- Elastic Block Store (EBS) is a network storage
- More durable
- lifecycle not tied to EC2 instance
Instance Store
- Physically attached to your EC2 instance
- Ephemeral Storage
- Temporary data
- Data is lost when hardware fails or an instance is terminated
- Use case: cache or scratch files
- Lifecycle is tied to EC2 instance
- Only some of the EC2 instance types support Instance Store
Instance Store - Advantages and Disadvantages
- Advantages
- very fast I/O (2-100X of EBS)
- (Cost Effective) No extra cost. Cost is included in the cost of EC2 instance
- Ideal for storing temporary information - cache, scratch files, etc.
- Disadvantages
- Slow boot up (up top 5 minutes)
- Ephemeral storage (data is lost when hardware fails or instance is terminated)
- CANNOT take a snapshot or restore from snapshot
- File size based on instance type
- You cannot detach and attach it to another EC2 instance
Amazon Elastic Block Store (EBS)
- Network block storage attached to you EC2 instance
- Provisioned capacity
- Very flexible
- increase size when you need it - when attached to EC2 instance
- Independent lifecycle from EC2 instance
- Attach/Detach from one EC2 instance to another
- 99.999% Availability and replicated within the same AZ
- Use Case: Run your custom database
Amazon EBS vs Instance Store
Hard Disk Drive vs Solid State Drive
Amazon EFS
- Petabyte scale, Auto scaling, Pay for use shared file storage
- Compatible with Amazon EC2 Linux-based instances
- (Use cases) Home directories, file share, content management
- (Alternative) Amazon FSx for Lustre
- File system optimized for performance
- High performance computing (HPC) and media processing use cases
- Automatic encryption at-rest and in-transit
- (Alternative) Amazon FSx Windows and File Servers
- Fully managed Windows file servers
- Accessible from Windows, Linux, and MacOS instances
- Integrated with Microsoft Active Directory (AD) to support Windows-based environments and enterprises
- Automatic encryption at-rest and in-transit
Review of Storage Options
AWS Storage Gateway
- Hybrid storage (cloud + on premise)
- Unlimited cloud storage for on-premise software applications and users with good performance
- (Remember) Storage Gateway and S3 Glacier encrypt data by default
- Three Options
- AWS Storage File Gateway
- AWS Storage Tape Gateway
- AWS Storage Volume Gateway
Amazon Storage File Gateway
- Problem Statement: Large on-premise file share with terabytes of data
- Users put files into file share and applications use the files
- Managing it is becoming expensive
- Move the file share to cloud without performance impact
- AWS Storage File gateway provides cloud storage for your file shares
- Files stored in Amazon S3 & Glacier
AWS Storage Tape Gateway
- Tape backups used in enterprises (archives)
- Stored off-site - expensive, physical wear and tear
- AWS Storage Tape Gateway - Avoid physical tape backups
- No charge needed for tape backup infrastructure
- Backup data to virtual tapes (actually, Amazon S3 and Glacier)
AWS Storage Volume Gateway
- Volume Gateway: Move block storage to cloud
- Automate backup and disaster recovery
- Use cases: Backup and disaster recovery, Migration of application data
- (Option 1) Cached (Gateway Cached Volumes):
- Primary Data Store - AWS - Amazon S3
- On-premise cache stores frequently accessed data
- (Option 2) Stored (Gateway Stored Volumes):
- Primary Data Store - On-Premises
- Asynchronous copy to AWS
- Stored as EBS snapshots
AWS Storage Gateway - Summary
- Key to look for: Hybrid Storage (cloud + on premise)
- File Share moved to cloud => AWS Storage File Gateway
- Tape Backups on cloud (Block Storage) => AWS Storage Volume Gateway
- High Performance => Stored
- Otherwise => Cached
Databases
Databases Primer
- databases provide organized and persistent storage for your data
- To choose between different database types, we would need to understand
- Availability
- Durability
- Consistency
- Transactions etc.
- Let's get started on a simple journey to understand these
- Database Snapshots = making a copy of your database at a point in time
- Database Transaction Logs - Let's add transaction logs to database and create a process to copy it over to the second data center
- Standby Datacenter - a DB center you can switch to if your first database goes down
Availability and Duration
- Availability
- Will I be able to access my data when I need it?
- Percentage of time an application provides the operations expected of it
- Durability
- Will my data be available after 10 or 100 or 1000 years?
- Examples of measuring availability and durability:
- 4 9s- 99.99%
- 11 9s - 9.9999999%
- Typically, an availability of four 9's is considered very good
- Typically, a durability of eleven 9's is considered very good
- Typical online apps aim for 99.99% availability
Increasing Availability and Durability of Databases
- Increasing Availability
- Having multiple standbys available
- in multiple AZs
- in multiple Regions
- Increasing Durability
- Multiple copies of data (standby, snapshots, transaction logs and replicas)
- in multiple AZs
- in multiple regions
- Replicating data comes with its own challenges
Database Terminology: RTO and RPO
- RPO
- Recovery Point Objective = maximum acceptable period of database loss
- RTO
- Recovery Time Objective - maximum acceptable downtime
- Achieving minimum RTO and RPO is expensive
- Trade-off based on the criticality of the data
Database Read Replicas
- Databases that only read data from the main database
- Reduces load on master databases
Consistency
- How do we make sure that data in multiple database instances is updated simultaneously?
- Strong consistency
- Synchronous replication to all replicas
- Will be slow if you have multiple replicas or standbys
- Eventual Consistency
- Asynchronous replication. A little lag - few seconds - before change is available in all replicas
- In the intermediate period, different replicas might return different values
- Used when scalability is more important than data integrity
- Examples: Social Media Posts - Facebook status messages, Twitter tweets, LinkedIn posts, etc.
- Read-after-write Consistency
- Inserts are immediately available. Updates and deletes are eventually consistent
- Amazon S3 provides read-after-write consistency
Database Categories
- There are several categories of databases
- Relational, Document, Key Value, Graph, etc.
- Choosing type of database for your use case is not easy. factors:
- Do you want a fixed schema?
- Do you want flexibility in defining and changing your schema?
- What level of transaction properties do you need?
- What kind of latency do you want?
- How many transactions do you expect?
- How much data will be stored?
Relational Databases
- Only option until recently
- Most popular (or unpopular) type of databases
- Predefined schema - tables & relationships
- Supports Complex SQL Queries
- Very strong transactional capabilities
- Used for:
- OLTP (Online transaction processing) use cases
- OLAP (Online Analytics Processing) use cases
- Application where large number of users make large number of small transactions
- small data reads, updates, and deletes
- Use Cases: Most traditional applications, ERP, CRM, e-commerce, banking applications
- Popular Databases: MySQL, Oracle, SQL Server, etc.
Amazon RDS
- Amazon RDS is a managed relational database service for OLTP use cases
- Amazon RDS Features:
- Multi-AZ deployment (standby in another AZ)
- Read replicas
- Same AZ
- Multi AZ
- Cross Region (Availability++)
- Storage Auto scaling (up to a configured limit)
- Automated backups (restore to point in time)
- Manual snapshots
- Amazon RDS - You vs AWS
- AWS is responsible for
- Availability (according to your configuration)
- Durability
- Scaling (according to your configuration)
- Maintenance (patches)
- Backups
- You are responsible for
- Managing database users
- App optimization (tables, indexes, etc.)
- You CANNOT
- SSH into database EC2 instances or setup custom software (NOT ALLOWED)
- Install OS or DB patches. RDS takes care of them (NOT ALLOWED)
- Amazon RDS - When to Use
- Use Amazon RDS for transactional applications needing
- Pre-defined schema
- Strong transactional capabilities
- Complex Queries
- Amazon RDS is NOT recommended when
- You need highly scalable massive read/write operations - for example millions of writes/second
- Go for DynamoDB
- When you want to upload files using simple GET/PUT REST API
- Go for Amazon S3
- When you need heavy customizations for your database or need access to underling EC2 instances
- Go for custom database installation
Amazon Aurora
- MySQL and PostgreSQL-compatible
- 2 copies in a minimum of 3 AZ
- Provides "Global Database" option
- Up to five read only, secondary AWS regions
- Low Latency for global reads
- Safe from region-wide outages
- Minimal lag time, typically less than 1 second
Relation Database - OLAP (Online Analytics Processing)
- Applications allowing users to analyze petabytes of data
- Examples: Reporting applications, Data warehouses, Business intelligence applications, Analytics systems
- Sample application: Decide insurance premiums analyzing data from last hundred years
- Data is consolidated from multiple (transactional) databases
OLAP vs OLTP
- OLAP and OLTIP use similar data structures
- BUT very different approach in how data is stored
- OLTP databases use row storage
- Each table row is stored together
- Efficient for processing small transactions
- OLAP databases use columnar storage
- Each table column is stored together
- High compression - store petabytes of data efficiently
- Distribute data - one table in multiple cluster nodes
- Execute single query across multiple nodes - Complex queries can be executed efficiently
Amazon Redshift
- Redshift is a relational database (tables and relationships)
- What is the need for another relational database?
- RDS is optimized for online transaction processing
- RDS is optimized to provide a balance between both reads and write operations
- (However) OLAP workloads have exponentially larger reads on the database compared to writes
- Can we use a different approach to design the database?
- How about creating a cluster and splitting the execution of the same queries across several nodes>
- Redshift is a petabyte-scale distributed data warehouse based on PostgreSQL
- Three important characteristics of Redshift:
- Massive parallel processing (MPP) - storage and processing can be split across multiple nodes
- Columnar data storage
- High data compression
- As a result
- A single row of data might be stored across multiple nodes
- A query to Redshift leader node is distributed to multiple compute nodes for execution
- Start with a single node configuration and scale to multi node configuration
- You can dynamically add and remove nodes
- Used for traditional ETL (Extract, Transform, Load), OLAP and Business Intelligence (BI) use cases
- Optimized for high performance analysis and reporting of very large datasets
- Supports standard SQL
- Integration with data loading, reporting, mining and analytics tools
Amazon EMR - Elastic MapReduce
- Managed Hadoop service with high availability and durability
- EMR gives access to underling OS => You can SSH into it
- Important tools in Hadoop eco system are natively supprted:
- Examples: Pig, Hive, Spark, or Presto
- install Others using bootstrap actions
- Use Cases:
- Log Processing for insights
- Click stream analysis for advertisers
- Genomic and life science dataset processing
Amazon Redshift Spectrum
- Run SQL queries against datasets in Amazon S3
- Does need for any intermediate data stores
- Auto scales based on your queries
- Scale storage and compute independently
- Eliminate expensive data transfers from S3 to data warehousing solutions (Cost Effective)
- Query against Amazon EMR (as well)
Document Databases
- Structure data the way your application needs it
- Create one table instead of dozens
- Quickly evolving semi structured data (schema-less)
- Easily distributable
- Advantages: (Horizontally) Scalable to terabytes of data with millisecond responses up to millions of transaction per second
- Use cases: Content management, catalogs, user profiles
Key-Value:
- Use a simple key-value pair to store data. Key is a unique identifier.
- Values can be objects, compound objects, or simple data values
- Advantages: (Horizontally) Scalable to terabytes of data with millisecond responses up to millions of transactions per second
- Use cases: shopping carts, session stores, gaming applications, and very high web traffic apps
Amazon DynamoDB
- Fast, scalable, distributed for any scale
- Flexible NoSQL Key-value & document database (schemeless)
- Single digit millisecond responses for millions or durability
- Automatically partitions data as it grows
- Maintains 3 replicas within the same region
- No need to provision a database
- Create a table and configure read and write capacity (RCU and WCU)
- Automatically scales to meet your RCU and WCU
- Provides an expensive serverless mode
- Use cases: User profiles, shopping carts, high volume read and write applications
DynamoDB Tables
- Hierarchy: Table > item(s) > attribute (key value pair)
- Mandatory primary key
- Other than the primary key, tables are schemaless
- No need to define the other attributes or types
- Each item in a table can have distinct attributes
- Max 400 kB per item in a table
- Use S3 for large objects and DynamoDB for smaller objects
In-memory Databases (or Caches)
- Retrieving data from memory is much faster from retrieving data from disk
- You can speed up dynamic database-driven websites by caching data and objects in memory (Ex. Memcached)
- You can delivery microsecond latency by storing some persistent data in memory (Ex. Redis)
- Use Cases: Caching, session management, gaming leader boards, geospatial applications
Amazon ElastiCache
- Highly scalable and low latency in-memory data store
- Used for distributed caching
- (Option 1) ElastiCache Memcached
- Low maintenance caching solution
- Easy horizontal scaling with auto discovery
- Use case: Speed up database-driven websites by caching data
- (Option 2) ElastiCache Redis:
- Persistence
- Advanced Features:
- Pub-sub
- Read Replicas and Failover
- Encryption
- Use cases: gaming leaderboards, queues, real time analytics
Database Type | AWS Service | Description |
---|---|---|
Relational OLTP Databases | Amazon RDS | Row Storage |
Relational OLAP Databases | Amazon Redshift | Columnar Storage |
Document and Key Databases | Amazon DynamoDB | Apps needing quickly evolving semi structured data (schema-less) |
Graph Databases | Amazon Neptune | Store and navigate data with complex relationships |
In memory databases / caches | Amazon ElastiCache | Applications needing microsecond responses Memcached - simple caches |
Other Storage Services
- Amazon DocumentDB
- Managed document database service
- Compatible with MongoDB
- Amazon Keyspaces
- Managed service for Apache Cassandra
- Serverless (Pay for use)
- AWS Backup
- Centrally manage and automate backup across AWS services
- Automate backup compliance and monitoring
Networking
Need for Amazon VPC
- In a corporate network or an on-premises data center:
- Can anyone on the internet see the data exchange between the application and the database?
- No
- Can anyone from internet directly connect to your database?
- Typically NO
- You need to connect to your corporate network and then access your applications or databases
- Corporate network provides a secure internal network protecting your resources, data and communication from external users
- How do you create your own private network in the cloud
- Enter Virtual Private Cloud (VPC)
Amazon VPC
- Your own isolated network in AWS cloud
- Network traffic within a VPC is isolated (not visible) from all other Amazon VPCs
- You control all the traffic coming in and outgoing a VPC
- (Best Practice) Create all your AWS resources (compute, storage, databases etc.) within a VPC
- Secure resources from unauthorized access AND
- Enable secure communication between your child resources
Need for VPC Subnets
- Different resources are created on cloud - databases, compute (EC2) etc.
- Each type of resource has its own access needs
- Public Elastic Load Balancers are accessible from internet (public resources)
- Databases or EC2 instances should NOT be accessible from internet
- ONLY applications within your network (VPC) should be able to access them (private resources)
- How do you separate public resources from private resources inside a VPC?
VPC Subnets
- (Solution) Create different subnets for public and private resources
- Resources in a public subnet CAN be accessed from internet
- Resources in a private subnet CANNOT be accessed from internet
- BUT resources in public subnet can talk to resources in private subnet
- Each VPX is created in a Region
- Each Subnet is created in an Availability Zone
- Example: VPC us-east-1 => Subnets - AZs us-east-1a or us-east-1b
Routing on the Internet
- You have an IP address of a website you want to visit
- There is no direct connection from your computer to the website
- Internet is actually a set of routers routing traffic
- Each router has a set of rule that help it decide the path to the destination OP address
Routing Inside AWS
- In AWS, route tables are used for routing
- Route tables can be associated with VPCs and subnets
Destination | Target |
---|---|
172.31.0.0/16 | Local |
0.0.0.0/0 | igw-1234567 |
- Each route table consists of a set of rules called routes
- Each route or routing rule has a destination and target
- What Range of Addresses should be routed to which target resource?
- Rule 1 (Above) - Route requests to VPC CIDR 172.31.0.0/16 (172.31.0.0 to 172.31.255.255) to local resources within the VPC
- Rule 2 - Route all other IP addresses (0.0.0.0/0) to internet (internet gateway)
Public Subnet vs Private Subnet
- Public Subnet
- Communication allowed from subnet to internet
- Communication allowed from internet to subnet
- Private Subnet
- Communication NOT allowed from internet to subnet
Name | Destination | Target | Explanation |
---|---|---|---|
Rule 1 | 172.31.0.0/16 | Local | Local Routing |
Rule 2 | 0.0.0.0/0 | igw-1234567 | Internet Routing |
- An internet gateway enables internet communication for subnets
- Any subnet which has a route to an internet gateway is called a public subnet
- Any subnet which DOES NOT have route to an internet gateway is called a private subnet
Network Address Translation (NAT) Instance and Gateway
- How do you allow instances in a private subnet to download software updates and security patches while denying inbound traffic from internet?
- How do you allow instances in a private subnet to connect privately to other AWS Services outside the VPC?
- Three Options:
- NAT Gateway: Managed service
- NAT Instance: Install a EC2 instance with a specific NAT AMI and configure as a gateway
- Egress-Only Internet Gateways: For IPv6 subnets
Network Access Control List
- Security group control traffic to a specific resource in a subnet
- How about stopping traffic from even entering the subnet?
- NACL provides stateless firewall at a subnet level.
- Each subnet must be associated with a NACL
- Default NACL allows all inbound and outbound traffic
- Custom created NACL denies inbound and outbound traffic by default
- Rules have a priority number
- Lower Number => Higher Priority
Security Group vs NACL
Feature | Security Group | NACL |
---|---|---|
Level | Assigned to a specific instance(s)/resource(s) | Configured for a subnet. Applied to traffic to all instances in a subnet |
Rules | Allow rules only | Both allow and deny rules |
State | Stateful. Return traffic is automatically allowed. | Stateless. You should explicitly allow return traffic. |
Evaluation | Traffic allowed if there is a matching rule. | Rules are prioritized. Matching rule with highest priority wins. |
VPC Flow Logs
- Monitor network traffic
- Troubleshoot connectivity issues (NACL and/or security groups misconfiguration)
- Capture traffic going in and out of your VPC (network interfaces)
- Can be crated for:
- a VPC
- a subnet
- Publish logs to Amazon CloudWatch Logs or Amazon S3
- Flow log record contain ACCEPT or REJECT
- Is traffic permitted by security groups or network ACLs?
VPC Peering
- Connect VPCs belonging to same or different AWS accounts irrespective of the region of the VPCs
- Allows private communication between the connected VPCs
- Peering uses a request/accept protocol
- Owner of requesting VPC sends a request
- Owner of the peer VPC has one week to accept
AWS and On-Premises - Overview
- AWS Managed VPN
- IPsec VPN tunnels from VPC to customer network
- AWS Direct Connect (DX)
- Private dedicated network connection from on-premises to AWS
AWS Managed VPN
- IP Sec VPN tunnels from VPC to customer network
- Traffic over internet - encrypted using IPsec protocol
- VPN gateway to connect one VPX to customer network
- Customer gateway installed in customer network
- You need an Internet routable IP address of customer gateway
AWS Direct Connect (DC)
- Private dedicated network connection from on-premises to AWS
- Advantages:
- Private network
- Reduce your (ISP) bandwidth costs
- Consistent Network performance because of the private network
- (REMEMBER) Establishing DC connection can take more than a month
Service | Example Use Case | Explanation |
---|---|---|
AWS Local Zones | A gaming company providing seamless online multiplayer experience in a specific city | Extend AWS infrastructure to metro areas closer to end-users for single0digit millisecond latency. Ideal for real-time user engagement like online gaming. |
AWS Outposts | A healthcare organization processing patient data with regulatory requirements to keep data on-premises. | Bring native AWS services, infrastructure, and operating models to on-premises. Suitable for workloads with regulatory or data residency needs. |
AWS Wavelength | Developing an augmented reality app for mobile devices requiring real-time edge data processing. | Embed AWS infrastructure in mobile service providers data centers at the 5g network edge. Enables ultra-low latency and high bandwidth applications for mobile and connected devices. |
VPC - Revies
- VPC: Virtual Network to protect resources and communication from the outside world.
- Subnet: Separate private resources from public resources
- Internet Gateway: Allows Public Subnets to connect/accept traffic to/from internet
- NAT Gateway: Allow internet traffic from private subnets
- VPC Peering: Connect one VPC with other VPCs
- VPC Flow logs: Enable logs to debug problems
- AWS Direct Command: Private pipe from AWS to on-premises
- AWS VPN: Encrypted (IPsec) tunnel over internet to on-premises
IAM - Fundamentals
Typical Identity Management in the Cloud
- You have resources in the cloud (examples - a virtual server, a database)
- You have identities (human and non-human) that need access to those resources and perform actions
- For example: launch (stop, start, or terminate) a virtual server
- How do you configure resources they can access?
- How can you configure what actions to allow?
- In AWS, Identity and Access Management (IAM) provides this service
AWS Identity and Access Management (IAM)
- Authentication (is it the right user?) and
- Authorization (do they have the right access?)
- Identities can be
- AWS users
- Federated users (externally authenticated users)
- Provides very granular controil
- Limit a single user
- to perform single action
- on a specific AWS resource
- from a specific IP address
- during a specific time window
Important IAM Concepts
- IAM users: Users created in an AWS account
- Has credentials attached (name/password or access keys)
- IAM Groups: Collection of IAM Users
- Roles: Temporary Identities
- Does not have credentials attached
- (Advantage) Expire after a set period of time
Data Encryption KMS and Cloud HSM
Data States
- Data at rest: Stored on a device or a backup
- Examples: data on a hard disk, in a database, backups and archives
- Data in motion: Being transferred across a network
- Also called data in transit
- Examples:
- Data copied from on-premise to cloud storage
- An application in a VPC talking to a database
- Two Types:
- In and out of AWS
- Within AWS
- Data in use: Active data processed in a non-persistent state
- Example: Data in your RAM
Encryption
- If you store data as is, what would happen id an unauthorized entity gets access to it?
- Imagine losing an unencrypted hard disk
- First law of security: Defense in Depth
- Typically, enterprises encrypt all data
- Data on your hard disks
- Data in your databases
- Data on your file servers
- Is it sufficient if you encrypt data at rest?
- No. Encrypt data in transit - between application to database as well.
Systematic Key Encryption
- Symmetric encryption algorithms use the same key for encryption and decryption
- Key Factor 1: Choose the right encryption algorithm
- Key Factor 2: How do we secure the encryption key?
- Key Factor 3: How do we share the encryption key?
Asymmetric Key Encryption
- Two Keys: Public Key and Private Key
- Also called Public Key Cryptography
- Encrypt data with Public Key and decrypt with Private Key
- Share Public Key with everybody and keep the Private Key with you
- No crazy questions:
- Will somebody not figure out private key using the public key
- How do you create Asymmetric Keys?
KMS and Cloud HSM
- How do you generate, store, use, and replace your keys?
- AWS provides two important services - KMS and Cloud HSM
- Manage your keys
- Perform encryption and decryption
Amazon KMS
- Create symmetric and manage cryptographic keys (symmetric and asymmetric)
- Control their use in your applications and AWS Services
- Defined key usage permissions (including cross account access)
- Track key usage in AWS CloudTrail (regulations and compliance)
- Integrates with almost all AWS services that need data encryption
- Automatically rotate master keys once a year
- No need to re-encrypt previously encrypted data (versions of master key are maintained)
- Schedule key deletion to verify if the key is used
- Mandatory minimum wait period: 7 days (max-30days)
AWS CloudHSM
- Managed (highly available and auto scaling) dedicated single tenant Hardware Security Module (HSM) for regulatory compliance
- (Remember) AWS KMS is a multi-tenant service
- FIPS 140-2 Level 3 compliant
- AWS CANNOT access your encryption master keys in CloudHSM
- In KMS, AWS can access your master keys
- Be ultra safe with your keys when you are using CloudHSM
- (Recommendation) Use two or more HSMs in separate AZs in a production cluster
AWS CloudHSM
- AWS KMS can use CloudHSM cluster as "custom key store" to store the keys:
- AWS Services can continue to talk to KMS for data encryption
- (AND) KMS does the necessary integration with CloudHSM cluster
- (Best Practice) CloudWatch for monitoring and CloudTrail to track key usage
- Use Cases
- (Web Servers) Offload SSL processing
- Certificate Authority
- Digital Rights Management
- TDE for Oracle Databases
AWS Shield
- Shields from Distributed Denial of Service (DDoS) attacks
- Disrupt normal traffic of a server by overwhelming it with a flood of Internet traffic
- Protect
- Amazon Route S3
- Amazon CloudFront
- AWS Global Accelerator
- Amazon Elastic Compute Cloud (EC2) Instances
- Elastic Load Balancers (ELB)
AWS Shield - Standard and Advanced
- AWS Shield Standard
- Zero Cost. Automatically enabled.
- Protection against common infrastructure (layer 3 and 4) DDoS attacks
- AWS Shield Advanced
- Paid service
- Enhanced protection for Amazon EC2, Elastic Load Balancing (ELB), Amazon CloudFront, AWS Global Accelerator, and Amazon Route S3
- 24x7 access to the AWS DDoS Response Team (DRT)
- Protects your AWS bill from usage spikes as a result of a DDoS attack
- Protect any web application (from Amazon S3 or external) from DDoS by putting Amazon CloudFront enabled AWS Shield in front of it
AMS WAF - Web Application Firewall
- AWS WAF protect your web application from OWASP Top 10 exploits, CVE and a lot more!
- OWASP (Open Web Application Security Project) Top 10
- List of broadly agreed "most critical security risks to web applications"
- Examples: SQL injection, cross-site scripting
- Common vulnerabilities and Exposures (CVE) is a list of information-security vulnerabilities and exposures
- Can be deployed on Amazon CloudFront, Application Load Balancer, Amazon API Gateway
- Customize rules & trigger real-time alerts (CloudWatch Alarms)
- Web traffic filtering: block attacks
- Filter traffic based on IP addresses, geo locations, HTTP headers, and body (block attacks from specific user-agents, bad bots, or content scrapers)
Amazon Macie
- Fully managed data security and data privacy service
- Automatically discover, classify, and protect sensitive data in Amazon S3 buckets
- When migrating to AWS use S3 for staging
- Run Macie to discover secure data
- Uses machine learning
- Recognizes sensitive data
- Example: personally identifiable information (PII) or intellectual property
- Provides you with dashboards and alerts
- Gives visibility into how data is being accessed and moved
AWS Inspector: Enhanced Security Scanning
- AWS Inspector: Automated Security Scanning
- Discover AWS workloads: Scans Amazon EC2 instances, containers, and Lambda functions for vulnerabilities
- Security: Identifies software vulnerabilities and checks for unintended network exposures
- Compliance: Helps ensure your AWS workloads with security standards and best practices
- Continuous Monitoring: Automatically assesses new and existing workloads to improve your security posture over time
AWS Systems Manager Parameter Score
- Manage application environment configuration and secrets
- database connections, password etc.
- Supports hierarchal structure
- Store configuration at one place
- Multiple applications
- multiple environments
- Maintains history of configuration over a period of time
- Integrates with KMS, IAM, CloudWatch, and SNS
AWS Secrets Manager
- Rotate, Manage and retrieve database credentials, API keys, and other secrets for your applications
- Integrates with KMS (encryption), Amazon RDS, Amazon redshift, and Amazon DocumentDB
- (KEY FEATURE) Rotate secrets automatically without impacting applications
- (KEY FEATURE) Service dedicated to secrets management
- Recommended for workloads needing HIPAA, PCI-DSS compliance
AWS Single Sign On
- Cloud-based single sign-on (SSO) service
- Centrally manage SSO access to all of your AWS accounts
- Integrates with Microsoft AD (Supports using your existing corporate accounts)
- Supports Security Assertion Markup Languages (SML) 2.0
- Deep integration with AWS Organizations (Centrally manage access to multiple AWS accounts)
- One place auditing in AWS CloudTrail
Other Important Security Services
- Amazon GuardDuty
- Continually monitor your AWS environment for suspicious activity (Intelligent Threat Detection)
- Analyze AWS CloudTrail events, VPC flow logs, etc.
- AWS Certificate Manager
- Provision, manage, deploy, and renew SSL/TLS certificates on the AWS platform
- AWS Artifact
- Self-service portal for on-demand access to AWS compliance reports, certifications, accreditations, and other third-party attestations
- Review, accept, and manage your agreements with AWS
- AWS Security Hub
- Consolidated view of your security status in AWS
- Automate security checks, manage security findings, and identify the highest priority security issues across your AWS environment.
- Amazon Detective
- Investigate and quickly identify the root cause of your potential security issue
- Automatically collect log data from your AWS resources and uses machine learning to help you visualize and conduct security investigations
- Penetration Testing
- Testing application security by simulating an attack
- You do not need permission from AWS to do penetration testing on a limited set of services (EC2 instances, ELB, RDS, CloudFront, API Gateway, Lambda, Elastic BeanStalk)
CloudTrail, Config, and CloudWatch
AWS CloudTrail
- Track events, API calls, changes made to your AWS resources:
- Who made the request?
- What action was performed?
- What are the parameters used?
- What was the end result?
- (USE CASE) Compliance with regulatory standards
- (USE CASE) Troubleshooting. Locate a missing resource
- Delivers log files to S3 and/or Amazon cloud watch logs log group (S3 is default)
- You can setup SNS notifications for log file delivery
AWS Cloud Trail Types
- Multi Region Trail
- One trail of all AWS regions
- Events from all regions can be sent to one CloudWatch logs log group
- Destination S3 bucket can be in any region
AWS Clous Trail - Good to Know
- Log files are automatically encrypted with Amazon S3 SSE
- You can configure S3 Lifecycle rules to archive or delete log files
- Supports log file integrity
- You can prove that a log file has not been altered
AWS Config
- Auditing
- Create a complete history of your AWS resources
- Resource History and Change Tracking
- Find how a resource was configured at any point in time
- Configuration of deleted resources would be maintained
- Delivers a history file to S3 bucket every 6 hours
- Take configuration snapshots when needed
- Governance
- Customize Config Rules for specific resources or for entire AWS account
- Continuously evaluate compliance against desired configuration
- Get a SNS notification for every configuration change
- Consistent Rules and Compliance across AWS accounts
- Group Config Rules and Remediation Actions into Conformance Packs
Predefined Config Rule Examples (80+)
- alb-http-to-https-redirection-check - Checks whether HTTP to HTTPS redirection is configured on all HTTP listeners of Application Load Balancers
- ebs-optimized-instance - Checks whether EBS optimization is enabled for your EC2 instances that can be EBS-optimized
- ec2-instance-no-public-ip - Do EC2 instances have public IPs?
- encrypted-volumes - Are all EC2 instance attached EBS volumes encrypted?
- eip-attached - Are all Elastic IP addresses used?
- restricted-ssh - Checks whether security groups that are in use disallow unrestricted incoming SSH traffic
AWS Config Rules
- (Feature) Create Lambda functions with your custom rules
- (Feature) You can setup auto remediation for each rule
- Take immediate action on a non compliant resource
- (Example) Stop EC2 instances without a specific tag!
- Enable AWS Config to use the rules
- No Free Tier
- No more rules to check => More $$$$
AWS Config + AWS CloudTrail
- AWS Config
- What did my AWS resource look like?
- AWS CloudTrail
- Who made an API call to modify this resource?
Monitoring AWS With Amazon CloudWatch
- Monitoring and observability service
- Collects monitoring and operational data in the form of logs, metrics, and events
- Set alarms, visualize logs, take automated actions, and troubleshoot issues
- Integrates with more than 70 AWS services
Amazon CloudWatch Logs
- Monitor and troubleshoot using system, application, and custom log files
- Real time application and system monitoring
- Monitor for patterns in your logs and trigger alerts based on them
- Example: Errors in a specific interval exceed a certain threshold
- Long term log retention
- Store logs in CloudWatch Logs for as long as you want (configurable - default: forever)
- Or archive logs to S3 bucket (Typically involves a delay of 12 hours)
- Or stream real time to Amazon Elasticsearch Service (Amazon ES) cluster using CloudWatch Logs subscription
Amazon CloudWatch Logs
- CloudWatch Logs Agent
- Installed on EC2 instances to move logs from servers to CloudWatch logs
- CloudWatch Logs Insights
- Write queries and get actionable insights from your logs
- CloudWatch Container Insights
- Monitor, troubleshoot, and set alarms for your containerized applications running in EKS, ECS and Fargate
Amazon CloudWatch Alarms
- Create alarms based on:
- Amazon EC2 instance CPU utilization
- Amazon SQS queue length
- Amazon DynamoDB table throughput or
- Your own custom metrics
- Take immediate action:
- Send a SNS event notification
- Send an email using SNS
- Execute an Auto Scaling Policy
Amazon CloudWatch Dashboards
- create auto refreshed graphs around all CloudWatch metrics
- Automatic Dashboards are available for most AWS services and resources
- Every Dashboard can have graphs from multiple regions
Amazon CloudWatch Events
- Enable you to take immediate action based on events on AWS resources
- Call a AWS Lambda function when an EC2 instance starts
- Send event to an Amazon Kinesis stream when an Amazon EBS volume is created
- Notify an Amazon SNS topic when an Auto Scaling event happened
- Schedule events - Use Unix chron syntax
- Schedule a call to a Lambda function every hour
- Send a notification to Amazon SNS topic every 3 hours
Decoupling Applications with SQS, SNS, and MQ
Need for Asynchronous Communication
- Why do we need asynchronous communication?
Synchronous Communication
- Applications on your web server make synchronous calls to the logging service
- What if your logging service goes down?
- Will your application go down too?
- What if all of sudden, there is high load and there are lot of logs coming in?
- Log Service is not able to handle the load and goes down very often
Asynchronous Communication - Decoupled
- Create a queue or topic
- Your applications put the logs on the queue
- They would be picked up when the logging service is ready
- Good example of decoupling
Asynchronous Communication - Scale Up
- You can have multiple logging service instances reading from the queue
Asynchronous Communication - Pull Model - SQS
- Producers put messages on the queue
- Consumers poll on the queue
- Only one of the consumers will successfully process a given message
- Scalability
- Scale consumer instances under high load
- Availability
- Producer up even if consumer is down
- Reliability
- Work s not lost due to insufficient resources
- Decoupling
- Make changes to consumers without effect on producers worrying about them
Asynchronous Communication - Push Model - SNS
- Subscribers subscribe to a topic
- Producers send notifications to a topic
- Notifications sent out to all subscribers
- Decoupling
- Producers don't care about who is listening
- Availability
- Producer up even if a subscriber is down
Simple Queuing Service
- Reliable, scalable, fully managed message queuing service
- High availability
- Unlimited scaling
- Auto scale to process billions of messages per day
- Low Cost (Pay for Use)
Standard and FIFO Queues
- Standard Queue
- Unlimited throughput
- But NO Guarantee of ordering (Best-Effort Ordering)
- and NO guarantee of exactly-once processing
- Guarantees at-least-once delivery (some messages can be processed twice)
- FIFO (first-in-first-out) Queue
- First-In-First-Out Delivery
- Exactly-Once Processing
- BUT throughput is lower
- Up to 300 messages per second (300 send, receive, or delete operations per second
- If you batch 10 messaged per operation (maximum), up to 3000 messages per second
- Choose
- Standard SQS queue if throughput is important
- FIFO Queue if order of events is important
Amazon Simple Notification Service (SNS)
- Publish-Subscribe (pub-sub) paradigm
- Broadcast asynchronous event notifications
- Simple process
- Create an SNS Topic
- Subscribers can register for a Topic
- When an SNS Topic receives an event notification (from publisher), it is broadcast to all Subscribers
- Use Cases: Monitoring Apps, workflow systems, mobile apps
Amazon Simple Notification Service (SNS)
- Provides mobile and enterprise message web services
- Push notifications to Apple, Android, FireOS, Windows devices
- Send SMS to mobile users
- Send Emails
- REMEMBER: SNS does not need SQD or a Queue
- You can allow access to other AWS accounts using SNS generated policy
Amazon MQ
- Managed message broker service for Apache Active MQ
- (Functionally) Amazon MQ = Amazon SQS (Queues) + Amazon SNS (Topics)
- BUT with restricted scalability
- Supports traditional APIs (JMS_ and protocols (AMQP, MQQTT, OpenWire, and STOMP)
- Easy to migrate on-premise applications using traditional message brokers
- Start with Amazon MQ as first step and slowly re-design apps to use Amazon SQS and/or SNS
- Scenario: An enterprise uses AMQP (standard message broker protocol). They want to migrate to AWS without making code changes
- Recommend Amazon MQ
Routing and Content Delivery
Content Delivery Network
- You want to deliver content to your global audience
- Content Delivery Networks distribute content to multiple edge locations around the world
- AWS provides 200+ edge locations around the world
- Provides high availability and performance
Amazon CloudFront
- How do you enable serving content directly from AWS edge locations?
- Amazon CloudFront (one of the options)
- Serve users from the nearest edge location (based on user location)
- Source content can be from S3, EC2, ELB, and External Websites
- If content is not available at the edge location, it is retrieved from the origin server and cached
- No minimum usage commitment
- Provides features to protect your private content
- Use Cases
- Static Web apps. Audio, video and software downloads. Dynamic web apps
- Supporting media streaming with HTTP and RTMP
- Integrates With
- AWS Shield to protect from DDoS attacks
- AWS Web Application Firewall (WAF) to protect from SQL injection, cross-site scripting, etc.
- Cost Benefits
- Cero cost for data transfer between S3 and CloudFront
- Reduce compute workload for your EC2 instances
- Create a CloudFront distribution to distribute consent to edge locations
- DNS domain name - example abc.cloudront.com
- Origins - Where do you get content from? S3, EC2, ELB, External Website
- Cache-Control
- By default objects expire after 24 hours
- Customize min, max, default TTL in CloudFront distribution
- (For file level customization) Use Cache-Control max-age and Expires headers in origin server
- You can configure CloudFront to only use HTTPS (or) use HTTPS for certain objects
- Default is to support HTTP and HTTPS
- You can configure CloudFront to redirect HTTP to HTTPS
AWS Edge Locations: Content Delivery Hubs
- AWS Edge Locations: Delivery Hubs for Your Apps
- Used by Amazon CloudFront: CloudFront distributes your static content (images, videos, etc.) across a global network of edge locations, minimizing latency and improving delivery speeds for users worldwide
- Used by Global Accelerator: Global Accelerator intelligently routes user traffic to the closest AWS edge location, minimizing latency and improving loading times
- Used by Amazon S3 Transfer Acceleration: Accelerates long-distance transfers to and from your Amazon S3 buckets
Route 53
- What would be the steps in setting up a website with a domain name (for example, in28minutes.com)?
- But the domain name in 28minutes.com (Domain Registrar)
- Setup your website content (Website Hosting)
- Route requests to in28minutes.com to the my website host server (DNS)
- Route 53 = Domain Registrar + DNS
- Buy your domain name
- Setup your DNS routing for in28minutes.com
Route 52 - DNS (Domain Name Server)
- How Should traffic be routed for in28minutes.com
- Configure Records:
- Route api.in28minutes.com to the IP address of api server
- Route static.in28minutes.com to the IP address of http server
- Route email to the mail server
- Each record is associated with a TTL (Time To Live) - How long is your mapping cached at the routers and the client?
Route 52 Hosted Zone
- Container for records containing DNS records routing traffic for a specific domain
- I want to use Route 53 to manage the records (Name Server) for in28minutes.com
- Create a hosted zone for in28minutes.com in Route 53
- Hosted zones can be
- private - routing within VPCs
- public - routing on the internet
- Manage the DNS records in a Hosted Zone
Standard DNS Records
- A - Name to IPV4 address(es)
- AAAA - Name to IPV6 address(es)
- NS - name Server containing DNS records
- I bought in28minutes.com from GoDaddy (Domain Registrar)
- BUT I can use Route 52 as DNS
- Create NS records on GoDaddy
- Redirect to Route 52 Name Servers
- MX - Mail Exchange
- CNAME - Name1 to Name2
Route 52 Specific Extension - Alias Records
- Route traffic to selected AWS resources
- Elastic BeanStalk environment
- ELB load balancer
- Amazon S3 bucket
- CloudFront Distribution
- Alias records can be created for
- root (in28,inutes.com) and
- non root domains (api.in28minutes.com)
- COMPARED to CNAMD records which can only be created for
- non root domains (api.in28minutes.com)
Route 53 - Routing
- Route 52 can route across Regions
- Create ALBs in multiple regions and route to them
- Offers multiple routing policies
Route 53 Routing Policies
Need for AWS Global Accelerator
- Cached DNS answers
- clients might cache DNS answers causing a delay in propagation of configuration updates
- High latency
- users connect to the region over the internet
AWS Global Accelerator
- Directs traffic to optimal endpoints over the AWS global network
- Global Accelerator provides you with two static IP addresses
- Static IP addresses are anycast from the AWS edge network
- Distribute traffic across multiple endpoint resources in multiple AWS Regions
- Works with Network Load Balancers, Application load Balancers, EC2 Instances, and Elastic IP addresses
Moving Data Between AWS and On-Premises
AWS Snowball
- Transfer dozens of terabytes to petabytes of data from on-premises to AWS
- 100TB (80TB usable) per appliance
- Involves physical shipping
- Simple process
- Request for Snowball
- Copy data
- Ship it back
- Manage jobs with AWS Snowball console
- Data is automatically encrypted with KMS
- Current versions of AWS Snowball use Snowball Edge devices
- Provide both compute and storage
- Pre-process data (using Lambda functions)
- Choose between
- Storage optimized (34 vCPUs, 32 GiB RAM)
- Compute optimized (52 vCPUs, 208 GiB RAM)
- Compute optimized with GPU
- Choose Snowball if direct transfer takes over a week
- 5TB can be transferred on 100 Mbps line in a week at 80% utilization
AWS DataSync - Transfer File Storage to Cloud
- Secure and 10x faster (100s of TB) data transfers from/to AWS over internet or AWS Direct Connect
- Transfer from on-premise file storage (NFS, SMB) to S3, EFS, or FSx for Windows
- Monitor progress using Amazon CloudWatch
- (Use Cases) Data Migration, Data replication, and Cold Data Archival
- (Alternative) Use AWS Snowball if you are bandwidth constrained or transferring data from remote, or disconnected
- (Alternative) Use S3 Transfer Acceleration when your applications are integrated with S3 API. If not, prefer AWS DataSync (Supports multiple destinations, built-in retry)
- (Integration) Migrate data using DataSync and use AWS Storage Gateway for ongoing updates from on-premises applications
AWS Data Pipeline
- Process and move data (ETL) between S3, RDS, DynamoDB, EMR, On-premise data sources
- Create complex data processing workloads that are fault tolerant, repeatable, and highly available
- Launches required resources and tear them down after execution.
- REMEMBER: NOT for streaming data!
AWS Database Migration Service
- Migrate databases to AWS while keeping source database operational
- Homogeneous Migrations (ex. Oracle to Oracle)
- Heterogeneous Migrations (ex. Oracle to Amazon Aurora, MySQL to Amazon Aurora)
- Free for first 6 months when migrating to Aurora, Redshift, or DynamoDB
- (AFTER MIGRATION) Keep databases in sync and pick right movement to switch
- (Use case) Consolidate multiple databases into a single target database
- (Use case) Continuous Data Replication can be used for Disaster Recovery
AWS Schema Conversion Tool
- Migrate data from commercial databases and data warehouses to open source or AWS services
- Preferred option for migrating data warehouse data to Amazon Redshift
- Migrate database schema (views, stored procedures, and functions) to compatible targets
- Features
- SCT assessment report
- Analyze a database to determine the conversion complexity
- Update source code (update embedded SQL in code)
- Fan-in (multiple sources - single target)
- Fan-out (single source - multiple targets)
Database Migration Service VS Schema Conversion Tool
- (Remember) SCT is part of DMS service
- DMS is preferred for homogeneous migrations
- SCT is preferred when schema conversions are involved
- DMS is for smaller workloads
- SCT preferred for large data warehouse workloads
- Prefer SCT for migrations to Amazon Redshift
- Only DMS provides continuous data replication after migration
Amazon Kinesis
Amazon Kinesis
- Handle streaming data
- NOT recommended for ETL batch jobs
- Amazon Kinesis Data Streams
- Process data Streams
- Amazon Kinesis Firehouse
- Data ingestion for streaming data: S3, Elasticsearch, etc.
- Amazon Kinesis Analytics
- Run queries against streaming data
- Amazon Kinesis Video Streams
- Monitor Video Streams
Amazon Kinesis Data Streams
- Limitless Real time stream processing
- Sub second processing latency
- Alternative for Kafka
- Supports multiple clients
- Each client can track their stream position
- Retain and replay data (max 7 days and default 1 day)
Amazon Kinesis Data Streams - Integrations
- Use application integrations to generate streams
- Toolkits: AWS SDK, AWS Mobile SDK, Kinesis Agent
- Service Integrations: AWS IOT, CloudWatch Events and Logs
- Process streams using Kinesis Stream Applications
- Run on EC2 instances
- Written using Kinesis Data Streams APIs
Amazon Kinesis Data Firehose
- Data ingestion for streaming data
- Receive
- Process (transform - Lambda, compress, encrypt)
- Store stream data to S3, Elasticsearch, Redshift and Splunk
- Use existing analytics tools based on S3, Redshift, and Elasticsearch
- Pay for volume of data ingested (Serverless)
Amazon Kinesis Analytics
- You want to continuously find active number of users on a website in the last 5 minutes based on streaming website data
- With Amazon Kinesis Analytics, you can write SQL queries and build JAVA applications to continuously analyze your streaming data
Amazon Kinesis Video Streams
- Monitor video streams from web cams
- Examples: traffic lights, shopping malls, homes. etc.
- Integrate with machine learning frameworks to get intelligence
DevOps
DevOps
- Getting better at "Three Elements of Great Software Teams"
- Communication - Get teams together
- Feedback - Earlier you find a problem, the easier it is to fix
- Automation - Automate texting, infrastructure provisioning, deployment, and monitoring
DevOps - CI, CD
- Continuous Integration
- Continuously run your tests and packaging
- Continuous Deployment
- Continuously deploy to test environments
- Continuous Delivery
- Continuously deploy to production
DevOps - CI, CD Tools
- AWS CodeCommit - Private source control (GIT)
- AWS CodePipeling - Orchestrate CI/CD Pipelines
- AWS CodeBuild - Build and Test Code (application packages and containers)
- AWS CodeDeploy - Automate Deployment (EC2, ECS, Elastic Beanstalk, EKS, Lambda etc.)
DevOps - IAAC
- Treat infrastructure the same way as application code
- Track your infrastructure changes over time (version control)
- Bring repeatability into your infrastructure
- Two Key Parts
- Infrastructure Provisioning
- Provisioning compute, database, storage, and networking
- Open source cloud neutral - Terraform
- AWS Service - CloudFormation
- Configuration Management
- Install right software and tools on the provisioned resources
- Open Source Tools - Chef, Puppet, Ansible
- AWS Service - OpsWorks
AWS CloudFormation - Introduction
- Let's consider an example:
- I would want to create a new VPC and a subnet
- I want to provision a ELB, ASG with 5 EC2 instances and an RDS database in the subnet
- I would want to setup the right security groups
- AND I would want to create 4 environments
- Dev, QA, Stage, and production!
- Cloud Formation can help you do all these with a simple (or not so simple) script!
AWS CloudFormation - Advantages
- Automate deployment and modification of AWS resources in a controlled, predictable way
- Avoid configuration drift
- Avoid mistakes with manual configuration
- Think of it as version control for your environments
AWS CloudFormation
- All configuration is defined in a simple text file - JSON or YAML
- I want a VPC, a subnet, a database, and ...
- CloudFormation understands dependencies
- Create VPCs first, then subnets, then the database
- (Default) Automatic rollbacks on errors (Easier to retry)
- If creation of the database fails, it would automatic delete the subnet and VPC
- Version control your configuration file and make changes to it over time
- Free to use - Pay only for the resources provisioned
- Get an automated estimate for your configuration
AWS CloudFormation - Terminology
- Template
- A CloudFormation JSON or YAML defining multiple resources
- Stack
- A group of resources that are created from a CloudFormation template
- In the earlier example, the stack contains an EC2 instance and a security group
- Change Sets
- To make changes to stack, update the template
- Change set shows what would change if you execute
- Allows you to verify the changes and then execute
AWS CloudFormation - Important Template Elements
- Resources: What do you want to create
- One or more mandatory elements
- Parameters - values to pass to your template at runtime
- Which EC2 instance to create? - ("t2.micro", "m1.small" , "m1.large")
- Mappings - Key value pairs
- Example: Configure different values for different regions
- Outputs - return values from execution
- See them on console and use in automation
AWS CloudFormation - remember
- Deleting a stack deletes all the associated resources
- EXCEPT for resources with DeletionPolicy attribute set to "Retain"
- You can enable termination protection for the entire stack
- Templates are stored in S3
- Use CloudFormation Designer to visually design templates
- AWS CloudFormation StackSets
- Create update, or delete stacks across multiple accounts and regions with a single operation
CloudFormation vs AWS Elastic Beanstalk
- (Do you know?) You can create an Elastic Beanstalk environment using cloud formation
- Think of Elastic Beanstalk as a pre-packaged Cloud Formation template with a User Interface
- You choose what you want
- (Background) A cloud formation template is created and executed
- The environment is ready
AWS CDK: Define Cloud Infrastructure Using Code
- AWS CDK (Cloud Development Kit): Provision AWS resources with familiar programming languages
- Code as infrastructure: Use TypeScript, Python, Java, or .NET to define resources
- Use CloudFormation: Translates your code into AWS CloudFormation templates for reliable and repeatable deployments
- Streamline Development: Simplify the creation of complex, multi-component AWS applications with modular, reusable components
- Automate Deployment: Integrate with AWS deployment pipelines for CI/CD
AWS OpsWorks - Configuration Management
- OpsWorks is used for Configuration Management
- How do you ensure that 100 servers have the same configuration?
- How can I make a change across 100 servers?
- Managed service based on Chef and Puppet
- One service for deployment and operations in cloud and on-premise environments
- Configuration - Chef recipes or cookbooks, Puppet manifests
- All metrics are sent to Amazon CloudWatch
- (IMPORTANT) All configuration management tools can also do infrastructure provisioning
- However, I would recommend NOT doing that as they are not good at infrastructure provisioning
AWS CloudShell: Command Line at Your Fingertips
- AWS CloudShell: instant Command Line I/F
- Browser-based access: No setup required, use the AWS CLI directly from your browser
- pre-authenticated: Automatically logs in with your console credentials for immediate access to your resources
- Built-in tools: Comes with pre-installed AWS CLI and other useful software to manage your resources
- No extra cost: Available at no additional charge, you pay only for the AWS resources you manage with CloudShell
Management Services in AWS
AWS Organizations
- organization have multiple AWS accounts
- Different business units
- Different environments
- How do you centralize your management (billing, access control, compliance and security) across multiple AWS accounts?
- Welcome AWS organizations!
- Organize accounts into Organizational Units (OU)
- Provides API to automate creation of new accounts
AWS Organizations - Features
- One consolidated bill for all AWS accounts
- Centralized compliance management for AWS Config Rules
- Send AWS CloudTrail data to one S3 bucket (across accounts)
- AWS Firewall Manager to manage firewall rules (across accounts)
- AWS WAF, AWS Shield Advanced protections, and Security Groups
- Use Service control points (SCPs) to define restrictions for actions (across accounts):
- Prevent users from disabling AWS Config or changing its rules
- Require Amazon EC2 instances to use a specific type
- Require MFA to stop an Amazon EC2 instance
- Require a tag upon resource creation
AWS Trusted Advisor
- Recommendations for cost optimization, performance, security and fault tolerance
- Red - Action recommended Yellow - investigate and Green - Good to go
- All AWS customers get 4 checks for free:
- Service limits (usage > 80%)
- Security groups having unrestricted access (0.0.0.0/0)
- Proper use of IAM
- MFA on Root Account
- Business or Enterprise AWS support plan provides over 50 checks
- Disable those you are not interested in
- How much will you save by using Reserved Instances?
- How does your resource utilization look like? Are you right sized?
AWS Trusted Advisor Recommendations
- Cost Optimization
- Highlight unused resources
- Opportunities to reduce your costs
- Security
- settings that can make your AWS solution more secure
- Fault Tolerance
- Increase resiliency of your AWS solution
- Redundancy improvements, over-utilized resources
- Performance
- Improved speed and responsiveness of your AWS solutions
- Service Limits
- Identify if your service usage is more than 80% of service limits
AWS Service Quotas
- AWS account has Region-specific default quotas or limits for each service
- You don't need to remember all of them
- Service Quotas allows you to manage your quotas for over 100 AWS services, from one location
AWS Directory Service
- Provide AWS Access to on-premise users without IAM users
- Managed service deployed across multiple AZs
- Option 1: AWS Directory Service for Microsoft AD
- More than 5000 Users
- Trust relationships needed between AWS and on-premise directory
- Option 2: Simple AD
- Less than 5000 users
- Powered by Samba4 and compatible with Microsoft AD
- Does not support trust relationships with other AD domains
- Option 3: AD Connector
- Use your existing on-premise directory with other AWS cloud services
- Your use existing credentials to access AWS resources
Billing and Cost Management Services / Tools
- AWS Billing and Cost Management - Pay your AWS bill, monitor your usage, and analyze and control your costs
- Costs Explorer - View your AWS cost as a graph. Filter by a Region, AZ, tags, etc. See future cost projection
- AWS Budgets - Create a budget. Create Amazon SNS notifications to alert you when you go over (or projected to go over) budget
- AWS Compute Optimizer - recommends optimal AWS Compute resources to reduce costs (Example: Right-sizing - EC2 instance type and EC2 Auto Scaling group configuration)
- AWS Pricing Calculator (NEW) - Estimate cost of your architecture solution
- AWS Simple Monthly Calculator (OLD) - Estimate charges for AWS services
- Total Cost of Ownership (TCO) Calculator (OLD) - Compare Cost of running applications in AWS vs On Premise
Other Management Services
- AWS Marketplace
- Digital catalog to find, test, buy, and deploy licensed software solutions using flexible pricing options: Bring Your Own License (BYOL), free trial, pay-as-you-go, hourly, monthly, etc.
- Resource Groups
- Group your AWS resources
- Automate Tasks using AWS Systems Manager
- Get group related insights from AWS Config and CloudTrail
- AWS Systems Manager
- Run commands (operational tasks) on Amazon EC2 instances
- Manage your OS patches
- Personal Health Dashboard
- Personalized alerts when AWS is experiencing events that my impact you
- Provides troubleshooting guidance
Serverless Architecture
Rest API Challenges
- Most applications today are built around REST API
- Management of REST API is not easy
- You've to take care of authentication and authorization
- You've to e able to set limits (rate limiting, quotas) for your API consumers
- You've to take care of implementing multiple versions of your API
- You would want to monitor your API calls
- You would want to be able to cache API requests
Amazon API Gateway
- How about a fully managed service with auto scaling that can act as a "font door" to your APIs?
- Welcome "Amazon API Gateway"
- "publish, monitor, and secure APIs at any scale"
- Integrates with AWS Lambda, Amazon EC2, Amazon ECS, or any web application
- Supports HTTP(S) and WebSockets (two way communication - chat apps and streaming dashboards)
- Serverless. Pay for use (API calls and connection duration)
Amazon API Gateway - Remember
- Run multiple versions of the same API
- Rate Limits (request quota limits), throttling and fine-grained access permissions using API Keys for Third-Party Developers
- Implement Authorization with
- AWS IAM
- Amazon Cognito
- Custom Lambda Authorizer
Amazon Cognito
- Want to quickly add a sign up page and authentication for your mobile and web apps?
- Want to integrate with web identity providers (example: Google, Facebook, Amazon) and provide a social sign-in?
- Do you want security features such as multi-factor authentication (MFA), phone and email verification?
- Want to create your own user database without worrying about scaling or operations?
- Let's go: Amazon Cognito
- Support for SAML
Amazon Cognito - User Pools
- Do you want to create your own secure and scalable user directory?
- Do you want to create sign-up pages?
- Do you want a built-in, customizable web UI to sign in users (with option to social sign in)?
- Create a user pool
Amazon Cognito - Identity Pools
- Identity pools provide AWS credentials to grant your users access to other AWS services
- Connect identity pools with authentication (identity) providers
- Your own user pool OR
- Amazon, Apple, Facebook, Twitter, OR
- OpenID Connect Provider OR
- SAML identity Providers (SAML 2.0)
- Configure multiple authentication (identity) providers for each identity pool
- Federated identity
- An external authentication (identity) provider
- ex. Amazon, Apple, Facebook, OpenID, or SAML identity providers
Serverless Application Model
- 1000s of Lambda functions to manage, versioning, deployment, etc.
- Serverless projects can become maintenance headache
- How to test serverless projects with Lambda, API Gateway, and DynamoDB in your local?
- How to ensure that your serverless projects are adhering to best practices?
- Tracing (X-Ray), CI/CD (CodeBuild, CodeDeploy, CodePipeline) etc.
- Welcome SAM - Serverless Application model
- Open source framework for building serverless applications
- Define YAML with all the serverless resources you want:
- Functions, APIs, Databases, etc.
- BEHIND THE SCENES: Your configuration is used to create a AWS CloudFormation syntax to deploy your application
AWS Step Functions
- Create a serverless workflow in 10 minutes using a visual approach
- Orchestrate multiple AWS services into serverless workflows:
- Invoke an AWS Lambda function
- Run an Amazon Elastic Container Service or AWS Fargate task
- Get an existing item from an Amazon DynamoDB table or put a new item into a DynamoDB table
- Publish a message to an Amazon SNS topic
- Send a message to an Amazon SQS queue
- Build workflows as a series of steps
- Output of one step flows as input into next step
- Retry multiple times until it succeeds
- Maximum duration of 1 year
- Integrates with Amazon API Gateway
- Expose API around Step Functions
- Include human approvals into workflows
- (Use case) Long running workflows
- Machine learning model training, report generation, and IT automation
- (Use case) Short duration workflows
- IoT data ingestion, and streaming data processing
- (Benefits) Visual workflows with easy updates and less code
- (Alternative) Amazon Simple Workflow Service
- Complex orchestration code (external signals, launch child processes)
- Step Functions is recommended for all new workflows UNLESS you need to write complex code for orchestration
Containers and Container Optimization
Microservices
- Enterprises are heading towards microservices architectures
- Build small focused microservices
- Flexibility to innovate and build applications in different programming languages (Go, Java, Python, JavaScript, etc.)
- But deployments become complex!
- How can we have one way of deploying Go, Java, Python, or JavaScript ... microservices?
- Enter containers!
Docker
- Create Docker images for each microservice
- Docker image contains everything a microservice needs to run:
- Application Runtime
- Application code
- Dependencies
- You can run these docker contains the same way on any infrastructure
- Your local machine
- Corporate data center
- Cloud
Docker - Advantages
- Docker containers are light weight (compared to virtual Machines)
- Docker provides isolation for containers
- Docker is cloud neutral
- How do you manage 1000's of containers belonging to multiple microservices?
- Enter Container Orchestration
- Requirement: i want 10 instances of Microservice A container, 15 instances of Microservice B container, and ...
- Typical Features:
- Auto Scaling - Scale containers based on demand
- Service Discovery - Help microservices find one another
- Load Balancer - Distribute load among multiple instances of a microservice
- Self Healing - Do health checks and replace failing instances
- Zero Downtime Deployments - Release new versions without downtime
Cloud Orchestration Options
- Cloud Neutral
- Kubernetes
- AWS service - AWS elastic Kubernetes Service (EKS)
- EKS does not have a free tier
- AWS Specific
- AWS Elastic Container Service (ECS)
- AWS Fargate: Serverless version of AWS ECS
- AWS Fargate does not have a free tier
Amazon Elastic Container Service (Amazon ECS)
- Fully managed service for container orchestration
- Serverless option - AWS Fargate
- Use cases:
- Microservices Architectures - Create containers for your microservices and orchestrate them using ECS and Fargate
- Batching Processing. Run batch workloads on EC2 and AWS Batch
Architecture and Best Practices
Well Architected Framework
- Helps cloud architects build application infrastructure which is:
- Secure
- High-performing
- Resilient and
- Efficient
- Five Pillars
- Operational Excellence
- Security
- Reliability
- Performance Efficiency
- Cost Optimization
Operational Excellence
- Avoid/Minimize effort and problems with
- Provisioning servers
- Deployment
- Monitoring
- Support
Operational Excellence - Solutions and AWS Services
- Use managed services
- You don't need to worry about managing servers, availability, durability, etc.
- Go serverless
- Prefer Lambda to EC2
- Automate with Cloud Formation
- Use Infrastructure As Code
- Implement CI/CD to find problems early
- CodePipeline
- CodeBuild
- CodeDeploy
- Perform frequent, small reversible changes
Operational Excellence - Solutions and AWS Services
- Prepare: for failure
- Game days
- Disaster recovery exercises
- Implement standards with AWS Config Rules
- Operate: Gather Data and Metrics
- CloudWatch (Logs agent), Config, Config Rules, CloudTrail, VPX Flow Logs and X-Ray (tracing)
- Evolve: Get intelligence
- Use Amazon Elasticsearch to analyze your logs
Security Pillar
- Principle of least privilege for least time
- Security in Depth - Apply security in all layers
- Protect data in Transit and at rest
- Actively monitor for security issues
- Centralize policies for multiple AWS accounts
Security Pillar - Principle of least privilege for least time
- Use temporary credentials when possible (IAM roles, Instance profiles)
- Use IAM Groups to simplify IAM management
- Enforce strong password practices
- Enforce MFA
- Rotate credentials regularly
Security Pillar - Security in Depth
- VPCs and Private Subnets
- Security groups
- Network Access Control List (NACL)
- Use hardened EC2 AMIs (golden image)
- Automate patches for OS, Software etc
- Use CloudFront with AWS Shield for DDoS mitigation
- Use WAF with CloudFront and ALB
- Protect web applications from CSS, SQL injection, etc.
- Use CloudFormation
- Automate provisioning infrastructure that adheres to security policies
Security Pillar - Protecting Data at Rest
- Enable Versioning (when available)
- Enable encryption - KMS and Cloud HSM
- Rotate encryption keys
- Amazon S3
- SSE-C, SSE-S3, SSE-KMS
- Amazon DynamoDB
- Encryption Client, SSE-KMS
- Amazon Redshift
- Amazon KMS and AWS Cloud HSM
- Amazon EBS, Amazon SQS, and Amazon SNS
- Amazon KMS
- Amazon RDS
- Amazon KMS, TDE
Security Pillar - Protecting Data in Transit
- Data coming in and going out of AWS
- By default, all AWS API use HTTPS/SSL
- You can also choose to perform client side encryption for additional security
- Ensure that your data goes through AWS network as much as possible
- VPX Endpoints and AWS PrivateLink
Security Pillar - Detect Threats
- Actively monitor for security issues
- Monitor CloudWatch Logs
- Use Amazon GuardDuty to detect threats and continuously monitor for malicious behavior
Reliability
- Ability to:
- recover from infrastructure and application issues
- Adapt to changing demands in load
Reliability - Best Practices
- Prefer serverless architectures
- Prefer loosely coupled architectures
- SQS, SNS
- Distributed System Best Pracices
- Use Amazon API Gateway for throttling requests
- AWS SDK provides with exponential backoff
Loosely Coupled Architectures
- ELB
- Woks in tandem with AWS auto scaling
- Amazon SQS
- Polling Mechanism
- Amazon SNS
- Publish subscribe pattern
- Bulk notifications and Mobile push support
- Amazon Kinesis
- Handle event streams
- Multiple clients
- Each client can track their stream position
Performance Efficiency - Best Practices
- Use managed services
- Focus on your business instead of focusing on resource provisioning and management
- Go serverless
- Lower transactional cost and less operational burden
- Experiment
- Cloud makes it easy to experiment
- Monitor Performance
- Trigger CloudWatch alarms and perform actions through Amazon SQS and Lambda
Performance Efficiency - Choose the Right Solution
- Compute
- EC2 instance vs Lambda vs Containers
- Storage
- Block, File, Object
- Database
- RDS vs DynamoDB vs RedShift
- Caching
- ElastiCache vs CloudFront vs DAX vs Read Replicas
- Network
- CloudFront, Global Accelerator, Route 52, Placement Groups, VPC endpoints, Direct Connect
- Use product specific features
- Enhanced Networking, S3 Transfer Acceleration, EBS Optimized Instances
Cost Optimization
- Run systems at lowest cost
Cost Optimization - Best Practices
- Match supply and demand
- Implement Auto Scaling
- Stop Dev/Test resources when you don't need them
- Go Serverless
- Track your expenditure
- Cost Explorer to track and analyze your spend
- AWS Budgets to trigger alerts
- use tags on resources
Cost Optimization - Choose Cost-Effective Solutions
- Right Sizing: Analyze 5 large servers vs 10 small servers
- Use CloudWatch (monitoring) and Trusted Advisor (recommendations) to right size your resource
- Email server vs managed email service (charged per email)
- On-demand vs reserved bs spot instances
- Avoid expensive software: MySQL vs Aurora vs Oracle
- Optimize data transfer costs using AWS Direct Connect and Amazon CloudFront
Digital Transformation
- How consumers make purchase decisions? (Social)
- How we do things? (Mobile)
- How much data we have(Big Data)
- Digital Transformation: Using modern technologies to create (or modify) business processes and customer experiences by innovating with technology and team culture
Shared Responsibility Model
- Security and Compliance is shared between AWS and customer
Shared Responsibility Model
- AWS manages security of the cloud
- AWS operates, manages, and controls components from the Host OS and virtualization layer down to the physical safety
- YOU are responsible for security in the cloud
- Customer assumes responsibility and management of
- Guest operating system (including updates and security patches)
- Application software
- Configuration of Security Group
- Choosing and Integrating AWS Services with their IT environments
More AWS Services
AWS Transit Gateway
- AWS Transit Gateway: connect multiple VPCs with VPN and DC
- Supports Global inter-Region peering
- Traffic between an Amazon VPC and AWS Transit Gateway remains on the AWS global private network
- Create Route Table and associate Amazon VPCs and VPNs
Machine Learning - 3 Approaches
- Use Pre-Trained Models
- Get intelligence from text, images, audio, video
- Amazon Comprehend, Amazon Rekognition...
- Build simple models: Without needing Data scientists
- Limited / no-code experience
- Example: Amazon SageMaker AutoML
- Build complex models: Using data scientists and team
- Build your own ML models from zero (code-experienced)
- Example: Amazon SageMaker
Pre-Trained Models in AWS
- Amazon Comprehend: Analyze unstructured text
- Amazon Textract: Easily extract text and data from virtually any document
- Amazon Rekognition: Search and Analyze Images and Videos
- Amazon Transcribe: Powerful Speech Rekognition
- Amazon Polly: Turn Text into Lifelike Speech
- Amazon Translate: Powerful Neural Machine Translation
- Amazon Personalize: Add real-time recommendations to your apps
- Amazon Fraud Detector: Detect online fraud faster
- Amazon Forecast: Time-series forecasting service
- Amazon Kendra: Intelligent search service (Search from scattered content - multiple locations and content repositories)
- Amazon Lex: Build Voice and Text chatbots
Amazon SageMaker
- Amazon SageMaker: Simplifies creation of your models
- Manage data, code, compute, models, etc.
- Prepare data
- Train models
- Publish models
- Monitor models
- Multiple Options to create models
- AutoML/Autopilot: Build custom models with minumum ML experties
- Build your own models: Data Scientists
- Support for deep learning frameworks such as TensorFlow, Apache MXNet, PyTorch, and more (use them within the built in containers)
- Data and compute
Big Data - Terminology and Evolution
- 3Vs of Big Data
- Volume: Terabytes to Petabytes to Exabytes
- Variety: Structured, Semi structured, unstructured
- Velocity: Batch, Streaming
- Terminology: Data warehouse vs Data lake
- Data warehouse: PBs of Storage + Compute (Typically)
- Data stored in format ready for specific analysis (processed data)
- Examples: terabyte, BigQuery (GCP), Redshift (AWS), Azure Synapse Analytics
- Typically uses specialized hardware
- Data lake: Typically retains all raw data (compressed)
- Typically object storage is used as data lake
- Amazon S3, Google Cloud Storage, Azure Data Lake Storage Gen2 etc..
- Flexibility while saving cost
- Perform ad-hoc analysis on demand
- Analytics & intelligence services (even data warehouses) can directly read from data lake
- Azure Synapse Analytics, BigQuery(GCP), Redshift Spectrum(AWS), Amazon Athena etc..
Big Data and Data warehousing in AWS
Comments
You have to be logged in to add a comment
User Comments
There are currently no comments for this article.