What did I learn about AWS S3? AWS S3 Essentials for Success
As we know, companies are moving to the cloud for their business needs. One problem every company faced was storage infrastructure: to store data, a company had to buy and manage its own hardware, and managing on-premises storage is costly.
What is AWS S3?
AWS solved this problem by introducing AWS S3 in 2006. AWS S3 (Simple Storage Service) is a cloud storage service where we can store any type of data. It stores the data as objects inside containers called buckets.
Concept:
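An AWS S3 bucket is like a folder on our local machine. We create a bucket (the parent folder), then we organize files in sub-folders inside it according to our needs. Under the hood S3 has no real folders: every object has a key such as reports/2024/january.csv, and the "folders" are just key prefixes that the console displays as a tree. A bucket can hold an unlimited amount of data; the 5TB limit applies to the size of a single object. There is also a default quota on the number of buckets, and it applies per AWS account rather than per region (for many years the default was 100 buckets, and it can be raised on request).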
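To make the "folders are only key prefixes" idea concrete, here is a minimal sketch in plain Python. It does not call AWS at all; the function list_folder and the sample keys are made up for illustration, and the logic only imitates how a listing with a prefix and a "/" delimiter separates files from sub-folders.

```python
def list_folder(keys, prefix="", delimiter="/"):
    """Imitate a prefix + delimiter listing over a flat set of object keys.

    Returns (files, subfolders) directly under the given prefix.
    This is a local simulation, not a call to the S3 API.
    """
    files, subfolders = [], set()
    for key in keys:
        if not key.startswith(prefix):
            continue
        rest = key[len(prefix):]
        if delimiter in rest:
            # Anything past the next delimiter is grouped as one "sub-folder".
            subfolders.add(prefix + rest.split(delimiter, 1)[0] + delimiter)
        elif rest:
            files.append(key)
    return sorted(files), sorted(subfolders)


# Hypothetical object keys stored in one bucket (a flat namespace).
keys = [
    "reports/2024/january.csv",
    "reports/2024/february.csv",
    "reports/summary.txt",
    "images/logo.png",
]

files, subfolders = list_folder(keys, prefix="reports/")
print(files)       # objects directly under reports/
print(subfolders)  # "sub-folders" under reports/
```

Nothing is nested in storage; the tree only appears when we group keys by prefix.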
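We can access the bucket in different ways, such as:
- Using the bucket URL (endpoint)
- Programmatically
- AWS Console
Every object in a bucket has a URL of the form https://bucket-name.s3.region.amazonaws.com/key, which works for requests that are authorized (or for objects that have been made public). AWS also gives every bucket an ARN (Amazon Resource Name), such as arn:aws:s3:::bucket-name. An ARN is not a URL; it is an identifier used to refer to the bucket in IAM policies and bucket policies. You can also write code in your desired language with the AWS SDKs to access S3 buckets, or you can work with the buckets in the AWS Console.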
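The difference between an ARN and a URL, and what programmatic access looks like, can be sketched in a few lines of Python. The bucket name, region and key below are placeholders. The helper names are my own; only upload touches AWS, and it assumes boto3 is installed and credentials are configured.

```python
from urllib.parse import quote


def bucket_arn(bucket):
    """ARN of a bucket: an identifier used in policies, not a web address."""
    return f"arn:aws:s3:::{bucket}"


def object_arn(bucket, key):
    """ARN of a single object inside a bucket."""
    return f"arn:aws:s3:::{bucket}/{key}"


def object_url(bucket, region, key):
    """Virtual-hosted-style HTTPS URL of an object (key is URL-encoded)."""
    return f"https://{bucket}.s3.{region}.amazonaws.com/{quote(key)}"


def upload(path, bucket, key):
    """Programmatic access: upload a local file with boto3.

    boto3 is imported here so the rest of the sketch runs without it.
    """
    import boto3

    boto3.client("s3").upload_file(path, bucket, key)


# Placeholder names, for illustration only.
print(bucket_arn("my-example-bucket"))
print(object_url("my-example-bucket", "us-east-1", "photos/cat 1.png"))
```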
AWS S3 Storage Classes:
There are different storage classes with different pricing. Which class fits depends on how often the data is accessed. New objects go to the Standard class by default, which suits frequently accessed data. If the access pattern is unknown or changing, you can choose the Intelligent-Tiering class, which automatically moves objects between frequent and infrequent access tiers and charges less for objects that are not being accessed. In the same way, there are the Infrequent Access classes and the Glacier classes. These cost less to store, but you pay retrieval fees, the designed availability is slightly lower, and for the archive classes retrieval can take minutes to hours.
We can manage the storage class by setting lifecycle rules on the bucket. Lifecycle rules are based on the age of an object, not on when it was last read. Let's say you set a rule to move objects to an Infrequent Access class 30 days after they are created. Similarly, you can set a rule to move them to a Glacier class after 60 days. You can handle it according to your own need. If you want transitions based on actual access patterns, that is what the Intelligent-Tiering class does.
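The 30-day and 60-day rules described above can be sketched as a lifecycle configuration in Python. The rule ID, the function names and the choice of STANDARD_IA and GLACIER as target classes are my assumptions for illustration. Only apply_lifecycle_rules calls AWS, and it assumes boto3 is installed and credentials are configured.

```python
def build_lifecycle_rules(ia_after_days=30, glacier_after_days=60, prefix=""):
    """Build a lifecycle configuration that moves objects by age.

    "Days" counts from object creation, not from the last access.
    An empty prefix applies the rule to every object in the bucket.
    """
    return {
        "Rules": [
            {
                "ID": "move-older-objects-to-cheaper-classes",  # hypothetical name
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [
                    {"Days": ia_after_days, "StorageClass": "STANDARD_IA"},
                    {"Days": glacier_after_days, "StorageClass": "GLACIER"},
                ],
            }
        ]
    }


def apply_lifecycle_rules(bucket, rules):
    """Attach the rules to a bucket (replaces its existing lifecycle config)."""
    import boto3

    boto3.client("s3").put_bucket_lifecycle_configuration(
        Bucket=bucket, LifecycleConfiguration=rules
    )


rules = build_lifecycle_rules()
print(rules["Rules"][0]["Transitions"])
```

Building the configuration separately from applying it makes it easy to review the rules before they touch a real bucket.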
AWS S3 Security:
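AWS designs S3 for 99.999999999% (11 nines) durability of objects. This number is about durability, meaning how unlikely it is that a stored object is lost, and not about security. For security, new buckets are private by default, objects are encrypted at rest, and access is controlled with IAM policies, bucket policies and the Block Public Access settings. Data leaks from S3 usually come from misconfigured permissions rather than from the AWS infrastructure, so you still have to set up access carefully.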
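To get a feel for what 11 nines of durability means, here is a small arithmetic sketch. It assumes the commonly quoted design target of 99.999999999% annual durability per object and treats object losses as independent, which is a simplification. The object count is just an example.

```python
from fractions import Fraction

# Assumed design target: 11 nines of annual durability per object,
# i.e. a 1 in 10**11 chance of losing a given object in a year.
ANNUAL_LOSS_PROBABILITY = Fraction(1, 10**11)


def expected_losses_per_year(object_count):
    """Expected number of objects lost per year (independence assumed)."""
    return object_count * ANNUAL_LOSS_PROBABILITY


def years_per_lost_object(object_count):
    """Average number of years between single-object losses."""
    return 1 / expected_losses_per_year(object_count)


objects = 10_000_000  # example: ten million stored objects
print(expected_losses_per_year(objects))  # fraction of an object per year
print(years_per_lost_object(objects))     # years until one expected loss
```

With ten million objects, the expected loss works out to one object every 10,000 years. None of this says anything about who is allowed to read the data.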
AWS S3 Is More Than Storage: How?
AWS S3 is not just a storage service in the cloud. We can build many other things on top of it. Let's look at some examples.
Let's say we have built a Machine Learning application on the cloud that takes images from the user and processes them. After processing, it informs the user that the image is ready to download. We can store the image uploaded by the user in AWS S3. Then, we can set up event notifications on the bucket. We create a Lambda function that is invoked whenever an object is uploaded to S3. The Lambda function processes the image and writes the result back to S3, under a different prefix or in a different bucket so that the output does not trigger the function again. AWS AppSync can then notify the user that the processed image is ready to download.
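The Lambda side of this flow can be sketched as follows. This is a minimal sketch under assumptions: the uploads/ and processed/ prefixes are made up and process_image is a placeholder that returns the bytes unchanged. The handler itself needs boto3 and a real S3 event, while the helper functions are plain Python. Object keys arrive URL-encoded in S3 event notifications, so the sketch decodes them first.

```python
import posixpath
from urllib.parse import unquote_plus

OUTPUT_PREFIX = "processed/"  # hypothetical prefix for results


def uploaded_objects(event):
    """Yield (bucket, key) for every record in an S3 event notification."""
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        key = unquote_plus(record["s3"]["object"]["key"])  # keys are URL-encoded
        yield bucket, key


def output_key(key):
    """Where the processed copy of an uploaded object is written."""
    return OUTPUT_PREFIX + posixpath.basename(key)


def process_image(data):
    """Placeholder for the real image processing; returns the bytes unchanged."""
    return data


def handler(event, context):
    """Lambda entry point: process each uploaded image and store the result."""
    import boto3

    s3 = boto3.client("s3")
    written = []
    for bucket, key in uploaded_objects(event):
        if key.startswith(OUTPUT_PREFIX):
            continue  # never reprocess our own output
        body = s3.get_object(Bucket=bucket, Key=key)["Body"].read()
        s3.put_object(Bucket=bucket, Key=output_key(key), Body=process_image(body))
        written.append(output_key(key))
    return written


# Trimmed-down sample event with a hypothetical bucket and key.
sample_event = {
    "Records": [
        {"s3": {"bucket": {"name": "my-example-bucket"},
                "object": {"key": "uploads/my+cat%281%29.png"}}}
    ]
}
print(list(uploaded_objects(sample_event)))
```

Skipping keys under the output prefix is a simple guard against the function triggering itself; scoping the event notification to the uploads/ prefix is another way to get the same effect.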
Another example: we have a situation where we don't need to store our data in a database. We can store the data in AWS S3 and analyze it there. We create an AWS Glue crawler that crawls our bucket and registers the data as tables in the Glue Data Catalog, and then we can use AWS Athena to query the data with SQL.
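Once the crawler has created a table, querying it from code could look like the sketch below. The database name, table name and results location are hypothetical placeholders, and run_query assumes boto3 is installed and credentials are configured. Athena writes query results to an S3 location, which is why an output location is part of the request.

```python
def build_athena_request(database, table, output_location, limit=10):
    """Build the arguments for an Athena query over a crawled table."""
    return {
        "QueryString": f'SELECT * FROM "{table}" LIMIT {int(limit)}',
        "QueryExecutionContext": {"Database": database},
        "ResultConfiguration": {"OutputLocation": output_location},
    }


def run_query(request):
    """Start the query and return its execution id (the query runs asynchronously)."""
    import boto3

    athena = boto3.client("athena")
    return athena.start_query_execution(**request)["QueryExecutionId"]


# Hypothetical names, for illustration only.
request = build_athena_request(
    database="sales_db",
    table="orders",
    output_location="s3://my-example-bucket/athena-results/",
)
print(request["QueryString"])
```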
Another example: we want to do near-real-time analysis of stock market data. We can extract data from the data source and use AWS Kinesis Firehose to push it to AWS S3. Firehose buffers the incoming records and writes them to the bucket in batches. From S3, we can load the data into the desired visualization tool.
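The producer side of this pipeline can be sketched as below. The stream name and the shape of a tick are assumptions for illustration, and send_tick needs boto3 plus an existing Firehose delivery stream that targets the bucket. Each record is written as one JSON line so the files that land in S3 are easy to parse later.

```python
import json


def to_firehose_record(tick):
    """Serialize one tick as a newline-terminated JSON record."""
    line = json.dumps(tick, sort_keys=True) + "\n"
    return {"Data": line.encode("utf-8")}


def send_tick(stream_name, tick):
    """Push one tick to a Firehose delivery stream that delivers to S3."""
    import boto3

    boto3.client("firehose").put_record(
        DeliveryStreamName=stream_name,
        Record=to_firehose_record(tick),
    )


# Hypothetical tick from the data source.
tick = {"symbol": "ABC", "price": 101.5}
print(to_firehose_record(tick))
```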