Maybe you use S3 daily and already know these facts, or maybe you have forgotten them. Either way, my idea is to give you a quick recap, or some new knowledge.
1. S3 ETags are used to track changes to objects
Note: ETags are useful if you want to programmatically detect content changes to S3 objects.
An entity tag (ETag) is a response header that identifies a specific version of an object, so a client can tell whether the object has changed without downloading it. ETags are part of the HTTP protocol. An S3 ETag reflects changes only to the contents of an object, not its metadata. For objects uploaded in a single request without SSE-KMS or SSE-C encryption, the ETag is an MD5 digest of the content, which lets clients quickly verify the integrity of downloaded or uploaded data. For multipart uploads, the ETag is not a plain MD5 digest.
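As a minimal sketch of detecting content changes with an ETag, the snippet below compares local bytes against an ETag value. It assumes the object was uploaded in a single request without SSE-KMS or SSE-C, so the ETag is the hex MD5 of the content. The function names are hypothetical, and the check deliberately gives no answer for multipart-style ETags.

```python
import hashlib
from typing import Optional


def local_etag(data: bytes) -> str:
    """Hex MD5 of the content, the form S3 uses for simple single-part uploads."""
    return hashlib.md5(data).hexdigest()


def content_matches_etag(data: bytes, etag: str) -> Optional[bool]:
    """Compare local bytes against an ETag header value.

    Returns None when the ETag looks like a multipart ETag ("<hex>-<parts>"),
    because that value is not a plain MD5 of the content.
    """
    value = etag.strip('"')  # S3 returns the ETag wrapped in double quotes
    if "-" in value:
        return None
    return local_etag(data) == value.lower()
```

A `None` result means "cannot tell from the ETag alone", in which case a checksum stored alongside the object is a safer way to verify integrity.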
2. S3 Object Lock
S3 Object Lock provides WORM (Write Once Read Many) functionality. Once an object version is locked, it cannot be deleted or overwritten for a fixed retention period or, with a legal hold, indefinitely. Object Lock requires versioning on the bucket.
This is useful for:
- Compliance requirements
- Regulatory requirements
- Legal hold requirements
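A rough sketch of setting a fixed retention period: the helper below builds the `Retention` argument that boto3's `put_object_retention` accepts. The helper name, bucket name, and key are hypothetical, and the commented call assumes boto3 is installed and Object Lock is enabled on the bucket.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

VALID_MODES = {"GOVERNANCE", "COMPLIANCE"}


def retention_settings(days: int, mode: str = "GOVERNANCE",
                       now: Optional[datetime] = None) -> dict:
    """Build the Retention argument for a put_object_retention call."""
    if mode not in VALID_MODES:
        raise ValueError(f"mode must be one of {sorted(VALID_MODES)}")
    if days <= 0:
        raise ValueError("days must be positive")
    start = now or datetime.now(timezone.utc)
    return {"Mode": mode, "RetainUntilDate": start + timedelta(days=days)}


# Assumed usage with boto3 (not executed here):
#   s3 = boto3.client("s3")
#   s3.put_object_retention(Bucket="my-bucket", Key="report.pdf",
#                           Retention=retention_settings(365, "COMPLIANCE"))
```

Governance mode lets users with special permissions override the lock, while compliance mode does not, so the stricter mode is worth testing on throwaway objects first.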
3. S3 Lifecycle Policies
S3 Lifecycle policies can automatically transition objects between storage classes or delete them based on rules you define. This can significantly reduce storage costs.
Common transitions:
- Standard → Standard-IA (Infrequent Access) after 30 days
- Standard-IA → Glacier after 90 days
- Glacier → Deep Archive after 180 days
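The transitions above could be expressed as a lifecycle rule roughly like this. It is a sketch that assumes the day counts are measured from object creation (which is how lifecycle `Days` work) and uses a hypothetical rule ID. The resulting dict has the shape boto3's `put_bucket_lifecycle_configuration` expects.

```python
def lifecycle_configuration(prefix: str = "") -> dict:
    """Lifecycle rules that tier objects down as they age."""
    return {
        "Rules": [
            {
                "ID": "tier-down-example",      # hypothetical rule name
                "Status": "Enabled",
                "Filter": {"Prefix": prefix},   # empty prefix = whole bucket
                "Transitions": [
                    {"Days": 30, "StorageClass": "STANDARD_IA"},
                    {"Days": 90, "StorageClass": "GLACIER"},
                    {"Days": 180, "StorageClass": "DEEP_ARCHIVE"},
                ],
            }
        ]
    }


# Assumed usage with boto3 (not executed here):
#   s3.put_bucket_lifecycle_configuration(
#       Bucket="my-bucket",
#       LifecycleConfiguration=lifecycle_configuration("logs/"))
```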
4. S3 Transfer Acceleration
S3 Transfer Acceleration uses CloudFront’s edge locations to accelerate transfers to and from S3. Data enters the AWS network at a nearby edge location and travels to the bucket over the AWS backbone, which can speed up long-distance transfers.
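To make the routing concrete: accelerated requests go to a dedicated endpoint, `<bucket>.s3-accelerate.amazonaws.com`. The helper below is a hypothetical sketch that only builds that URL. It assumes acceleration is already enabled on the bucket, and real requests still need to be signed.

```python
from urllib.parse import quote


def accelerate_url(bucket: str, key: str) -> str:
    """Build the Transfer Acceleration URL for an object."""
    if "." in bucket:
        # Accelerated buckets must have DNS-compliant names without periods
        raise ValueError("Transfer Acceleration does not support bucket names containing periods")
    return f"https://{bucket}.s3-accelerate.amazonaws.com/{quote(key)}"
```

With boto3, the same effect is usually achieved by creating the client with `Config(s3={"use_accelerate_endpoint": True})` rather than building URLs by hand.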
5. S3 Versioning
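When versioning is enabled, S3 keeps every version of an object. Overwriting an object creates a new version, and deleting it adds a delete marker instead of removing the data, so earlier versions stay recoverable. This protects against accidental deletions and overwrites.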
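The behavior is easier to see in a toy model. The class below is not the S3 API, just a small in-memory illustration (all names are made up) of how overwrites and deletes stack up as versions and how a delete marker hides an object without destroying it.

```python
import itertools


class ToyVersionedBucket:
    """In-memory model of versioning semantics; not the S3 API."""

    def __init__(self):
        self._versions = {}            # key -> list of (version_id, body or None)
        self._ids = itertools.count(1)

    def _append(self, key, body):
        version_id = f"v{next(self._ids)}"
        self._versions.setdefault(key, []).append((version_id, body))
        return version_id

    def put(self, key, body: bytes) -> str:
        """Every write adds a new version instead of replacing the old one."""
        return self._append(key, body)

    def delete(self, key) -> str:
        """A plain delete adds a delete marker (body None); old versions remain."""
        return self._append(key, None)

    def get(self, key, version_id=None) -> bytes:
        versions = self._versions.get(key, [])
        if version_id is None:
            # Latest version wins; a delete marker on top means "not found"
            if not versions or versions[-1][1] is None:
                raise KeyError(key)
            return versions[-1][1]
        for vid, body in versions:
            if vid == version_id and body is not None:
                return body
        raise KeyError((key, version_id))
```

After a `delete`, a plain `get` fails, but asking for an earlier version ID still returns the old content.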
6. S3 Requester Pays
With Requester Pays buckets, the requester, not the bucket owner, pays for requests and data transfer. The bucket owner still pays for storage. This is useful when you want to share large datasets without covering the cost of every download.
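On the requester's side, each request has to acknowledge the charge. A minimal sketch, assuming `s3_client` is a boto3 S3 client (or anything with a compatible `get_object` method); the function name is hypothetical.

```python
def download_as_requester(s3_client, bucket: str, key: str) -> bytes:
    """Fetch an object from a Requester Pays bucket.

    RequestPayer="requester" tells S3 the caller accepts the request and
    data transfer charges; without it, S3 rejects the request.
    """
    response = s3_client.get_object(
        Bucket=bucket,
        Key=key,
        RequestPayer="requester",
    )
    return response["Body"].read()
```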
7. S3 Event Notifications
S3 can send notifications when certain events occur, such as:
- Object creation
- Object deletion
- Object restoration from Glacier
These notifications can be delivered to Lambda functions, SQS queues, SNS topics, or Amazon EventBridge.
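As a sketch of what a consumer sees, the function below pulls the event name, bucket, and key out of an S3 notification payload, for example inside a Lambda handler. The function name is hypothetical. Object keys arrive URL-encoded in these events, so they are decoded before use.

```python
from urllib.parse import unquote_plus


def summarize_s3_event(event: dict) -> list:
    """Return (event_name, bucket, key) for each record in an S3 notification."""
    results = []
    for record in event.get("Records", []):
        s3_info = record["s3"]
        results.append((
            record["eventName"],                       # e.g. "ObjectCreated:Put"
            s3_info["bucket"]["name"],
            unquote_plus(s3_info["object"]["key"]),    # keys are URL-encoded
        ))
    return results
```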
8. S3 Storage Classes
S3 offers multiple storage classes optimized for different use cases:
- Standard: General purpose, frequently accessed data
- Standard-IA: Infrequently accessed data
- One Zone-IA: Infrequently accessed, non-critical data
- Glacier Instant Retrieval: Archive data with millisecond retrieval
- Glacier Flexible Retrieval: Archive data with retrieval options (expedited in 1-5 minutes, standard in 3-5 hours, or bulk in 5-12 hours)
- Glacier Deep Archive: Lowest cost, with standard retrieval within 12 hours and bulk retrieval within 48 hours
- Intelligent-Tiering: Automatically moves data to the most cost-effective access tier
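In API calls, these classes are selected with a `StorageClass` value. The mapping below pairs each name from the list with its API value. The helper that builds `put_object` arguments is a hypothetical convenience, not part of any SDK.

```python
STORAGE_CLASS_API_VALUES = {
    "Standard": "STANDARD",
    "Standard-IA": "STANDARD_IA",
    "One Zone-IA": "ONEZONE_IA",
    "Glacier Instant Retrieval": "GLACIER_IR",
    "Glacier Flexible Retrieval": "GLACIER",
    "Glacier Deep Archive": "DEEP_ARCHIVE",
    "Intelligent-Tiering": "INTELLIGENT_TIERING",
}


def put_object_arguments(bucket: str, key: str, body: bytes,
                         storage_class_name: str = "Standard") -> dict:
    """Build keyword arguments for a put_object call with a chosen storage class."""
    try:
        api_value = STORAGE_CLASS_API_VALUES[storage_class_name]
    except KeyError:
        raise ValueError(f"unknown storage class: {storage_class_name!r}") from None
    return {"Bucket": bucket, "Key": key, "Body": body, "StorageClass": api_value}


# Assumed usage with boto3 (not executed here):
#   s3.put_object(**put_object_arguments("my-bucket", "old.log", data, "Standard-IA"))
```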
9. S3 Cross-Region Replication (CRR)
CRR automatically replicates objects from one S3 bucket to another in a different AWS Region. Versioning must be enabled on both buckets. CRR helps with:
- Meeting compliance requirements
- Lower-latency access
- Operational reasons
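A replication setup might look roughly like the sketch below, which builds the configuration dict that boto3's `put_bucket_replication` accepts. The rule ID, role ARN, and bucket names are hypothetical, and it assumes an IAM role that S3 can use to replicate on your behalf.

```python
def replication_configuration(role_arn: str, destination_bucket: str) -> dict:
    """Replicate every new object to the destination bucket."""
    return {
        "Role": role_arn,  # IAM role S3 assumes to copy objects
        "Rules": [
            {
                "ID": "replicate-everything-example",   # hypothetical rule name
                "Priority": 1,
                "Status": "Enabled",
                "Filter": {},                           # empty filter = all objects
                "DeleteMarkerReplication": {"Status": "Disabled"},
                "Destination": {"Bucket": f"arn:aws:s3:::{destination_bucket}"},
            }
        ],
    }


# Assumed usage with boto3 (not executed here):
#   s3.put_bucket_replication(
#       Bucket="source-bucket",
#       ReplicationConfiguration=replication_configuration(
#           "arn:aws:iam::123456789012:role/replication-role", "replica-bucket"))
```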
10. S3 Server Access Logging
S3 can log the requests made to a bucket. Log delivery is best effort, so a record may occasionally arrive late or be missing. The logs are stored in a target S3 bucket and can be analyzed for:
- Security auditing
- Access patterns
- Troubleshooting
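Turning logging on could look like the sketch below. It assumes `s3_client` is a boto3 S3 client and that the target bucket already allows the S3 logging service to write to it. The function name and default prefix are hypothetical.

```python
def enable_access_logging(s3_client, source_bucket: str, target_bucket: str,
                          target_prefix: str = "logs/") -> dict:
    """Send access logs for source_bucket to target_bucket under target_prefix."""
    status = {
        "LoggingEnabled": {
            "TargetBucket": target_bucket,
            "TargetPrefix": target_prefix,  # keeps logs grouped under one prefix
        }
    }
    s3_client.put_bucket_logging(Bucket=source_bucket, BucketLoggingStatus=status)
    return status
```

Using a separate target bucket avoids the source bucket logging its own log deliveries.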
This article was 100% written by a human. Perplexity AI was used only for research. Grammar was corrected by Grammarly.