AWS DataSync makes it simple and fast to move large amounts of data online between on-premises storage and Amazon S3, Amazon Elastic File System (Amazon EFS), or Amazon FSx for Windows File Server. Manual tasks related to data transfers can slow down migrations and burden IT operations. DataSync eliminates or automatically handles many of these tasks, including scripting copy jobs, scheduling and monitoring transfers, validating data, and optimizing network utilization. The DataSync software agent connects to your Network File System (NFS) and Server Message Block (SMB) storage, so you don’t have to modify your applications. DataSync can transfer hundreds of terabytes and millions of files at speeds up to 10 times faster than open-source tools, over the internet or AWS Direct Connect links. You can use DataSync to migrate active data sets or archives to AWS, transfer data to the cloud for timely analysis and processing, or replicate data to AWS for business continuity. Getting started with DataSync is easy: deploy the DataSync agent, connect it to your file system, select your AWS storage resources, and start moving data between them. You pay only for the data you move.
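The getting-started steps above (connect the agent to your file system, select an AWS storage resource, start moving data) can be sketched with the AWS SDK for Python. This is a minimal illustration, not a complete setup: it assumes an agent is already deployed and activated, and every ARN, hostname, path, and the task name are hypothetical placeholders you would replace with your own values.

```python
def start_nfs_to_s3_transfer(client, agent_arn, nfs_host, nfs_path,
                             bucket_arn, role_arn):
    """Create an NFS source, an S3 destination, and a task, then run it.

    `client` is expected to be a DataSync client, for example
    boto3.client("datasync"). All argument values are supplied by the caller;
    nothing here is specific to any real environment.
    """
    # Source: the on-premises NFS export, reached through the deployed agent.
    source = client.create_location_nfs(
        ServerHostname=nfs_host,
        Subdirectory=nfs_path,
        OnPremConfig={"AgentArns": [agent_arn]},
    )
    # Destination: an S3 bucket, accessed through an IAM role.
    destination = client.create_location_s3(
        S3BucketArn=bucket_arn,
        S3Config={"BucketAccessRoleArn": role_arn},
    )
    # A task ties the source and destination together.
    task = client.create_task(
        SourceLocationArn=source["LocationArn"],
        DestinationLocationArn=destination["LocationArn"],
        Name="nfs-to-s3-example",  # hypothetical task name
    )
    # Starting an execution begins the actual data movement.
    execution = client.start_task_execution(TaskArn=task["TaskArn"])
    return execution["TaskExecutionArn"]
```

Passing the client in as an argument keeps the sketch free of credentials and lets you try it against a stub before pointing it at a real account.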
Simplify and automate transfers
AWS DataSync makes it easy for you to move data over the network between on-premises storage and AWS. DataSync automates both the management of data transfer processes and the infrastructure required for high-performance, secure data transfer. The service also includes automatic encryption and data integrity validation. All of this minimizes the in-house development and management otherwise needed for fast, reliable, and secure transfers.
Move data 10x faster
Transfer data rapidly over the network into AWS, up to 10 times faster than is common with open-source tooling. DataSync uses a purpose-built network protocol and a parallel, multi-threaded architecture to accelerate your transfers. This speeds up migrations, recurring data processing workflows for analytics and machine learning, and data protection processes.
Reduce operational costs
You can move data cost-effectively with DataSync’s flat, per-gigabyte pricing. You’ll also save on script development and management costs, and avoid the need for costly commercial transfer tools.
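Because pricing is flat and per gigabyte, a rough transfer estimate is a single multiplication. The sketch below shows the arithmetic only. The rate is a made-up placeholder, not a published price, so check the current DataSync pricing for your Region before relying on any number.

```python
def estimate_transfer_cost(gigabytes, rate_per_gb):
    """Return the estimated charge for moving `gigabytes` at a flat per-GB rate."""
    if gigabytes < 0 or rate_per_gb < 0:
        raise ValueError("gigabytes and rate_per_gb must be non-negative")
    return gigabytes * rate_per_gb


# Hypothetical rate in dollars per gigabyte, used purely for illustration.
HYPOTHETICAL_RATE_PER_GB = 0.01

# Example: a 50 TB migration, taking 1 TB as 1,000 GB.
estimate = estimate_transfer_cost(50 * 1000, HYPOTHETICAL_RATE_PER_GB)
print(f"Estimated transfer charge: ${estimate:,.2f}")
```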
How it works
If you are closing data centers or retiring storage arrays, you can use DataSync to move active data sets or archives rapidly over the network into Amazon S3, Amazon EFS, or Amazon FSx for Windows File Server. DataSync performs both full initial copies and incremental transfers of changing data. It also includes encryption and integrity checking to help make sure your data arrives securely, intact, and ready to use. You can also pair DataSync with AWS Snowball Edge: use DataSync to copy active, changing data, and Snowball Edge to migrate static data to Amazon S3.
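In the AWS SDK for Python, incremental copying and integrity checking are controlled through the `Options` argument of `create_task`. The helper below is a hypothetical convenience function, shown only to illustrate which option values correspond to the behavior described above.

```python
def migration_task_options(incremental=True):
    """Build an Options dict for a DataSync task.

    incremental=True  -> copy only data that differs from the destination.
    incremental=False -> copy everything without comparing first.
    """
    return {
        "TransferMode": "CHANGED" if incremental else "ALL",
        # Verify the destination against the source at the end of the transfer.
        "VerifyMode": "POINT_IN_TIME_CONSISTENT",
    }
```

You would pass the result as `Options=migration_task_options()` when creating the task.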
Data processing for hybrid workloads
If you have on-premises systems generating or using data that needs to move into or out of AWS for processing, you can use DataSync to accelerate and schedule the transfers. It can help speed up critical hybrid cloud workflows in industries that need to move active files into AWS quickly, including video production in media and entertainment, seismic research in oil and gas, machine learning in life science, and big data analytics in finance.
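For recurring workflows, a task can carry a schedule. The sketch below assembles the arguments for `create_task` in the AWS SDK for Python. The function name, task name, and ARNs are hypothetical, and the hourly schedule is only an example value.

```python
def scheduled_task_params(source_arn, destination_arn, schedule_expression):
    """Build keyword arguments for a DataSync create_task call with a schedule."""
    return {
        "SourceLocationArn": source_arn,
        "DestinationLocationArn": destination_arn,
        "Name": "hybrid-workflow-example",  # hypothetical task name
        "Schedule": {"ScheduleExpression": schedule_expression},
    }


# Example: run at the top of every hour (AWS six-field cron syntax).
params = scheduled_task_params(
    "arn:aws:datasync:example:source",        # placeholder ARN
    "arn:aws:datasync:example:destination",   # placeholder ARN
    "cron(0 * * * ? *)",
)
```

With a real client, `client.create_task(**params)` would register the task, and DataSync would then start executions on that schedule.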
Archiving of cold data
If you have large amounts of cold data stored in expensive on-premises storage systems, you can move this data directly to durable and secure long-term storage such as Amazon S3 Glacier or Amazon S3 Glacier Deep Archive. This will allow you to free up on-premises storage capacity and shut down legacy storage systems.
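When the destination is Amazon S3, the storage class is chosen on the S3 location. The sketch below builds arguments for `create_location_s3` in the AWS SDK for Python so that objects land directly in an archive class. The helper name and the ARNs are hypothetical.

```python
# Storage class values for the two archive tiers named above.
ARCHIVE_STORAGE_CLASSES = {"GLACIER", "DEEP_ARCHIVE"}


def archive_location_params(bucket_arn, role_arn, storage_class="DEEP_ARCHIVE"):
    """Build keyword arguments for an S3 location that writes to an archive class."""
    if storage_class not in ARCHIVE_STORAGE_CLASSES:
        raise ValueError(f"not an archive storage class: {storage_class!r}")
    return {
        "S3BucketArn": bucket_arn,
        "S3StorageClass": storage_class,
        "S3Config": {"BucketAccessRoleArn": role_arn},
    }
```

With a real client, `client.create_location_s3(**archive_location_params(bucket_arn, role_arn))` would create the destination location.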
Replication for business continuity
If you have large Network Attached Storage (NAS) systems, you likely have a lot of files to protect, either with replication or backup to a second hardware stack. With DataSync, you can replicate files into all Amazon S3 storage classes, and select the most cost-effective storage class for your needs. Or, you can send the data to Amazon EFS or Amazon FSx for Windows File Server for a standby file system.
“At Celgene, our research teams are focused intently on the discovery and development of treatments for cancer and other severe conditions. AWS is an integral part of our innovation process, and for our IT teams that means using as many AWS services as we can, to eliminate the operational and cost burdens of running infrastructure and tooling that distract us from supporting drug discovery. Our labs generate petabytes of data – irreplaceable intellectual property – and we use AWS DataSync to get the data into Amazon S3 and Amazon EFS easily, quickly and cost-effectively. Without the data in AWS, there’s no way we could innovate as fast. AWS DataSync works with my existing storage systems, and efficiently uses as much bandwidth as we can give it to get our data safely into AWS.”
Lance Smith, Director of Research Computing - Celgene