Using Globus with Amazon S3 storage

Summary

How to use Globus to transfer data to/from Amazon S3 storage

Body

Quest and Kellogg Linux Cluster Downtime, December 14 - 18.

Quest, including the Quest Analytics Nodes, the Genomics Compute Cluster (GCC), the Kellogg Linux Cluster (KLC), and Quest OnDemand, will be unavailable for scheduled maintenance starting at 8 A.M. on Saturday, December 14, and ending approximately at 5 P.M. on Wednesday, December 18. During the maintenance window, you will not be able to login to Quest, Quest Analytics Nodes, the GCC, KLC, or Quest OnDemand submit new jobs, run jobs, or access files stored on Quest in any way including Globus. For details on this maintenance, please see the Status of University IT Services page.

Quest RHEL8 Pilot Environment - November 18.

Starting November 18, all Quest users are invited to test and run their workflows in a RHEL8 pilot environment to prepare for Quest moving completely to RHEL8 in March 2025. We invite researchers to provide us with feedback during the pilot by contacting the Research Computing and Data Services team at quest-help@northwestern.edu. The pilot environment will consist of 24 H100 GPU nodes and seventy-two CPU nodes, and it will expand with additional nodes through March 2025. Details on how to access this pilot environment will be published in a KB article on November 18.

The Globus data transfer service can be used to transfer data to and from Amazon's S3 cloud storage service. For more information on why and how to use Amazon S3 storage, see Using Amazon S3 Storage

Setup

To use Globus with Amazon S3, you need to: create a bucket (if you don't have one already), create an IAM access key with permission to write to the bucket, and install the access key on the endpoint.

  1. Create an Amazon S3 bucket
  2. Create an AWS IAM access key that has permission to read from and write to the bucket
  3. Give Globus access to the bucket via the IAM access key you created (see below).

Authenticate in the Globus interface

Once you have an AWS Access Key ID, you can use these credentials to authenticate in the Globus interface. 

  1. First, log into Globus File Manager using your Northwestern NetID
  2. Then search for the Northwestern AWS endpoints. 

    Northwestern has 3 endpoints to transfer data to Amazon Web Service S3 storage.

    These endpoints are now region-agnostic and can be used to transfer data stored in any AWS region. 

  • Northwestern AWS us-east-1 N. Virginia
  • Northwestern AWS us-east-2 Ohio
  • Northwestern AWS us-west-2 Oregon
  1. Once you have selected an endpoint, follow the instructions in Globus's How to Access Your Files on AWS S3 with Globus documentation.

Need help?

Email globus-help@northwestern.edu for additional support in setting up Globus with Amazon S3.

Also see our Research Data Management Guide for links to all of our help articles.

Research Data Management Support at Northwestern University

Research computing data services partners with the Office for Research, University Libraries and Galter Health Sciences Library to provide research data management support throughout the research process. Please see the Research Data Management and Sharing page for more information

Details

Details

Article ID: 1968
Created
Thu 10/6/22 3:33 PM
Modified
Fri 10/25/24 3:22 PM

Related Articles

Related Articles (7)

An outline of advanced features available for Globus
How to initiate file transfers using the Globus data transfer tool
Overview of how to use Globus with links to how to articles.

Related Services / Offerings

Related Services / Offerings (2)

Northwestern faculty, staff, and researchers can request a new public cloud account.
Northwestern has agreements with Amazon Web Services, Google, and Microsoft Azure, to provide discounted cloud hosting options for Northwestern faculty and staff.