Astronomy Computing Today

A Tale of 160 Scientists, Three Applications, One Workshop and A Cloud

Advertisements

This is the text of a presentation I gave at the 22nd Annual Astronomical Data Analysis Software and Systems Conference, held in Champaign, Illinois (Nov 4-8, 2012).  You may download the slides here: Sagan e-Science 2012 gbb best.

The work was performed in collaboration with Carolyn Brinkworth, Dawn Gelino and Dennis Wittman (NASA Exoplanet Science Institute, Caltech), Ewa Deelman, Gideon Juve and Mats Rynge (Information Sciences Institute, USC), and Jamie Kinney (Amazon.com,Inc).

1. Introduction

The NASA Exoplanet Science Center (NEXScI) hosts the Sagan Workshops, annual themed conferences aimed at introducing the latest techniques in exoplanet astronomy to young researchers. The workshops emphasize interaction with data, and include hands-on sessions where participants use their laptops to follow step-by-step tutorials given by experts.

The 2012 workshop had the theme “Working With Exoplanet Light Curves, “and posed special challenges for the conference organizers because the three applications chosen for the tutorials run on different platforms, and because over 160 persons attended, much the largest attendance to date. One of the applications, PyKE, is a suite of Python tools designed to reduce and analyze Kepler light curves; called from PyRAF  or from the Unix command line. The Transit Analysis Package (TAP) uses Markov Chain Monte Carlo (MCMC) techniques to fit light curves under the Interactive Data Language (IDL) environment, and Transit Timing Variations (TTV) uses IDL tools and Java-based GUIs to confirm and detect exoplanets from timing variations in light curve fitting.

Rather than attempt to run these diverse applications on the inevitable wide range of environments on attendees’ laptops, the conference organizers, in consultation with the Virtual Astronomical Observatory, chose instead to run the applications on the Amazon Elastic Cloud 2 (EC2). This paper summarizes the system architecture, the Amazon resources consumed, and lessons learned and best practices.
2.    The System Architecture
The Sagan Workshop took advantage of the EC2’s capabilities to support Virtual Ma- chines (VMs) that can be customized to meet local needs, then replicated, and then released on completion of the jobs. Fig 1 shows the system architecture developed to support the Sagan Workshop.

Participants logged into one of four servers dedicated to the workshop via a Virtual Network Connection (VNC). The Amazon Elastic Block Storage (EBS) system and the Network File System (NFS) were used to share common datasets and user home directories across all virtual machines. An IDL license server at IPAC used ssh to provide the licenses to the servers. The list following describes the architecture component by component and the rationale for the design choices.

3.    Cost of Using the Amazon EC 2 Cloud
Had the Sagan Workshop’s Amazon EC2 costs not been met by an Educational Grant, the total cost of installation, testing and running the workshop sessions would have been $2,876. The breakdown of the costs is shown in Table 1.

4. Lessons Learned And Best Practices
These may be summarized as follows:

Advertisements

Advertisements