Instructors: D. Gannon (primary) and B. Plale. Includes invited lectures.
Computational science has emerged as the third leg of science, along with theory and experiment. Biology, chemistry, physics, economics, and medicine all have strong computational science research arms. This science often requires substantial collaboration between computer scientists and domain specialists. The domain scientist often has deep domain expertise, while the computer scientist brings expertise in issues like security, threading and concurrency, and distributed computing.
In this seminar we will look at the key elements of contemporary, grid-based technology for computational science. The students will be required to read a variety of papers and book chapters from the literature. There will be 4 Òprogramming projectsÓ that will use many of the current technologies. Students in the course will work in teams, where students are assembled into interdisciplinary teams.
Students will learn the state-of-the-art in research concepts and technologies. They will learn to communicate across domain boundaries to achieve a common end. This course is open to all students with graduate standing. It will benefit the domain investigator, the computation generalist, and the computer scientists specialist. It will appeal to the computer scientist interested in advancing the field of systems computer science, the informatician or the domain specialist interested in computational science applied to their domain.
Prerequisites: Java, C or C# programming experience and knowledge of XML is required for the computer science students. For other students: consent of the instructor. In these cases background in an application domain or other informatics focus is required.
Week 1
1. 8-29-06 Lecture 1. An introduction to e-Science.
2. 8-31-06 Lecture 2. The Open Grid Service Architecture.
Readings.
1. Science Portals. Part1
2. The Anatomy of the Grid: Enabling Scalable Virtual Organizations. I. Foster, C. Kesselman, S. Tuecke. International J. Supercomputer Applications, 15(3), 2001. Download pdf
3. The Physiology of the Grid: An Open Grid Services Architecture for Distributed Systems Integration. I. Foster, C. Kesselman, J. Nick, S. Tuecke, download pdf
Week 2.
More Grid Details and Portal Introduction.
1. 9-05-06. Lecture 3. Grid Security
2. 9-07-06. Lecture 4. Introduction to Portals & Portal Project Description.
Readings. The readings for this week are draft chapters from the portal book. These are drafts, so your comments will help us! Please email detailed comments/suggestions to gannon@cs.indiana.edu.
Week 3.
Portal deployment.
1. 9-11-06. Deploying a Portal
2. 9-13-06. Adding a Portlet.
Readings. Look at the material ÒInstalling a PortletÓ and the Grid certificate notes in the on-course resources.
Week 4.
Project 1 due. Click here for details. Project 2 announced.
1. 9-19-06. Special Guest Lecture: Michael Russell. Grid Sphere Architect.
2. 9-21-06. Intro to xml and the next project.
Week 5.
1. 9-25-06. Introduction to Web Services
2. 9-27-06. Web Services and Amazon S3
Week 6
1. 10-03-06. The Big Picture
and Accessing Remote Data
2. 10-05-06. Writing a Portlet for S3
Week 7
Project 2a due. Project 2b described. You can find the TreePortlet code here.
1. 10-10-06. More on the Portlet
2. 10-12-06. Invoking a Service Through the Portal Part.1
Week 8
Project 3 due.
1. 10-17-06. Invoking a Service Through the Portal
2. 10-19-06. Prof. Plale. Fundamentals of Data Mining. Part 1
Week 9
1. 10-24-06. Prof. Plale. Fundamentals of Data Mining. Part 2
2. 10-26-06. Prof. Plale. Fundamentals of Data Mining. Part 3
Week 10
1. 10-31-06. Prof. Plale. Fundamentals of Data Mining Part 4
2. 11-02-06. Introducing Workflow
Week 11
Project 4 announced
1. 11-07-06. Building application services. Slides are here.
2. 11-09-06. More on application services in workflows.
Week 12
1. 11-14-06. Pub-Sub Notification Systems
2. 11-16-06. WS-Eventing tools
Week 13
Project 4 due. Project 5 announced
1. 11-21-06. Final Project and Large-Scale e-science
2. 11-23-06. Thankgiving
Week 14
1. 11-28-06. XBaya
2. 11-30-06. Final Project Notes
Week 15
3. 12-05-06. Provenance
4. 12-07-06. Final Exam
5 weeks
Basic Grid concepts:
Security and virtualization of computation, service oriented architectures
publish/subscribe notification.
Basic Portal concepts: security, portal context, portlets
Access to remote data.
Data search and access to personal metadata catalogs.
Reading: several chapters from the Òportal bookÓ.
Project 1. Customizing a simple portal.
Part 2. Scientific Workflow.
4 weeks
Basic Concepts of Workflow Management for Scientific Applications.
Applications as services
Composing services
Reading: several chapters from the Òworkflow bookÓ.
Project 2. Using Workflow Tools (BPEL/XBAYA and Kepler and Taverna)
2 weeks.
Basic Concepts.
Composing Data Mining Applications.
Reading: TBD
Project 3. A data mining workflow.
3 weeks.
Data virtualization, metadata models.
Data Streams and data stream filters.
Reading: TBD
Project 4. Calder
2 weeks
non-browser-based problem solving environments.
Application component containers.
A look at a domain specific problem solving environment.