Large-scale distributed systems

Teachers

Summary

Large scale distributed infrastructures leverage the high performance networks to federate computing, data and scientific resources from multiple institutions interconnected through the Internet. Distributed computing technologies have undergone a very fast evolution these last years and the infrastructure deployed have become a critical tool in many scientific disciplines. This lecture describes the foundation of distributed computing infrastructures. It introduces the main computing models exploited in Grids and Clouds to evolve from cluster computing towards more virtualized resources and across-institutional user communities. The main problems encountered when deploying such very large scale infrastructures are discussed: users identification and authorization, security of data and computations, heterogeneity of resources, redundancy and fault tolerance, deployment, management, and computation flow control… The most wide spread technologies and their associated middlewares are reviewed. Several examples illustrate the concepts introduced.

Objectives

  • become familiar with large-scale distributed computing infrastructures
  • learn distributed computing principles and underlying technologies
  • identify distributed computing capabilities and limitations
  • design performing distributed applications
  • be alert to emerging technologies and research trends

Content

Duration: 8 weeks

Week 1: Distributed computing and models

Slides

  • Introduction, definitions
  • Different kinds of grid, PC clusters, Internet grids
  • Parallel and distributed computing
  • Capabilities and limitations - sharing resources
  • Application area examples
  • Virtual Organizations
  • Distributed computing models
  • Application models

Week 2: Remote services

Slides

  • Services and interoperability
  • Web Services
  • OGSI / OGSA
  • WS-RF, WS-*
  • Platform as a service, Cloud computing

Week 3: Grid infrastructures

Slides

  • Research and production infrastructures
  • Middleware development
  • Deployment
  • Operations

Week 4: Workload and performance modeling

Slides

  • Job Submission System
  • Mathematical tools
  • Probabilistic models
  • Exploitation

Week 5: Workflows

Slides

  • Application workflows
  • Representation of data-intensive workflows
  • Parallel enactment

Week 6: Security

Slides

  • Cryptography background
  • Authentication
  • Authorization
  • Access control policies

Week 7: Data management

Slides

  • Distributed data management
  • Distribution, replication
  • P2P

Week 8: Evaluation

2009_2010/si5/datagrid/start.txt · Dernière modification: 2012/02/14 15:35 par montagnat
chimeric.de = chi`s home Creative Commons License Valid CSS Driven by DokuWiki do yourself a favour and use a real browser - get firefox!! Recent changes RSS feed Valid XHTML 1.0