Jump to content

High-throughput computing

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Friendlydata (talk | contribs) at 16:32, 4 May 2008 (fixed typo (minor)). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

High-throughput computing (HTC) is a computer science term to describe the use many computing resources over long periods of time to accomplish a computational task.

Challenges

The HTC community is also concerned with robustness and reliability of jobs over a long-time scale. That is, being able to create a reliable system from unreliable components. This research is similar to transaction processing, but at a much larger and distributed scale.

Some HTC systems, such as Condor and PBS, can run tasks on opportunistic resources. It is a difficult problem, however, to operate in this environment. On one-hand the system needs to provide a reliable operating environment for user's jobs, but at the same time the system must not compromise the integrity of the execute node and allow the owner to always have full control of their resources.

High-throughput computing vs. high-performance computing

There are many differences between high-throughput computing and high-performance computing (HPC). HPC tasks are characterized as needing large of amounts of computing power for short periods of time, whereas HTC tasks also require large amounts of computing, but for much longer times (months and years, rather than hours and days) [1]. HPC environments are often measured in terms of FLOPS. The HTC community, however, is not concerned about operations per second, but rather operations per month or per year. Therefore, the HTC field is more interested in how many jobs can be completed over a long period of time instead of how fast an individual job can complete.

As a general rule, HPC systems are tightly coupled parallel jobs, and as such they must execute within a particular site with low-latency interconnects. Conversely, HTC systems are independent, sequential jobs that can be individually scheduled on many different computing resources across multiple administrative boundaries. HTC systems achieve this using various grid computing technologies and techniques.

See also

HTC is not only for sequential jobs but also for parallel jobs.

References

  1. ^ Beck, Alan (1997-06-27). "High Throughput Computing: An Interview with Miron Livny". HPCWire. {{cite web}}: Check date values in: |date= (help)