<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Bench Press &#187; GPU</title>
	<atom:link href="http://blog.benchside.com/tag/gpu/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.benchside.com</link>
	<description>The Crossroads of Science and Tech</description>
	<lastBuildDate>Thu, 26 Jan 2012 15:00:36 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>What to Do as Science Gets Older and More Crowded</title>
		<link>http://blog.benchside.com/2010/06/what-to-do-as-science-gets-older-and-more-crowded/</link>
		<comments>http://blog.benchside.com/2010/06/what-to-do-as-science-gets-older-and-more-crowded/#comments</comments>
		<pubDate>Wed, 30 Jun 2010 06:59:45 +0000</pubDate>
		<dc:creator>ben</dc:creator>
				<category><![CDATA[Science and the Public]]></category>
		<category><![CDATA[attribution]]></category>
		<category><![CDATA[Benjamin Jones]]></category>
		<category><![CDATA[ChemBioConnect]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[distributed computing]]></category>
		<category><![CDATA[fold.it]]></category>
		<category><![CDATA[Folding@Home]]></category>
		<category><![CDATA[Friendfeed]]></category>
		<category><![CDATA[Google Wave]]></category>
		<category><![CDATA[GPGPU]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[Lone Ranger]]></category>
		<category><![CDATA[NBER]]></category>
		<category><![CDATA[older]]></category>
		<category><![CDATA[ORCID]]></category>
		<category><![CDATA[public repository]]></category>
		<category><![CDATA[teams]]></category>
		<category><![CDATA[wikis]]></category>

		<guid isPermaLink="false">http://blog.benchside.com/?p=1403</guid>
		<description><![CDATA[A recent NBER paper (gated) by Benjamin Jones from Northwestern conducts a systematic review of trends in scientific research and made a couple of conclusions that won’t come as a surprise to anyone in science (HT: Inside Higher Ed): As science advances and knowledge accumulates, ensuing generations of innovators spend longer in training and become [...]]]></description>
			<content:encoded><![CDATA[<p>A recent <a href="http://papers.nber.org/papers/w16002">NBER paper</a> (gated) by <a href="http://www.kellogg.northwestern.edu/faculty/bio/jones_b.htm">Benjamin Jones from Northwestern</a> conducts a systematic review of trends in scientific research and made a couple of conclusions that won’t come as a surprise to anyone in science (HT: <a href="http://www.insidehighered.com/news/2010/05/25/science">Inside Higher Ed</a>):</p>
<blockquote><p>As science advances and knowledge accumulates, ensuing generations of innovators spend longer in training and become more narrowly expert, shifting key innovations (i) later in the life cycle and (ii) from solo researchers toward teams</p></blockquote>
<p>As evidence to this, the average age at which a scientist made a discovery which later qualified for a Nobel prize has increased by 6 years over the course of the 20th century. When looking at publications, the researchers found that the average author list on a publication grew, on average, by 15-20% per decade!</p>
<p>We’ve discussed before the <a href="http://blog.benchside.com/2009/10/the-lone-ranger/">“decline of the Lone Ranger model of science”</a>, but Jones’ paper focuses on looking at the policy implications for such a change. He concludes that the government (and, probably, the academic and private institutions which support researchers) need to adapt policy to reflect this new reality by:</p>
<ul>
<li>Tailoring funding and messaging to help keep young researchers interested despite the longer and more difficult training period</li>
<li>Finding new ways to evaluate the worthiness of proposals as scientist’s expertise becomes more and more specialized</li>
<li>Altering incentive structures as the team of collaborators replaces the Lone Ranger scientist model of discovery</li>
</ul>
<p>These policy suggestions are definitely good ones, and are certainly necessary to adapt to a new scientific environment, but one dimension of this which Jones doesn’t discuss as much are the technological (the focus of this blog!) innovations which can help further research in this brave new world.</p>
<ul>
<li><strong>Improving science communication with the public</strong>. We’ve made <a href="http://blog.benchside.com/2008/11/reaching-out/">multiple</a> <a href="http://blog.benchside.com/2010/05/imax-eye-candy/">mentions</a> of this in the past, but they are no less true here. Active public communications management not only helps secure funding and raise public awareness of the good scientists can do, but it also helps attract the interest of future generations of researchers and policymakers.</li>
<li><strong>Embracing new collaboration tools</strong>. To really kick-start collaboration between scientists across geographies and specialties, we need tools that go beyond just email and fax machines. Tools like <a href="http://wave.google.com/">Google Wave</a>, wikis, distributed version control, and social media forums like <a href="http://www.friendfeed.com/">Friendfeed</a> are an early taste of the sort of live collaboration that new web technology can bring about.</li>
<li><strong>Leveraging cloud computing and heterogeneous compute</strong>. One of the reasons discoveries are taking longer and are more expensive is that there is so much more data to collect and to analyze than before. One technological innovation which we’ve talked about at lengths here is <a href="http://blog.benchside.com/tag/gpu/">the ability of graphics cards/GPUs to make supercomputer-level processing power more readily accessible</a> to research labs. Another is <a href="http://blog.benchside.com/tag/cloud-computing/">the use of new cloud computing services</a> like Amazon’s to rapidly increase the computational resources that a lab/company has access to. Neither are panaceas for all the data analysis issues which scientists face, but they are definitely ways to make things easier for research groups who have stringent IT budgets.</li>
<li><strong>Using crowdsourcing to speed innovation</strong>.<strong> </strong>Who says research has to take longer and be more expensive? Perhaps its time to pull on new technological levers which let scientists borrow on the resources and brains of a wider group of people. While new platforms like <a href="http://blog.benchside.com/2009/08/chembiodrawcrowdsource/">ChemBioConnect</a>, <a href="http://blog.benchside.com/2008/12/distribute-compute/">distributed computing systems</a> like <a href="http://folding.stanford.edu/">Folding@Home</a>, and <a href="http://blog.benchside.com/2009/02/playing-the-crowd/">volunteer crowdsourcing initiatives</a> like <a href="http://fold.it/portal/">Fold.IT</a> are far from perfect, they hint at a future where researchers can call on resources beyond what their personal computers and brains are capable of.</li>
<li><strong>Building new research attribution models</strong>. When I say new attribution models, I’m referring to two things. The first is embodied by new standards like <a href="http://www.orcid.org/homepage">ORCID</a> which make it easier to understand which person is the author/researcher in question (something which will become more and more important as more people with the same initials/names enter the sciences). The second, and more substantive, is finding new ways to understand who contributed what to a particular study. In today’s digital age, I find it laughable that we still rely on simple author list order to determine the relative roles and positions of the researchers listed on a publication. Employing metadata and other graphical cues can help scientists achieve the recognition they deserve, as well as provide appropriate incentives for teams of researchers to contribute.</li>
<li><strong>Contributing negative and after-publication results to open repository</strong>. While I can understand the hesitation for most research groups to pursue a pure open access strategy, those concerns should not hold with negative or post-publication experimental data. While opening up access to data from failed/negative experiments does little to hurt a lab’s ability to publish first, it can be a dramatic boon for other research groups (especially new labs or labs with interdisciplinary focuses) who can not only use the data for their own analyses and experimental designs, but avoid committing resources to experiments which have already been conducted. If it can work <a href="http://blog.benchside.com/2010/01/why-biopharma-should-open-up/">for biotechs and pharma companies</a>, then there’s no reason it should be any different for non-commercial groups.</li>
</ul>
<p>These suggestions only scratch the surface of what new technologies and policies can do to help scientists in a world where scientific training takes longer and where scientific discoveries need to be more collaborative. If anyone else has any other suggestions, feel free to leave them in the comments!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.benchside.com/2010/06/what-to-do-as-science-gets-older-and-more-crowded/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GPU meets spacedust</title>
		<link>http://blog.benchside.com/2009/12/gpu-meets-spacedust/</link>
		<comments>http://blog.benchside.com/2009/12/gpu-meets-spacedust/#comments</comments>
		<pubDate>Wed, 02 Dec 2009 14:00:28 +0000</pubDate>
		<dc:creator>ben</dc:creator>
				<category><![CDATA[technology]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[dust]]></category>
		<category><![CDATA[GPGPU]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[radiative transfer]]></category>
		<category><![CDATA[spacedust]]></category>

		<guid isPermaLink="false">http://blog.benchside.com/?p=1151</guid>
		<description><![CDATA[Dust can be a pain if you’re an astronomer. In the same way that clouds obscure a view of the night-sky, interstellar dust can distort an astronomer’s view (even if through the Hubble telescope) of interesting astronomical phenomena. This problem is compounded when you consider that stars and planets tend to form in dense interstellar [...]]]></description>
			<content:encoded><![CDATA[<p>Dust can be a pain if you’re an astronomer. In the same way that clouds obscure a view of the night-sky, interstellar dust can distort an astronomer’s view (even if through the Hubble telescope) of interesting astronomical phenomena. This problem is compounded when you consider that stars and planets tend to form in dense interstellar dust clouds.</p>
<p>The distortions caused by spacedust are caused by <a href="http://en.wikipedia.org/wiki/Radiative_transfer">radiative transfer</a> – a process of light absorption and scattering which also explains why the sky is blue and why sunsets/sunrises look red. Astronomers have built highly sophisticated models to understand radiative transfer across a wide range of different dust backgrounds. These models have enabled researchers to build very cool simulations, such as this one of two galaxies colliding:</p>
<div id="scid:5737277B-5D6D-4f48-ABFC-DD9C333F4C5D:ab2d8712-13a1-4b95-80ef-0418ebfdebf3" class="wlWriterEditableSmartContent" style="margin: 0px auto; padding: 0px; width: 425px; display: block; float: none;">
<div><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="355" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="src" value="http://www.youtube.com/v/agqLEbOFT2A&amp;hl=en" /><embed type="application/x-shockwave-flash" width="425" height="355" src="http://www.youtube.com/v/agqLEbOFT2A&amp;hl=en"></embed></object></div>
</div>
<p>Interestingly, greater sophistication in our understanding of spacedust and a greater desire for precision and resolution in the modeling has meant that more and more of the computational processing of these radiative transfer models has been spent on calculating dust grain temperatures rather than the math behind the actual radiation transfer (a product of the fact that you need to calculate across many points in the “dust cloud”, across many types/sizes of dust particles, and because you need to iterate many times to find an equilibrium) – something on the order of 10<sup>11</sup>-10<sup>12</sup> exponentials per simulation!</p>
<p>That traditional processors are not well suited for calculating exponentials and that there were simply so many calculations which needed to be done in parallel <a href="http://arxiv.org/abs/0907.3768">convinced researchers to turn to NVIDIA’s CUDA</a> as a potential solution. As we’ve noted before with <a href="http://blog.benchside.com/2009/09/raytracing-radiotherapy/">using raytracing as a means to accelerate radiotherapy dosage calculations</a>, NVIDIA’s CUDA is a standard programming toolset which lets programmers more easily use the power of (NVIDIA) graphics cards for calculations. Because the calculations needed to do high-performance graphics <a href="http://modernwarfare2.infinityward.com/">for a game of Modern Warfare 2</a> are similar to the calculations that supercomputers crunch through, NVIDIA’s CUDA has been demonstrated to be able to accelerate calculation speed by orders of magnitude!</p>
<p>In the case of dust grain temperature calculation, the results were equally impressive. Not only were the researchers able to accelerate dust grain calculation using a NVIDIA Tesla C1060 (with 4 GB of memory) over an 8-core Intel Xeon E5420 processor (with 32 GB of RAM) <em>alone</em> <strong>by a factor of 55</strong>, they were able to do this despite:</p>
<ul>
<li>the fact that 17% of processing time on the GPU solution was dedicated to data transfer (something the CPU-only solution has to worry about less)</li>
<li>the maximum theoretical capacity of the GPU was only 6 times greater than that of the CPU, highlighting a big difference between the CUDA philosophy (crank up performance) and the CPU compiler philosophy (abstract but flexible)</li>
</ul>
<p>Amazingly, the researchers found that even if the CPU were to run an interpolation scheme (requires less processing power, but introduces a little more error and makes it harder to do more sophisticated calculations vs. the equilibrium calculations done here), the GPU solution <strong>is still faster by a factor of 16 times</strong>!</p>
<p>So: spacedust – 0. GPU – 1. Now let’s see if they can tackle the flexible dust temperature problem…</p>
<p>Paper: “Accelerating Dust Temperature Calculations with Graphics Processing Units”, submitted to <em>New Astronomy</em>; <a href="http://arxiv.org/abs/0907.3768">ArXiV link</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.benchside.com/2009/12/gpu-meets-spacedust/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>They’re not just for gaming II</title>
		<link>http://blog.benchside.com/2009/09/they%e2%80%99re-not-just-for-gaming-ii/</link>
		<comments>http://blog.benchside.com/2009/09/they%e2%80%99re-not-just-for-gaming-ii/#comments</comments>
		<pubDate>Thu, 24 Sep 2009 14:00:07 +0000</pubDate>
		<dc:creator>Kevin</dc:creator>
				<category><![CDATA[technology]]></category>
		<category><![CDATA[distributed computing]]></category>
		<category><![CDATA[GPGPU]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[graphics card]]></category>
		<category><![CDATA[Scarle]]></category>
		<category><![CDATA[XBox360]]></category>

		<guid isPermaLink="false">http://blog.benchside.com/?p=1021</guid>
		<description><![CDATA[We&#8217;ve talked before about researchers using PlayStation game consoles and gaming graphics cards to perform scientific computing, but we hadn&#8217;t heard too much about Microsoft&#8217;s XBox. Until now, that is, when University of Warwick researcher Dr. Simon Scarle demonstrated the use of the graphical horsepower on an XBox360 in high performance computing. By taking advantage [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-medium wp-image-1022" title="2965__0001" src="http://blog.benchside.com/wp-content/uploads/2009/09/2965__0001-215x300.jpg" alt="2965__0001" width="215" height="300" />We&#8217;ve talked before about researchers using <a href="http://blog.benchside.com/2008/12/cluster-of-ps3s-break-md5-ssl/">PlayStation game consoles</a> and <a href="http://blog.benchside.com/2008/08/theyre-not-just-for-gaming/">gaming graphics cards</a> to perform scientific computing, but we hadn&#8217;t heard too much about Microsoft&#8217;s XBox. Until now, that is, when University of Warwick researcher Dr. Simon Scarle <a href="http://www.physorg.com/news171893988.html">demonstrated the use of the graphical horsepower on an XBox360 in high performance computing</a>. By taking advantage of the parallel processing power of the on-board GPU, <a href="http://www2.warwick.ac.uk/newsandevents/pressreleases/researchers_using_parallel/">Dr. Scarle was able to use an Xbox360 to aid in his research</a> and sidestepped the need to reserve time on a dedicated parallel processing computer or shell out thousands for a parallel network of PC&#8217;s.</p>
<p>Armed with his gaming console, Dr. Scarle used the Xbox&#8217;s GPU computing power to calculate and even predict cardiac arrhythmias based on his model of electric excitations of the heart. The result? A paper titled <em>Implications of the Turing completeness of reaction-diffusion models, informed by GPGPU simulations on an XBox 360: Cardiac arrhythmias, re-entry and the Halting problem</em>.</p>
<blockquote><p>This is a highly effective way of carrying out high end parallel computing on “domestic” hardware for cardiac simulations. Although major reworking of any previous code framework is required, the Xbox 360 is a very easy platform to develop for and this cost can easily be outweighed by the benefits in gained computational power and speed, as well as the relative ease of visualization of the system.</p></blockquote>
<p>So much attention thus far has focused on using the PlayStation 3 in <a href="http://blog.benchside.com/2008/12/distribute-compute/">distributed computing projects like Folding@Home</a> &#8212; maybe its time that Microsoft release some sort of software to let the legions of XBox360 owners out there show the PS3 users that their machines are good for more than just gaming?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.benchside.com/2009/09/they%e2%80%99re-not-just-for-gaming-ii/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Raytracing Radiotherapy</title>
		<link>http://blog.benchside.com/2009/09/raytracing-radiotherapy/</link>
		<comments>http://blog.benchside.com/2009/09/raytracing-radiotherapy/#comments</comments>
		<pubDate>Mon, 14 Sep 2009 14:00:00 +0000</pubDate>
		<dc:creator>ben</dc:creator>
				<category><![CDATA[technology]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[dose]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[radiotherapy]]></category>
		<category><![CDATA[ray tracing]]></category>

		<guid isPermaLink="false">http://blog.benchside.com/2009/09/raytracing-radiotherapy/</guid>
		<description><![CDATA[An impressive demonstration of the power of graphics cards is the use of graphics processing units (GPUs) in ray tracing. For those of you not in the know, ray tracing refers to a technique for rendering graphics by tracking how rays of light behave as they reflect from surface to surface, allowing you to create [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.benchside.com/wp-content/uploads/2009/09/image1.png"><img style="margin: 0px 10px 0px 0px; display: inline" title="image" alt="image" align="left" src="http://blog.benchside.com/wp-content/uploads/2009/09/image_thumb1.png" width="360" height="255" /></a> An impressive demonstration of the <a href="http://blog.benchside.com/2008/08/theyre-not-just-for-gaming/">power of graphics cards</a> is the <a href="http://hothardware.com/News/NVIDIA-Shows-Interactive-Ray-Tracing-on-GPUs/">use of graphics processing units (GPUs) in ray tracing</a>. For those of you not in the know, <a href="http://en.wikipedia.org/wiki/Ray_tracing_%28graphics%29">ray tracing</a> refers to a technique for rendering graphics by tracking how rays of light behave as they reflect from surface to surface, allowing you to create photorealistic images that have complicated reflections and shadows which traditional graphics methods fail to deliver. The flip side of this photorealism is that these techniques are incredibly taxing on computational systems, and it has been something of a “holy grail” for GPU and CPU makers to demonstrate so-called “real time” ray tracing on their systems.</p>
<p>Of course, while ray tracing is an impressive computational feat, most of these demonstrations only show off the aesthetic benefits of being able to implement ray tracing quickly. There are much more real-world impacts in the scientific and medical domains, such as in the field of radiotherapy.</p>
<p><a href="http://blog.benchside.com/wp-content/uploads/2009/09/image2.png"><img style="margin: 0px 0px 0px 5px; display: inline" title="image" alt="image" align="right" src="http://blog.benchside.com/wp-content/uploads/2009/09/image_thumb2.png" width="240" height="215" /></a> In a nutshell, the idea behind radiotherapy as a cancer therapy is that you use strong bursts of radiation to kill off a tumor while minimizing the side effects of radiation exposure to the surrounding tissue. This balance is extremely difficult to manage as the calculations necessary to understand the effect of applying radiation from an external source on a complex three-dimensional maze of organs and liquids like the human body are highly sophisticated. Interestingly, these calculations actually resemble a ray tracing problem, as the problem of understanding radiotherapy dosage is one of understanding how individual “beams” of radiation travel and interact with the human body.</p>
<p>The result is that computer models which have been used to do dosage calculations are <u>slow</u> (making it impractical for physicians to consider multiple regimens or use more sophisticated “adaptive”/modulated radiation therapies), <u>error-prone</u> from the introduction of assumptions to “gloss over” some of the more sophisticated calculations, and <u>very expensive</u> given the need for large clusters of computational power.</p>
<p>What <a href="http://www.amc.nl/?pid=1076">researchers at the University of Amsterdam</a> have demonstrated is an implementation of a ray tracing algorithm targeted at the radiotherapy dosage question using <a href="http://www.nvidia.com/object/cuda_home.html">GPU maker NVIDIA’s CUDA toolkit</a> for performing mathematical calculations using the power of a graphics processor. The researchers used the fact that the power of a GPU rests in its ability to split up complicated math problems into many simpler problems to have the GPU calculate the paths of multiple “rays” of radiation simultaneously, resulting in a performance increase over a non-GPU accelerated technique <strong>ranging from 50% faster to 6 times faster</strong>! Amazingly, because of the way the GPU does its calculations (mainly that it avoids using a look-up table the CPU-driven algorithm needs), <strong>the GPU’s results are also more accurate</strong>, despite a single GPU implementation being <strong>both faster and cheaper than traditional techniques</strong>. </p>
<p>The implications to medicine? To quote the paper:</p>
<blockquote><p>“The developed GPU algorithm now enables <strong>dose calculations at a speed that will be experienced as real time</strong> for conventional forward planning based on clinically relevant datasets. this can lead to a major reduction in the workload of radiotherapy treatment planning. Moreover, the presented GPU algorithm can be used to accelerate more advanced treatment planning optimization techniques.”</p>
</blockquote>
<p><strong>Paper</strong>: M. de Greef et al, “Accelerated Ray Tracing for Radiotherapy Dose Calculations on a GPU.” <em>Medical Physics</em>, Vol 36, Issue 9 (link: <a title="http://dx.doi.org/10.1118/1.3190156" href="http://dx.doi.org/10.1118/1.3190156">http://dx.doi.org/10.1118/1.3190156</a>)</p>
<p><strong>Presentation</strong>: <a title="http://www.amc.nl/upload/teksten/radiotherapie/hyperthermie/RayForDose-NVIDIA.pdf" href="http://www.amc.nl/upload/teksten/radiotherapie/hyperthermie/RayForDose-NVIDIA.pdf">http://www.amc.nl/upload/teksten/radiotherapie/hyperthermie/RayForDose-NVIDIA.pdf</a></p>
<p>(<a href="http://www.codinghorror.com/blog/images/ray-tracing-diagram.png">Image credit – Ray tracing schema</a>) (<a href="http://www.amc.nl/upload/teksten/radiotherapie/hyperthermie/RayForDose-NVIDIA.pdf">Image from presentation</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.benchside.com/2009/09/raytracing-radiotherapy/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Distribute compute</title>
		<link>http://blog.benchside.com/2008/12/distribute-compute/</link>
		<comments>http://blog.benchside.com/2008/12/distribute-compute/#comments</comments>
		<pubDate>Wed, 17 Dec 2008 07:34:34 +0000</pubDate>
		<dc:creator>ben</dc:creator>
				<category><![CDATA[Science and the Internet]]></category>
		<category><![CDATA[technology]]></category>
		<category><![CDATA[BOINC]]></category>
		<category><![CDATA[distributed computing]]></category>
		<category><![CDATA[Folding@Home]]></category>
		<category><![CDATA[GPGPU]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[graphics card]]></category>
		<category><![CDATA[petaFLOPS]]></category>
		<category><![CDATA[Playstation]]></category>
		<category><![CDATA[protein folding]]></category>
		<category><![CDATA[SETI@Home]]></category>
		<category><![CDATA[Supercomputer]]></category>

		<guid isPermaLink="false">http://blog.benchside.com/2008/12/distribute-compute/</guid>
		<description><![CDATA[As the problems scientists solve become more and more complex, so do their demands for computational power. One approach to addressing this has been to build faster, more powerful computers, potentially with chips better suited to performing advanced calculations (like graphics cards or IBM&#8217;s Cell processor). But, this approach has serious limitations &#8212; mainly that [...]]]></description>
			<content:encoded><![CDATA[<p>As the problems scientists solve become more and more complex, so do their demands for computational power. One approach to addressing this has been to build faster, more powerful computers, potentially with <a href="http://blog.benchside.com/2008/08/theyre-not-just-for-gaming/">chips better suited to performing advanced calculations</a> (like graphics cards or IBM&#8217;s Cell processor). But, this approach has serious limitations &#8212; mainly that it&#8217;s expensive to build and to maintain these supercomputers.</p>
<p>Some researchers, however, have turned to a radically different approach. Instead of building a bigger, better mousetrap to deal with more mice, the <strong>distributed computing</strong> approach takes the approach of placing many small, cheap mousetraps. The result is cheap &#8220;supercomputers&#8221; which are able to “pool” the computing power of many computers connected over a network.</p>
<p>This approach has been used by projects like <a href="http://folding.stanford.edu/">Folding@Home</a> and <a href="http://setiathome.berkeley.edu/">SETI@Home</a> which are able to combine computing power from <strong><em>volunteers over the internet</em></strong> to do the number-crunching needed to simulate protein folding or scan deep space for extraterrestrial life. SETI@Home was the first such large-scale distributed computing platform. This platform, now the <a href="http://boinc.berkeley.edu/">Berkeley Open Infrastructure for Network Computing (BOINC)</a>, is today used for many other distributed computing projects such as attempts to <a href="http://einstein.phys.uwm.edu/">search for gravitational waves</a>, <a href="http://www.climateprediction.net/">do climate modeling</a>, and <a href="http://lhcathome.cern.ch/">simulate particle collisions in the Large Hadron Collider</a>.</p>
<p><a href="http://blog.benchside.com/wp-content/uploads/2008/12/image.png"><img style="display: block; float: none; margin-left: auto; margin-right: auto;" title="image" src="http://blog.benchside.com/wp-content/uploads/2008/12/image-thumb.png" alt="image" width="500" height="491" /></a></p>
<p><a href="mailto:Folding@Home">Folding@Home</a>, a project started by the <a href="http://folding.stanford.edu/Pande/Main">Pande group at Stanford</a> to use distributed computing to study protein folding uses a similar approach, albeit with different underlying software (is it any wonder that a <strong>Stanford</strong> group doesn’t use <strong>Berkeley’s</strong> distributed computing platform?! <img src='http://blog.benchside.com/wp-includes/images/smilies/icon_biggrin.gif' alt=':-D' class='wp-smiley' /> ) . It has probably been the most successful distributed computing approach to date, and, as a testament to the power of distributed computing, has become known as the first computing system to break the petaFLOPS barrier – e.g. capable of one quadrillion floating point calculations per second! This has enabled the team to do protein-folding simulations on a scale of ~10 micro-seconds.</p>
<p>But, as impressive as the science achieved by distributed computing projects is, what impresses me the most is that projects like Folding@Home and SETI@Home have defined some brilliant new ways to do science:</p>
<ul>
<li><strong>Use the internet</strong> – It’s <a href="http://blog.benchside.com/category/science-and-the-internet/">a common theme on Bench Press</a>, but with more and more people having faster and faster access to the internet, the potential for distributed computing becomes greater and greater. As Folding@Home demonstrated, such approaches can produce computing systems as powerful (or potentially more powerful) as leading supercomputer systems at a fraction of the cost.</li>
<li><strong>Mobilize the public</strong> – We’ve discussed <a href="http://blog.benchside.com/2008/11/reaching-out/">ways for the scientific community to reach out to the public</a> like using social media and creating interactive applications/tools for the public to use, but efforts like Folding@Home illustrate a way to not only reach out to the public but to get them vested in science. In a world where high school science teachers find it difficult to get teens interested in science, initiatives like Folding@Home have <a href="http://fah-web.stanford.edu/cgi-bin/main.py?qtype=teamstats">created a system where teams of individuals compete on who can contribute the most to the effort</a>! Instead of simply hoping that the public will continue to fund and listen, why not borrow a page from the many existing cancer-walk-a-thons and make it easy for the public to get involved?</li>
<li><strong>Leverage new technology – </strong>It <a href="http://blog.benchside.com/2008/08/theyre-not-just-for-gaming/">may not come as a surprise to our readers</a> that a significant amount of the computational power at Folding@Home <a href="http://folding.stanford.edu/English/FAQ-highperformance">comes from graphics cards and Playstation 3’s</a>. But, while many “mainstream” supercomputers ignored the new power afforded by these new chip types, Folding@Home developed software so that volunteers could quickly and easily use these powerful chips to boost their Folding@Home scores. The Folding@Home initiative also developed software to <a href="http://folding.stanford.edu/English/FAQ-gromacs">take advantage of innovations AMD and Intel included in their chips</a> (new multi-core architectures and special instructions to speed up calculations). Is it any wonder, then, that <a href="http://www.scei.co.jp/folding/en/">Sony</a>, <a href="http://www.tgdaily.com/content/view/37931/113/">NVIDIA</a>, and <a href="http://ati.amd.com/technology/streamcomputing/folding.html">AMD</a> have all publically announced support for the initiative with their products?</li>
</ul>
<p><a href="http://blog.benchside.com/wp-content/uploads/2008/12/image1.png"><img style="display: block; float: none; margin-left: auto; margin-right: auto;" title="image" src="http://blog.benchside.com/wp-content/uploads/2008/12/image-thumb1.png" alt="image" width="500" height="353" /></a></p>
<p>I don’t pretend that every scientific problem is amenable to a distributed computing initiative, but to some extent, I believe that every scientific endeavor has something valuable to learn from the success of Folding@Home and SETI@Home and their brethren. To that end, I sincerely hope to see an <strong>open-source distributed computing architecture </strong>like BOINC but with:</p>
<ul>
<li><strong>Support for new chip technologies</strong> – To provide greater value to the scientific effort, the architecture should support new chip technologies like Intel’s SSE extensions, SMP, or stream processing</li>
<li><strong>Client contribution tracking</strong> – To make it easier for volunteers to know how much they’ve contributed and/or have contests on how much they’ve contributed, a simple system to enable users/administrators to track the effort is needed</li>
<li><strong>Better security – </strong>Medical initiatives and volunteer privacy concerns demand that very fine and specialized security controls are necessary. Support for sophisticated encryption and authentication are a must.</li>
<li><strong>Linkage to social media</strong> – This probably seems extraneous, but since distributed computing efforts depend on motivated volunteers actively seeking out new volunteers, a successful architecture needs to make it easy for volunteers to share their progress with their friends whether it be via blog, or social network, or Twitter, or anything.</li>
<li><strong>Tie-in with new cloud computing systems</strong> – Along the theme of cutting costs, it is reasonable to assume that as offerings like Google’s App Engine and Amazon’s EC2 and technologies like MapReduce become better developed, we will see cash-strapped research groups using the power of “Clouds” to hold their computing power – after all, what is distributed/grid computing other than a specific variant of cloud computing (de-localized, pooled computing)? It’s probably necessary, then, for the new distributed computing architecture to more easily link with EC2 or MapReduce or App Engine.</li>
</ul>
<p>Anyone else have any thoughts?</p>
<p>(<a href="http://blogs.guardian.co.uk/technology/barilan_internet-thumb.jpg">Image Credit – picture of the internet</a>) (<a href="http://en.wikipedia.org/wiki/File:FAH-tflops.PNG">Image Credit – Folding@Home computing power</a>)</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.benchside.com/2008/12/distribute-compute/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>They&#8217;re not just for gaming</title>
		<link>http://blog.benchside.com/2008/08/theyre-not-just-for-gaming/</link>
		<comments>http://blog.benchside.com/2008/08/theyre-not-just-for-gaming/#comments</comments>
		<pubDate>Sat, 30 Aug 2008 07:17:23 +0000</pubDate>
		<dc:creator>ben</dc:creator>
				<category><![CDATA[technology]]></category>
		<category><![CDATA[Chips]]></category>
		<category><![CDATA[CUDA]]></category>
		<category><![CDATA[GPGPU]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[Graphics Cards]]></category>
		<category><![CDATA[HPC]]></category>
		<category><![CDATA[Playstation]]></category>
		<category><![CDATA[Stream Processing]]></category>
		<category><![CDATA[Supercomputer]]></category>

		<guid isPermaLink="false">http://blog.benchside.com/?p=30</guid>
		<description><![CDATA[There was a time when video game consoles and graphics cards were “just for games.” In those days, game console chips and graphics cards were the domain of little boys, not grown men. Well, thank the stars those days are long gone! Today, if someone were to tease a grown man for purchasing Sony’s Playstation [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.benchside.com/wp-content/uploads/2008/08/image.png"><img style="margin: 0px 10px 10px 0px; border-width: 0px;" title="image" src="http://blog.benchside.com/wp-content/uploads/2008/08/image-thumb.png" border="0" alt="image" width="248" height="307" /></a></p>
<p>There was a time when video game consoles and graphics cards were “just for games.” In those days, game console chips and graphics cards were the domain of little boys, not grown men. Well, thank the stars those days are long gone!</p>
<p>Today, if someone were to tease a grown man for purchasing Sony’s Playstation 3, he could simply reply, “I beg your pardon. I am a grown man, not a little boy. I am clearly using the Playstation 3, not to play great games like Grand Theft Auto IV and Metal Gear Solid, but to use its TeraFLOPS (1 trillion floating point calculations per second) capacity to solve important and complex scientific problems.”</p>
<p>It almost sounds like a fantasy, but it’s not. The idea behind this is pretty basic. To make games and graphics run smoothly, video game console chips and graphics cards have to do a mind-boggling number of calculations much faster than a basic computer chip can. It “just so happens” that the supercomputers scientists and Wall Street analysts use to do simulations and research with also need to do those same types of calculations. Hence the idea of <a href="http://en.wikipedia.org/wiki/Stream_processing">Stream Processing</a> was born – why not use graphics card/game console chips for things which aren’t directly related to graphics or gaming?</p>
<p>Why not indeed? I can’t list all of the projects out there, but here’s just a snapshot of the scientific applications that people have been able to do with the Playstation 3’s unique chip, <a href="http://www-03.ibm.com/technology/cell/">IBM’s Cell Broadband Engine</a>, and graphics cards from <a href="http://www.nvidia.com/page/home.html">NVIDIA</a> and <a href="http://ati.amd.com/">AMD</a>:</p>
<ul>
<li>Researchers at the National Center for Atmospheric Research used NVIDIA’s CUDA stream processing platform to <a href="http://www.nvidia.com/object/national_center_for_atmospheric_research.html">improve the speed of their weather forecasting models</a></li>
<li>Astrophysicist Gaurav Khanna from UMass Dartmouth used 16 Playstation 3’s to <a href="http://gravity.phy.umassd.edu/ps3.html">simulate the collision of two black holes</a></li>
<li><a href="http://ati.amd.com/technology/streamcomputing/siggraph07-sketch.pdf">AMD researchers</a> were able to demonstrate the use of AMD graphics cards to let a computer see in 3D and then map that image to an actual physical simulation</li>
<li>The Los Alamos National Laboratories turned to IBM to design them a new supercomputer, but instead of just designing them with plain vanilla chips from Intel and AMD, they asked for a new breed of computer. Enter <a href="http://www.lanl.gov/roadrunner/">the Roadrunner</a>, a computer which when fully operational will be capable of PetaFLOPS performance (1 quadrillion floating point calculations per second), and consists of ~7000 AMD Opteron chips coupled with ~13000 Cell processors</li>
<li>The University of Illinois at Urbana-Champaign, by using 3 NVIDIA graphics processors, was able to help <a href="http://www.nvidia.com/object/uiuc.html">perform biochemical simulations at a submolecular level 100 times faster than 18 typical computer CPUs</a>.</li>
</ul>
<p>Technology – it’s good for more than just playing games.</p>
<p><a href="http://www.krunker.com/wp-content/uploads/2007/07/playstation%203.jpg">Image Credit</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.benchside.com/2008/08/theyre-not-just-for-gaming/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
	</channel>
</rss>

