BEGIN:VCALENDAR PRODID:-//Microsoft Corporation//Outlook MIMEDIR//EN VERSION:2.0 BEGIN:VEVENT DTSTART:20141118T231500Z DTEND:20141119T010000Z LOCATION:New Orleans Theater Lobby DESCRIPTION;ENCODING=QUOTED-PRINTABLE:ABSTRACT: As data sizes for scientific computations grow larger, more of these types of computations are bottlenecked by disk input and output, rather than processing speed. Monitoring reads and writes, therefore, has become an important component of distributed computing clusters. Our team investigated several different possibilities for monitoring I/O on a Hadoop cluster, including the Splunk app for HadoopOps, Ganglia, and log file output. Each of these three methods were evaluated for compatibility, ease of use, and display. Surprisingly, despite the fact that Splunk HadoopOps is made specifically for Hadoop clusters, other monitoring programs and techniques still proved to be useful. Using our monitoring tools, we were also able to observe input and output behavior over different cluster architectures.=0ALA-UR-14-25814 SUMMARY:I/O Monitoring in a Hadoop Cluster PRIORITY:3 END:VEVENT END:VCALENDAR