A Study in Hadoop Streaming with Matlab for NMR Data Processing
Applying Cloud computing techniques for analyzing large data sets has shown promise in many data-driven scientific applications. Our approach presented here is to use Cloud computing for Nuclear Magnetic Resonance (NMR)data analysis which normally consists of large amounts of data. Biologists often use third party or commercial software for ease of use. Enabling the capability to use this kind of software in a Cloud will be highly advantageous in many ways. Scripting languages especially designed for clouds may not have the flexibility biologists need for their purposes. Although this is true, they are familiar with special software packages that allow them to write complex calculations with minimum effort, but are often not compatible with a Cloud environment. Therefore, biologists who are trying to perform analysis on NMR data, acquire many advantages due to our proposed solution. Our solution gives them the flexibility to Cloud-enable their familiar software and it also enables them to perform calculations on a significant amount of data that was not previously possible. Our study is also applicable to any other environment in need of similar flexibility. We are currently in the initial stage of developing a framework for NMR data analysis.
Digital Object Identifier (DOI)
Proceedings of the IEEE Second International Conference on Cloud Computing Technology and Science, 2010, pages 786-789.
© IEEE, 2010
Gunaratna, K., Anderson, P. E., Ranabahu, A. H., & Sheth, A. P. (2010). A Study in Hadoop Streaming with Matlab for NMR Data Processing. Proceedings of the IEEE Second International Conference on Cloud Computing Technology and Science, 786-789.