BioProject
Optimization of epigenome data analysis system and development of practical open analysis system
Research Aim With the development of next-generation sequencing (NGS) instruments, large-scale epigenetic data are produced in various experimental samples. As a result, the demand for customized analytical solutions is increasing rapidly, but the systematic analytical support system is insufficient. Therefore, it is necessary to construct a user-oriented and customized data analysis environment. In the first phase, we developed a customized standard epigenome analysis pipeline including quality control module that considers the characteristics of the epigenetic data and transferred the pipeline to the integrated project to establish a system for service to the people. In the second phase, based on the results of the first phase, we specialize a researcher-oriented and custom-tailored epigenome analysis service system by adding new epigenetic data analysis pipeline, enhancing analysis system and providing open analysis service. Also, we intend to construct a domestic environment for epigenome analysis service through systematic management and education. As a result, this study establishes standards for analysis of epigenetic data and develops tailor-made and open epigenetic data analysis service system for researchers who are not familiar with the current complex analysis process. Research Contents This project provides active service through SNS and education program to provide and operate analysis service system, enhance analytical pipeline utilization, introduce epigenome analysis pipeline and latest analysis technology. Advisory and analytical services are provide for a range of research processes, from production of epigenetic data to analysis and derivation of results. Establishment of an integrated analysis service system through exchange and cooperation with integrated project, construction of data sharing system, and development of new epigenome analysis pipeline, such as chromatin structure analysis (ATAC-seq) analysis pipeline and single cell analysis pipeline. Expected effect Establishment of a standardized epigenetic big data processing technology and analysis pipeline will solve the problem of data analysis, which has been a stumbling block to the study of epigenome. It is possible to have an independent research system deviating from research that is dependent on or mimic the developed countries in the field of epigenomics, and there will be a positive influence on the introduction of researchers in other fields or nurture professional manpower to major in this field. It is expected that the increase of application to the epigenome will promote domestic IT and BT industry as well as genome-related market. Especially, preventive medicine cost is expected to be reduced through disease prediction based on the analysis of individual epigenome.