Journal of the Korea Information Science Society (A): Systems and Theory
Korean Title |
Design and Implementation of an I/O System for Irregular Applications under Parallel System Environments (Korean) |
English Title |
Design and Implementation of An I/O System for Irregular Applications under Parallel System Environments |
Author(s) |
Jaechun No
Sung-Soon Park
Alok Choudhary
Oh-Young Kwon
|
Citation |
Vol. 26, No. 11, pp. 1318-1332 (Nov. 1999) |
Korean Abstract |
In this paper we describe the design, implementation, and performance evaluation of a runtime system based on collective I/O techniques for I/O-intensive applications. We examine two design schemes: collective I/O, in which all processors schedule and perform I/O simultaneously according to their I/O requests, and pipelined collective I/O, in which processors are bound into several groups so that only one group performs I/O at a time while the next group carries out the communication needed to rearrange its data. The entire pipelined collective I/O process is pipelined to dynamically reduce contention at the I/O nodes; the design thus provides support for dynamic contention management. To reduce I/O cost by reusing data already present in the memory of other nodes, we describe a software caching scheme within the collective I/O approach, as well as chunking and on-line compression schemes in both models. We show that the schemes described above deliver high I/O performance; the results were obtained on an Intel Paragon and on the ASCI/Red teraflops machine, where application-level bandwidth of up to 55% of the peak was measured. |
English Abstract |
In this paper we present the design, implementation and evaluation of a runtime system based on collective I/O techniques for irregular applications. We present two designs, namely, "Collective I/O" and "Pipelined Collective I/O". In the first scheme, all processors participate in the I/O simultaneously, making the scheduling of I/O requests simpler but creating a possibility of contention at the I/O nodes. In the second approach, processors are grouped into several groups, so that only one group performs I/O at a time, while the next group performs communication to rearrange data, and this entire process is pipelined to reduce I/O node contention dynamically. In other words, the design provides support for dynamic contention management. Then we present a software caching method using collective I/O to reduce I/O cost by reusing data already present in the memory of other nodes. Finally, chunking and on-line compression mechanisms are included in both models. We demonstrate that these schemes achieve significantly higher I/O performance than has been possible so far. The performance results are presented on an Intel Paragon and on the ASCI/Red teraflops machine. Application-level I/O bandwidth of up to 55% of the peak is observed. |
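The pipelined scheme described in the abstract overlaps one group's I/O with the next group's data-rearrangement communication. A minimal sketch of that stage schedule, in Python, is below; the function name and stage representation are illustrative, not taken from the paper:

```python
def pipeline_schedule(num_groups):
    """Build a stage-by-stage schedule for pipelined collective I/O.

    At each stage one group performs I/O while the following group
    performs the communication that rearranges its data, so I/O-node
    contention is limited to a single group at a time.
    """
    # Stage 0: the first group only rearranges its data; no I/O yet.
    stages = [{"io": None, "comm": 0}]
    for g in range(num_groups):
        # Group g writes while group g+1 (if any) communicates.
        nxt = g + 1 if g + 1 < num_groups else None
        stages.append({"io": g, "comm": nxt})
    return stages
```

Note the invariant the sketch encodes: a group always communicates exactly one stage before it performs I/O, which is what allows the two phases to overlap across groups.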
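The software caching method reuses data already resident in another node's memory instead of re-reading it from disk. A toy sketch of that idea, assuming a simple global directory that maps a block to the node holding it (class and method names are hypothetical, not the paper's API):

```python
class SoftwareCache:
    """Toy global-cache directory: block id -> node holding it in memory."""

    def __init__(self):
        self.directory = {}

    def read(self, node, block, read_from_disk):
        """Serve a block from a remote node's memory if possible.

        Falls back to disk I/O and records the new resident copy so
        later readers can avoid the disk entirely.
        """
        if block in self.directory:
            # Reuse the copy already resident on another node.
            return ("remote", self.directory[block])
        data = read_from_disk(block)
        self.directory[block] = node
        return ("disk", data)
```

The second reader of any block pays a memory-to-memory transfer rather than a disk access, which is the source of the I/O cost reduction the abstract claims.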
Keyword(s) |
|