Description
Baidu Yun ( https://cloud.baidu.com/ ) is one of top tier cloud computing provider. Baidu Yun BOS is widely used among China's cloud users, but currently it is not easy to access data laid on BOS storage from user's Hadoop/Spark application, because of no original support for BOS in Hadoop.
This work aims to integrate Baidu Yun BOS with Hadoop. By simple configuration, Spark/Hadoop applications can read/write data from BOS without any code change. Narrowing the gap between user's APP and data storage, like what have been done for S3 and Aliyun OSS in Hadoop.