<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>VeryCloud</title>
	<atom:link href="http://verycloud.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://verycloud.wordpress.com</link>
	<description>Accelerate Your Data</description>
	<lastBuildDate>Sat, 31 Dec 2011 12:57:24 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='verycloud.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>VeryCloud</title>
		<link>http://verycloud.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://verycloud.wordpress.com/osd.xml" title="VeryCloud" />
	<atom:link rel='hub' href='http://verycloud.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Random thoughts on Internet bandwidth</title>
		<link>http://verycloud.wordpress.com/2011/12/30/random-thoughts-on-internet-bandwidth/</link>
		<comments>http://verycloud.wordpress.com/2011/12/30/random-thoughts-on-internet-bandwidth/#comments</comments>
		<pubDate>Fri, 30 Dec 2011 05:29:50 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=77</guid>
		<description><![CDATA[A decade ago when I worked temporarily for a company in Beijing, I had an interesting conversation with the CEO about Internet bandwidth (I intentionally avoid using the word Internet speed as it is likely to refer heavily on latency nowadays). It was a casual talk but I still clearly remember two arguments we made. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=77&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>A decade ago when I worked temporarily for a company in Beijing, I had an interesting conversation with the CEO about Internet bandwidth (I intentionally avoid using the word Internet speed as it is likely to refer heavily on latency nowadays). It was a casual talk but I still clearly remember two arguments we made. He said: &#8220;Internet is like highway, no matter how wide you build the road, there will always be more cars that its capacity can handle&#8221;. It was hard to argue with him about this when we were all using dial-up modems and the company, by the way, developed Internet video conference products.</p>
<p>Today there is still no clear answer. On the one hand, 100Gb/s Ethernet is being deployed and will probably become popular soon (recall the time when 1Gb/s and 10Gb/s were out), even cellular network can easily beat wired home Internet connection just a few years ago.  Do not forget that Google Fiber is going to be available in Kansas City soon: that is 1Gb/s to home. On the other hand, applications like HDTV over Internet is probably going to become mainstream as well. So will the Internet Bandwidth one day become so abundant that we would not need to worry about it any more. Considering the following facts: during the last decade,</p>
<ul>
<li>CPU performance doubles every 18 months (Moore&#8217;s Law)</li>
<li>Storage capacity on a typical PC increases roughly from 10GB (2001) to 1TB (2010).</li>
<li>Home Internet bandwidth increases roughly from 56Kb/s dial-up to 5Mb/s broadband.</li>
</ul>
<p>The increase speed of Internet bandwidth is not a clear winner over disk storage yet, but my bet will be on that in the next decade, bandwidth will no longer be on our top concerns when building a new Internet applications.</p>
<p>The other argument we both agreed, and is already happening, was that one day we would be able to use Internet video conference on a commercial flight.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/77/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/77/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/77/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/77/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/77/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/77/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/77/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/77/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/77/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/77/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/77/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/77/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/77/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/77/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=77&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2011/12/30/random-thoughts-on-internet-bandwidth/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>A Private &#8220;DropBox&#8221; for Your Enterprise</title>
		<link>http://verycloud.wordpress.com/2010/12/12/a-private-dropbox-for-your-enterprise/</link>
		<comments>http://verycloud.wordpress.com/2010/12/12/a-private-dropbox-for-your-enterprise/#comments</comments>
		<pubDate>Sun, 12 Dec 2010 22:34:34 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Sector/Sphere]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=49</guid>
		<description><![CDATA[Companies today still use FTP, SFTP, SSH, and/or HTTP as their daily data sharing platform. Some people collect/generate data and upload the data to a data server, while others login the server and download the data they are interested in. There are several problems for this model. First, as the data increases, there are more [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=49&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Companies today still use FTP, SFTP, SSH, and/or HTTP as their daily data sharing platform. Some people collect/generate data and upload the data to a data server, while others login the server and download the data they are interested in. There are several problems for this model. First, as the data increases, there are more and more data servers set up, and users often have hard time to figure out which server hosts a particular data file. Second, when there are multiple data servers, data files are often replicated either intentionally or by mistake, it is difficult to tell which version is the most recent, even more difficult is to keep all replicas consistent. Third, when users try to download data files from remote locations via the Internet, they often experience low throughput. Now there have been a business called WAN acceleration to help solve the third problem, but WAN acceleration software cannot help anything about the first two problems.</p>
<p>On the other hand, there are many Internet storage service emerging in recent years, DropBox, Google Doc, Amazon S3, just to name a few. These services put users&#8217; files in a storage &#8220;cloud&#8221; and provide a single namespace. Replications and conflicts are handled transparently to users. These solved the first two problems described above. In fact, due to these benefits, online storage service is very popular today.</p>
<p>However, enterprises cannot simply move their data to these online storage providers. There is immediately a security concern. Then there are other issues including capacity and cost. Uploading 100TB+ data to an online storage is still less than practical today for most cases, and the cost to keep them in service is very high (e.g., at Amazon S3, 100TB will cost approximately $12K per month plus data transfer cost). In addition, the intranet network connection is usually faster.</p>
<p>An alternative and probably better approach is to own a private data cloud &#8220;DropBox&#8221; inside the company, managing and serving data to all branches. Such a private data cloud should have the following features:</p>
<ul>
<li>Single name space across multiple servers, even if the servers are located at different locations</li>
<li>Allow servers to be added and removed at run time (dynamic scaling)</li>
<li>Maintain replicas and take care of consistency between replicas transparently</li>
<li>Allow users to control the replication number and location of each file when necessary (e.g., hot files can be replicated more times)</li>
</ul>
<p><a title="Sector/Sphere" href="http://sector.sourceforge.net">Sector/Sphere</a> meets all the above requirements. Sector can manage your data across thousandths of servers with a single name space. Sector automatically replicates data files to multiple data centers for fault tolerance and to increase read performance. Data location and replication number can be configured at per-file level if necessary and is dynamically changeable. For example, if new files have more readers than old files, the new files can be replicated at a higher degree and gradually reduce the replication number when users are more interested in even newer files. In addition to all of these benefits, Sector also gives you integrated WAN acceleration ability with the <a title="UDT" href="http://udt.sourceforge.net">UDT</a> protocol, another open source software that we contribute. UDT has helped millions of users with their daily data transfer needs.</p>
<p>Overall, our system can support very large enterprises to share 100+TBs of data  every day among their global branches. You may also refer to <a title="A High Performance Data Distribution and Sharing Solution with Sector" href="http://verycloud.wordpress.com/2010/06/14/a-high-performance-data-distribution-and-sharing-solution-with-sector/">our previous blog post</a> to start trying the system by yourself.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/49/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/49/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/49/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=49&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2010/12/12/a-private-dropbox-for-your-enterprise/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>Storage 2.0</title>
		<link>http://verycloud.wordpress.com/2010/12/08/storage-2-0/</link>
		<comments>http://verycloud.wordpress.com/2010/12/08/storage-2-0/#comments</comments>
		<pubDate>Wed, 08 Dec 2010 20:18:14 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Sector/Sphere]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=35</guid>
		<description><![CDATA[While I was discussing new storage and file systems with my friend Chuan Wang, he came up this term: storage 2.0, which is exactly the &#8220;catch&#8221; word I have been searching for a while. There have been numerous file systems developed since the birth of modern computers. However, among those mature and widely used file [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=35&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>While I was discussing new storage and file systems with my friend Chuan Wang, he came up this term: storage 2.0, which is exactly the &#8220;catch&#8221; word I have been searching for a while.</p>
<p>There have been numerous file systems developed since the birth of modern computers. However, among those mature and widely used file systems, there are basically two groups: the desktop file system (ZFS, NTFS, EXT, etc.) and the supercomputer file system (GPFS, Lustre, Panasas, etc.). In the middle, there is a chaos.</p>
<p>The &#8220;middle&#8221; is an area where users have a large number of loosely coupled commodity computers, where the supercomputer file system will not work well or additional layer middleware is required to aggregate individual desktop file systems. There are many file or storage systems in this area too (HDFS, Gluster, Dynamo, just to list a few), but they are used in more or less specific use scenarios and none of them becomes so dominant and widely deployed. This is partially due to the fact that requirements from different users can vary drastically from high consistency to high availability, but, in general, it is hard to provide all of these elements at once  (see Brewer&#8217;s CAP Theorem).</p>
<p>Yet there are indeed some common characteristics the &#8220;middle&#8221; file systems tend to (or need to) share. Together, these are the features required by what we would like to call storage 2.0, or the general &#8220;middle&#8221; file system.</p>
<p><strong>Software level fault tolerance.</strong></p>
<p>Almost all distributed file systems need to deal with hardware and network failure. Supercomputer file systems usually use RAID as a hardware level solution. However, hardware failure in commodity clusters is normal, so a file system must be able to provide fault tolerance within itself, rather than depending on another layer.</p>
<p><strong>Self-healing.</strong></p>
<p>Storage nodes may join and leave frequently, either due to system/network failure or maintenance/upgrade. The file system should continue to work in such situations and hide these internal changes from the clients. In particular, the file system should also  automatically re-balance storage space (e.g., when new nodes are inserted). In short, the file system should always work even if there is only one node left (although files may be lost if there are too many nodes down).</p>
<p><strong>In-storage processing.</strong></p>
<p>If a file system is running on a system where individual storage is attached with CPUs, it is only a waste of resources if these CPUs are used for serving data only. In-storage processing can significantly accelerate such operations as md5sum and grep, as it does not only avoid the need to read data out (to the client), but also can execute the commands on multiple files in parallel.</p>
<p><strong>Ability to treat files differently.</strong></p>
<p>A distributed file system, especially those served across wide area networks, are usually involved with higher level of versatility and flexibility. Files may come from different sources and serve different purposes. The location, security, and replication factor of each file needs to be treated differently and the rules should be dynamically updated whenever necessary.</p>
<p><strong>Scalability.</strong></p>
<p>A file system should be able to handle 10,000 storage nodes if necessary, yet we must be aware that the majority of systems have never come close to this level of scale. Extreme  high Scalability often does not come free. Many highly scalable systems uses P2P routing (e.g., distributed hash table) but consistency and performance are often compromised. Therefore, the file system should have &#8220;reasonable&#8221; scalability. It is also worth noting that scalability does not only apply to the number of nodes, but also apply to the number and size of files, and sometimes even geographical locations.</p>
<p><strong>Performance.</strong></p>
<p>The file system should support high performance lookup and IO throughput and provide faster IO throughput than a desktop file system due to concurrent data access. However, latency is usually higher than that in those supercomputer file systems.</p>
<p>There are other features that may be less important but can be crucial in certain situations, such as integrated security and integrated monitoring. For distributed file systems, depending on external security and monitoring may not be enough. They need to support these features within the system.</p>
<p>Storage 2.0 is not to replace current desktop and supercomputer file systems, but to fill a void left between them. At VeryCloud, we are trying to shape our Sector DFS within these requirements. We hope you can get involved with us, either you are an open source developer who is interested in distributed file system development or you are a potential user who feels that these features meet your specific requirements.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/35/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/35/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/35/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/35/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/35/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/35/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/35/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/35/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/35/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/35/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/35/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/35/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/35/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/35/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=35&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2010/12/08/storage-2-0/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>Installing Sector within 5 Minutes</title>
		<link>http://verycloud.wordpress.com/2010/10/24/installing-sector-within-5-minutes/</link>
		<comments>http://verycloud.wordpress.com/2010/10/24/installing-sector-within-5-minutes/#comments</comments>
		<pubDate>Sun, 24 Oct 2010 01:10:45 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[Sector/Sphere]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=30</guid>
		<description><![CDATA[When we develop Sector, we have a rule that a user with reasonable knowledge of Linux should be able to install a working Sector system within 5 minutes. In order to keep this rule, we intentionally limit the number of dependencies. Currently the software only relies on OpenSSL, with FUSE optional. On the other hand, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=30&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>When we develop Sector, we have a rule that a user with reasonable knowledge of Linux should be able to install a working Sector system within 5 minutes. In order to keep this rule, we intentionally limit the number of dependencies. Currently the software only relies on OpenSSL, with FUSE optional. On the other hand, we also try to limit the places of configurations and most of the system parameters should work with default values.</p>
<p>Deploying a distributed software system can be &#8220;scary&#8221; and time-consuming because there are many components and roles involved and a single mistake can prevent the system from working properly.</p>
<p>You don&#8217;t need to be scared away by Sector. If you are interested in Sector, the best way to learn it is to get your hand dirty now: installing it on a Linux box and starting to play with it. You will find it is easier to use than you thought.</p>
<p>Here is a quick guide to install Sector within 5 minutes:</p>
<p>Step 0. You need a Linux box with g++, libssl-dev or openssl-devel.</p>
<p>Step 1. download the most recent version from https://sourceforge.net/projects/sector/.</p>
<p>Step 2. Untar/unzip the tarball file, you will see a directory ./sector-sphere.</p>
<p>Step 3. cd sector-sphere; then do &#8220;make&#8221;.</p>
<p>Step 4. Configuration: go to sector-sphere/conf. First, edit master.conf to update the security server address (where you want to start the security server; in this case it should be the local machine and leave the port number as default) and a local directory to store system information; Second, edit slave.conf to update the master server address (use local IP and default port) and a local directory to store Sector files; Finally, update client.conf to update the master server address, same as that in slave.conf.</p>
<p>Step 4. cd sector-sphere/security, start the Security server by running ./sserver. The security server manages system security information, including user accounts. Predefined accounts already exists, so there is no need to create a new account for testing purpose.</p>
<p>Step 5. cd sector-sphere/master, start the master server by running ./start_master.</p>
<p>Step 6. cd sector-sphere/slave, start a slave node by running ./start_slave.</p>
<p>Step 7. cd sector-sphere/tools, run ./sector_sysinfo to check the Sector system information.</p>
<p>That is all. Remember that there is a complete manual at http://sector.sourceforge.net/doc/index.htm and you can explore more details of the system.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/30/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/30/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/30/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/30/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/30/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/30/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/30/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/30/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/30/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/30/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/30/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/30/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/30/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/30/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=30&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2010/10/24/installing-sector-within-5-minutes/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>Nine Years of UDT</title>
		<link>http://verycloud.wordpress.com/2010/08/18/nine-years-of-udt/</link>
		<comments>http://verycloud.wordpress.com/2010/08/18/nine-years-of-udt/#comments</comments>
		<pubDate>Wed, 18 Aug 2010 02:46:03 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[UDT]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=21</guid>
		<description><![CDATA[This August marks the 9th year of the development of UDT. The project originated from SABUL (Simple Available Bandwidth Utilization Library)  that I worked together with Xinwei Hong (now at Microsoft) between 2001 and 2003. Around year 2000, researchers had noticed that stock TCP (newReno) was not efficient for the wide spreading OC-12 and  1GE [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=21&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>This August marks the 9th year of the development of UDT. The project originated from SABUL (Simple Available Bandwidth Utilization Library)  that I worked together with Xinwei Hong (now at Microsoft) between 2001 and 2003. Around year 2000, researchers had noticed that stock TCP (newReno) was not efficient for the wide spreading OC-12 and  1GE networks connecting research labs around the world. SABUL was one of the first research projects to resolve the problem.</p>
<p>SABUL used UDP to transfer large data blocks and TCP to transfer control packets (e.g., for reliability). It ran very well for our own applications on private networks, but there were three areas that would require (significant)  future work. The congestion control algorithm was not suitable for shared networks. The use of TCP for its control channel limited the protocol&#8217;s design choices. And the API was not friendly to generic application development.</p>
<p>In the second half of 2002, I started to design a new protocol to remove these limitations. This protocol was later named as UDT by <a href="http://rgrossman.com/category/blog/">Bob</a> because it is completed based on top of UDP and it uses single UDP socket for both data and control packets. The first version of UDT was out in early 2003. Compared to SABUL, UDT provides streaming style API so that it can simulate the TCP socket semantics, which is an important step to gain a large user community.</p>
<p>I spent about one year to investigate the congestion control algorithm. Having considered many approaches, I chose to modify the traditional loss-based AIMD algorithm, which had worked stably with TCP. Delay-based approaches have several attractions, especially that they are less affected by non-congestion related packet loss, but they face a fundamental problem, which is to learn the &#8220;base&#8221; delay value. An inaccurate &#8220;base&#8221; value can make a delay-based algorithm either too aggressive  if the base is overestimated or too &#8220;friendly&#8221; (co-existing flows may simply kill it) otherwise.</p>
<p>UDT uses packet-pair to estimate the available bandwidth so that it can rapidly explore the large bandwidth, while it can still share it fairly and friendly with other flows.</p>
<p>In addition to its native congestion algorithm, I have also implemented most of the major congestion control algorithms available by 2005. To this end, I have made UDT a composable framework so that a new control algorithm can be easily implemented by overriding several call back functions.  Due to the nature of the feedback delay and unknown coexisting flows, there is no &#8220;perfect&#8221; control algorithm. Each algorithm may work well in some situation but may behave poorly in others. UDT can be quickly customized to suit a specific environment.</p>
<p>By 2005, UDT3 already became production ready and had a large user community. As the user community grows, people had started to use UDT in commodity networks with relative small bandwidth (e.g., Cable, DSL, etc.). There was an important feature of UDP that accelerated this change: it is much easier to use UDP to punch a firewall than using TCP and UDT is completed based on UDP.</p>
<p>This is a completely different use scenario compared to the original design goal. While UDT can easily scale to inter-continental multi-10GE networks, it did not scale well to high concurrency. This motivated the  birth of UDT4 in 2007.</p>
<p>UDT4 introduces UDP multiplexing and buffer sharing to allow an application to start a very large number of UDT connections. UDP multiplexing makes it much easier to support firewall management and NAT punching because a single UDP socket can carry multiple UDT connections. Buffer sharing significantly reduced the memory usage. Today UDT can efficiently support 100,000 concurrent connections on a commodity computer. The scalability will be further improved when the epoll API and session multiplexing over UDT connections are completed.</p>
<p>Over these years, I have received numerous useful feedbacks and learned a lot from the users. In particular, many features were motivated by users&#8217; requirements. Several users have even developed their own UDT implementation, while some others created and shared wrappers for non-C++ programming languages.</p>
<p>UDT also benefits a lot from the open source approach, which helps UDT to reach to a greater community. Users can do code review, debug and submit bug fixes whenever necessary, which greatly increases the code quality.</p>
<p>At a user space protocol, UDT is able to include related new technologies from computer networking and adapt to new network environments and use scenarios. I am confident that UDT will continue to evolve and serve the data transfer jobs in more and more applications.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/21/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/21/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/21/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=21&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2010/08/18/nine-years-of-udt/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>A High Performance Data Distribution and Sharing Solution with Sector</title>
		<link>http://verycloud.wordpress.com/2010/06/14/a-high-performance-data-distribution-and-sharing-solution-with-sector/</link>
		<comments>http://verycloud.wordpress.com/2010/06/14/a-high-performance-data-distribution-and-sharing-solution-with-sector/#comments</comments>
		<pubDate>Mon, 14 Jun 2010 12:51:37 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Sector/Sphere]]></category>
		<category><![CDATA[UDT]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=16</guid>
		<description><![CDATA[Over the years, many users have used UDT to power their high speed data transfer applications or tools.  Today, with Sector, we can provide an advanced data distribution and sharing application, in addition to the UDT library. This solution currently works on Linux only, but we will port it to Windows in the near future, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=16&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Over the years, many users have used UDT to power their high speed data transfer applications or tools.  Today, with Sector, we can provide an advanced data distribution and sharing application, in addition to the UDT library. This solution currently works on Linux only, but we will port it to Windows in the near future, first the client side, then the server side.</p>
<p>Here are the several simple steps you can follow to set up a free, open source, advanced, high performance, and simple-to-use data distribution and sharing platform:</p>
<p>1. Download Sector from <a title="download Sector" href="https://sourceforge.net/projects/sector/" target="_blank">here</a>, compile and configure the software following the <a title="Sector Manual" href="http://sector.sourceforge.net/doc/index.htm" target="_blank">manual</a>.</p>
<p>2. Set up a security server, which allows you control the data access permission, including user accounts, passwords, IP access control list, etc. You can also set up an anonymous account for your public data.</p>
<p>3. Set up one or several Sector master servers, which can be on the same computer that hosts the security server and the data (slave server).</p>
<p>4. Set up Sector Slave servers on the computers that host your data. Unlike FTP or most commercial applications that supports only a single server, Sector allows you to install the servers on multiple computers, even 1000s of them, and yet provides a uniform namespace for the complete system.</p>
<p>5. Install client software on your users&#8217; computers and mount the Sector file system as a local file directory using the Sector-FUSE module. Your users can browse and access the data in Sector just as browsing and accessing data on a local directory by using file system commands &#8220;ls&#8221;, &#8220;cp&#8221;, etc. They will not feel the existence of Sector, although this &#8220;local directory&#8221; can actually run on 1000 servers across multiple continents!</p>
<p>All the data transfer occurring between the clients and the slave servers are on top of <a href="http://udt.sourceforge.net" target="_blank">UDT</a>. Therefore, data transfer throughput can be guaranteed even over wide area networks.</p>
<p>If you have any questions, please post them on the Sector project forum.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/16/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/16/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/16/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=16&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2010/06/14/a-high-performance-data-distribution-and-sharing-solution-with-sector/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>Sector vs. Hadoop</title>
		<link>http://verycloud.wordpress.com/2010/03/02/sector-vs-hadoop/</link>
		<comments>http://verycloud.wordpress.com/2010/03/02/sector-vs-hadoop/#comments</comments>
		<pubDate>Tue, 02 Mar 2010 17:47:09 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Sector/Sphere]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=12</guid>
		<description><![CDATA[When I try to introduce Sector/Sphere to people  I meet at conferences, I usually start with one sentence: &#8220;Sector is a system similar to Hadoop&#8221;, because many people know Hadoop and understand how it works more or less, while Sector provides similar functionalities. This claim, however, is not very accurate, as there are many critical [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=12&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>When I try to introduce Sector/Sphere to people  I meet at conferences, I usually start with one sentence: &#8220;Sector is a system similar to Hadoop&#8221;, because many people know Hadoop and understand how it works more or less, while Sector provides similar functionalities. This claim, however, is not very accurate, as there are many critical differences between the two systems.</p>
<p>Sector is not simply a direct implementation of GFS/MapReduce. In fact, when I started to work on Sector in 2005, I have not read the Google paper yet, and I was not aware of Hadoop until 2007. Sector originated from a content distribution system for very large scientific datasets (Sloan Digital Sky Survey). The current version of Sector still supports efficient data access and distribution over wide area networks, a goal that was not considered by the GFS/Hadoop community. Unlike GFS, Sector does not split files. On the one hand, this limit the sizes of files stored in the Sector file system and hence the system usability. On the other hand, it also greatly improves data transfer and processing performance when proper file sizes are used.</p>
<p>Sector &#8211; to be accurate, Sphere, as part of Sector &#8211; supports arbitrary user defined functions (UDFs)  to be applied to any data segment (a record, a group of records, a file, etc.) and allows the result to be written to independent files or to be sent to multiple bucket files by a user-defined key. The UDF model turned out to be equivalent to the MapReduce model as each UDF can simulate a Map operation, while the output organization according to keys can simulate a Reduce operation. Note that the &#8220;key&#8221; in Sector UDF is not part of a data record; it is used for output destination only. While MapReduce treats each record as a &lt;key, value&gt; pair, Sector sees all data in binary format and leaves the specific processing to the UDF.</p>
<p>The table below compares Sphere UDF model and MapReduce. You can rewrite any MapReduce computation using Sphere UDFs. Sphere uses persistent record index instead of a run-time parser. The Map and Reduce operations can be replaced with one or more UDFs. Finally, the Sphere output can be written into Sector files. A more detailed list of different technologies used in Sector can be found on the Sector <a href="http://sector.sourceforge.net/tech.html">website</a>.</p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="150" valign="top"><strong>Sphere</strong></td>
<td width="168" valign="top"><strong>MapReduce</strong></td>
</tr>
<tr>
<td width="150" valign="top">Record Offset Index</td>
<td width="168" valign="top">Parser / Input Reader</td>
</tr>
<tr>
<td width="150" valign="top">UDF</td>
<td width="168" valign="top">Map</td>
</tr>
<tr>
<td width="150" valign="top">Bucket</td>
<td width="168" valign="top">Partition</td>
</tr>
<tr>
<td width="150" valign="top">-</td>
<td width="168" valign="top">Compare</td>
</tr>
<tr>
<td width="150" valign="top">UDF</td>
<td width="168" valign="top">Reduce</td>
</tr>
<tr>
<td width="150" valign="top">-</td>
<td width="168" valign="top">Output Writer</td>
</tr>
</tbody>
</table>
<p>Overall, Sector performs 2 &#8211; 20 times faster than Hadoop in our benchmark applications. It its worth trying, especially when you a C++ developer.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/12/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/12/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/12/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=12&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2010/03/02/sector-vs-hadoop/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>Sector 2.0</title>
		<link>http://verycloud.wordpress.com/2010/01/31/sector-2-0/</link>
		<comments>http://verycloud.wordpress.com/2010/01/31/sector-2-0/#comments</comments>
		<pubDate>Sun, 31 Jan 2010 20:53:50 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[Sector/Sphere]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=6</guid>
		<description><![CDATA[Sector version 2.0 is in the QA stage and will be released soon. This is a major milestone for the Sector project: 1) Sector 2.x is ready for production; 2) the code structure is re-designed to accept contributions from a large community. Technical Improvement Since the last version 1.24a, we have added several new features, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=6&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Sector version 2.0 is in the QA stage and will be released soon. This is a major milestone for the Sector project: 1) Sector 2.x is ready for production; 2) the code structure is re-designed to accept contributions from a large community.</p>
<p><strong>Technical Improvement</strong></p>
<p>Since the last version 1.24a, we have added several new features, including on-disk metadata and in-memory objects. Previously, Sector keeps metadata in memory, which is very fast but may limit the number of files supported in the system. The new on-disk metadata will support much large number of files. Both metadata systems will exist in version 2.0 with the default set to the in-memory one for performance reason (we expect that  in the majority of systems the in-memory metadata structure will be enough to hold all the file information).</p>
<p>In fact, in Sector 2.0, it is easy to support a new metadata  structure by inheriting from a base C++ class. Switching between different metadata management can be done via the configuration files. In addition, because all the metadata system define the same interface, it is possible to have different metadata structures on different nodes.</p>
<p>Version 2.0 introduces in-memory objects. A UDF can create an in-memory object (a pointer to an allocated data structure in memory) while another UDF can access it. This can significantly increase certain iterative algorithms. The in-memory object can be released by a UDF when it is not needed any longer.</p>
<p><strong>Code Structure Reorganization</strong></p>
<p>We have reorganize the code structure so that the Sector system can be developed by a large development group.  Most functionalities, including security control, metadata, master synchronization, services, and client API have been separated from each other . These parts uses internal protocols to communicate between each other. By this way, each part can be independently developed or improved as long as it implements the protocol interface.</p>
<p>In version 2.0, only a single header file is needed to include when programming Sector. This makes it easy for a standard installation (one header file and one library file in the system directory).</p>
<p>During the reorganization, we have also improved the code quality. The new clearer and more loosely-coupled structure make Sector less prone to design and programming bugs.</p>
<p><strong>Plans for 2010</strong></p>
<p>We will continue to improve the software by providing better performance, more reliable software, and more detailed documentations. Minor versions (2.1, 2.2, etc.) may be released once a quarter.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/6/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/6/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/6/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=6&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2010/01/31/sector-2-0/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>SC09 in Portland</title>
		<link>http://verycloud.wordpress.com/2009/11/21/sc09-in-portland/</link>
		<comments>http://verycloud.wordpress.com/2009/11/21/sc09-in-portland/#comments</comments>
		<pubDate>Sat, 21 Nov 2009 06:29:50 +0000</pubDate>
		<dc:creator>Yunhong Gu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://verycloud.wordpress.com/?p=3</guid>
		<description><![CDATA[[Yunhong Gu travelled to SC09 as a Research Scientist of the Laboratory for Advanced Computing, University of Illinois, Chicago.] SC09 in Portland was my 7th SC trip and was one of the most memorable ones. Big Data and Bio-computing This year our two topics are big data and bio-computing. As one of the first group [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=3&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>[Yunhong Gu travelled to SC09 as a Research Scientist of the Laboratory for Advanced Computing, University of Illinois, Chicago.]</p>
<p>SC09 in Portland was my 7th SC trip and was one of the most memorable ones.</p>
<p><strong>Big Data and Bio-computing</strong></p>
<p>This year our two topics are big data and bio-computing. As one of the first group to start working on cloud computing, we have been developing big data cloud for 4 years now and the Sector/Sphere platform and the Open Cloud Testbed (OCT) has became to mature. Few testbed can match the scale of the wide area system of OCT, consisting 256 nodes in 4 data centers connected by 10Gb/s networks. New racks are being added and the testbed will be doubled soon.</p>
<p>Sector/Sphere now include most basic functionality modules of a distributed file system and a parallel data processing engine. It can significantly outperform Hadoop in data analytics applications. Visitors to the booth were very interested in the details of Sector/Sphere.</p>
<p>During the past several years, we have also investigated a significant amount of time on bio-computing. In SC09 bio-computing systems were demonstrated as well.</p>
<p><strong>Bandwidth Challenge</strong></p>
<p>BWC has a special significance for us. We have been participating BWC since it was first introduced in 2000 as the oldest competition in SC. Like many other competitions, experiences does help. after 5 failed tries, we first won it in SC06, and then again in SC08. At SC09 we shared the trophy with two other teams who have also been in most of the past competitions and have made remarkable demonstrations: Caltech and University of Tokyo.</p>
<p>This year our entry also included significant contributions from the hard-working guys from Naval Research Laboratory and iCAIR/Northwestern University.</p>
<p>As always, we could not have achieved any of this with our friends providing the network infrastructure support. In this year, National Lambda Rail, Force10, Alan Verlo, Kevin McGrattan, Kazumi Kumazoe and many others provided special support to our BWC entry.</p>
<p><strong>Portland</strong></p>
<p>The host city is always a part of the SC. With more than 10,000 attendees, SC is not just hosted by a conference center, it is hosted by a city. In Portland, most conference hotels are located either within walking distance or near free public transportation. This year I stayed in the city center, about 1.3 miles from the convention center. Taking the convenient public transport makes it feels like that I have lived there for years.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/3/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/3/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/3/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=3&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2009/11/21/sc09-in-portland/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/848d66bf36343faeb2271ceaf52a830d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">yunhong</media:title>
		</media:content>
	</item>
		<item>
		<title>Hello world!</title>
		<link>http://verycloud.wordpress.com/2009/09/23/hello-world/</link>
		<comments>http://verycloud.wordpress.com/2009/09/23/hello-world/#comments</comments>
		<pubDate>Wed, 23 Sep 2009 22:29:56 +0000</pubDate>
		<dc:creator>verycloud</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false"></guid>
		<description><![CDATA[Welcome to WordPress.com. This is your first post. Edit or delete it and start blogging!<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=1&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Welcome to <a href="http://wordpress.com/">WordPress.com</a>. This is your first post. Edit or delete it and start blogging!</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/verycloud.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/verycloud.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/verycloud.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/verycloud.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/verycloud.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/verycloud.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/verycloud.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/verycloud.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/verycloud.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/verycloud.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/verycloud.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/verycloud.wordpress.com/1/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/verycloud.wordpress.com/1/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/verycloud.wordpress.com/1/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=verycloud.wordpress.com&amp;blog=9625497&amp;post=1&amp;subd=verycloud&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://verycloud.wordpress.com/2009/09/23/hello-world/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/e6f8393a7296779b39a24cb39804b8e7?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">verycloud</media:title>
		</media:content>
	</item>
	</channel>
</rss>
