W3C Architecture Domain

W3C Team Project Review: P2P

Outline - Goals

P2P, a buzzword??

Yes

abusive use of the term, for every kind of distributed computing environment, like Grid, multiagent systems... very similar concepts. Content Distribution Networks... Open Hypermedia Systems (OHS)

new wording = hype

No

deserved its own word as an application for end users, not started as a research topic, sneaked into the academic world after its public success

The non-geek view:

P2P Architectures

Network of peers as opposed to server-based exchanges (e.g. IM vs IRC)

Once upon a time: Napster (1999)

Need to hide: Gnutella (2000)

Hybridation: KaZaA/FastTrack (2001)

Hybridation

Similar architecture adopted in 2002-2003 in Gnutella2 (hubs and leafs)

And next?

P2P Balkanization?

Lots of different P2P software and servers, some with open source implementations (WinMX/Winny, eDonkey, eMule, etc.), forks implementing variants, etc.

Why? What made some success and others fail?

Here comes the **AA...

... and the ISPs

A 2002 study evaluated the average guy traffic as 43% P2P and 14% Web surfing

Countermeasures: encryption, tunneling into UDP packets

Actual Usability (2003)

Usability not that good but huge success of "free" content

BitTorrent (2001)

In the meanwhile...

Content Distributed Networks (CDNs) become more interesting (less cost for the original distributor, in particular bandwidth) and more feasible. Large distributed CPU network was already a reality (rc5, SETI, climate prediction, etc.)

BitTorrent's success?

A 2004 study estimated BitTorrent as high as 30% of all the Internet Traffic, and a 60% total of P2P

Many extensions for security, improvements on distribution of the data and the metadata (multiple trackers)

Legal problems still there, so are the consequences: poisoning chunks, filtering, etc.

and P2P applications were born

Same problems: Efficiency should be improved, NAT problem, ISPs not playing nice, even if efforts on caching extensions (e.g. Comcast & Azureus/Vuze)

Overlay Network: Routing between the Nodes

Distributed Hashtables (DHT)

Comes from dsitributed caching area (e.g. CARP)

Circle topology (Kademlia, Chord)

More structured P2P approaches

Kademlia is already in use (Overnet, Kad networks, BitTorrent extension)

Challenges of Overlay nets

Could address lots of issues at the same time:

Lower level: Network issues

Overlay network does not solve lower level issues. e.g. JXTA assumes a complete and secure network layer under its own architecture

Conclusions? Questions!

P2P revealed much bigger questions on the use of Internet

In 2007, after 4 fours behind P2P, HTTP traffic raised up to almost 50% of the Internet traffic

More P2P applications and Commercial networks to come

BitTorrent forks?

Links - Partial & mostly unordered bibliography