3.4 AO Links

We now show how the semantic network overall structure of Internet matches with the internal AO-structure of SOUR.

The HTML source text of the URL currently being accessed can also be used to extract AO link information. Each hyperlink to an external file will be identified as an inlink of the AO that abstracts the current URL. Possible references include:

Hyperlink references
- e.g. <A HREF = ``URL''>...</A>. References of this kind will create links identified by the ``Part Of'' label;
Image references
- e.g. <IMG SRC = ``URL''>...</IMG> References of this kind will create links identified by the ``Image Of'' label;
Embedded references
- e.g. <EMBED SRC = ``URL''> References of this kind will create links identified by the ``Embedded In'' label.

Hyperlink, Image and Embedded references become inlinks after the following procedures:

  1. the references are mapped onto SOUR AOs following the way described earlier in section 3;
  2. the resulting AOs are conceptualized into the SOUR system;
  3. the references are identified as links under the current conceptualization.

As an example, consider the access to the following address:

http://www.di.uminho.pt/cnw3.html

Directly from the URL it is possible to extract the following AO information:

AO Name http://www.di.uminho.pt/cnw3.html
AO Address /cnw3.html
AO Type HTML
AO Class HTTPHTML
Domain0 www
Domain1 di
Domain2 uminho
Domain3 pt

Now consider that the HTML file identified by the previous URL is the following:


<HTML>

<HEAD> <TITLE> WWW National Conference </TITLE> </HEAD>

<BODY>

<H1>
<CENTER>
WWW National Conference <P>
<IMG ALIGN=MIDDLE SRC="/IMI/imi2-ing-interlace.gif"> <P>
Internet Multimedia Information
</CENTER>
</H1>

<H2>
<CENTER>
July 6-8, 1995 <P>
<A HREF="http://www.di.uminho.pt/english-um.html">Minho University</A> <P>
<A HREF="http://s700.uminho.pt/braga.html>Braga</A>,
<A HREF="http://s700.uminho.pt/homepage-pt.html>Portugal</A>
</CENTER>
</H2>

</BODY>

</HTML>

From the analysis of the HTML source text we obtain the following references:

  1. <IMG ALIGN=MIDDLE SRC="/IMI/imi2-ing-interlace.gif">
  2. <A HREF="http://www.di.uminho.pt/english-um.html">
  3. <A HREF="http://s700.uminho.pt/braga.html>
  4. <A HREF="http://s700.uminho.pt/homepage-pt.html>

References 1 and 2 will be analyzed in detail in the next section. The other two (references 3 and 4) will originate AOs

AO Name http://s700.uminho.pt/braga.html
AO Address /braga.html
AO Type HTML
AO Class HTTPHTML
Domain0 s700
Domain1 uminho
Domain2 pt

and

AO Name http://s700.uminho.pt/homepage-pt.html
AO Address /homepage-pt.html
AO Type HTML
AO Class HTTPHTML
Domain0 s700
Domain1 uminho
Domain2 pt

Finally, these two AOs will produce the inlinks:

which will become part of the conceptualization of the current URL.

Figure 3 shows the result of the conceptualization of the URL ``http://www.di.uminho.pt/cnw3.html''. This figure displays both the links and the comparison relations among AOs.

Figure 3: AO Links


F. Luís Neves and José N. Oliveira , "Classifying Internet Objects" in WWW National Conference'95, Minho University, Braga, Portugal