|
> Homepage >
The Journal >
Issues Contents >
Vol. 11 (2007) >
Paper 2 |
|
VOLUME 11 (2007): ISSUE 1. PAPER 2The Performance of South African and
Kenyan Universities on the World Wide Web: a Web Link Analysis Omwoyo Bosire
Onyancha*, Dennis N. Ocholla**,1 Abstract The study used Link Analysis to compare Kenyan and South African universities according to several Web-based indicators, some of which include the number of pages, and the number of in and out-links. The authors examined the external out-links in order to determine the institutions targeted by South African and Kenyan universities. Also investigated were the networks or links between universities. Web Impact Factors (WIFs) were calculated and reported in order to compare the universities’ web influence. Results indicate that Kenyan universities, like most African universities, have embraced the Internet and its constructs fairly recently, hence most of their websites are at initial stages of construction. Comparatively, South African universities have made remarkable progress in their web presence, which is at an advanced stage of development, equaling counterparts in more developed countries. The study recommends that regional webometric studies be conducted periodically in order to investigate and map the web-related developments of African universities. It concludes that African universities, though not comparable to counterparts in developed countries, can have their websites evaluated webometrically. KeywordsWebometrics, link analysis, universities, Kenya, South Africa 1. Introduction The emergence of the Internet and the World Wide Web (WWW) has revolutionalized not only scholarly communication (although publishing on the WWW is yet to gain as much recognition as traditional scientific publishing) but also the manner in which institutions and individuals offer information concerning their services and products. Studies indicate that the Web is becoming a significant communication medium for science and scholarship (Cronin & McKim, 1996). Institutions of higher learning, and particularly universities, have embraced the WWW and its many features, enabling activities such as the: provision of online library catalogues; promotion of the existence and achievements of individuals, research groups, institutes and departments; and the dissemination of research findings, either through hosting online articles or publishing summaries, data sets or tools (Noruzi, 2005). This scenario has provided scholars with additional tools with which they can and/or have already used to evaluate universities. Thus, evaluation of universities, which was previously limited to the use of bibliometric analyses (publications count, citations analysis and patent analysis), expert reviews (peer-reviews), the economic rate of return, case studies, surveys, the analysis of competition for funds, and retrospective analysis (e.g. The Time's Higher Education Supplement [2005]; ARWU: Shanghai Jiao Tong University [2004]; Cybermetrics Lab, 2005), can now also be conducted webometrically. The aforementioned studies made use of a range of indicators (including informetric analysis) to rank institutions of higher learning, and in the process revealed that the performance of African universities is dismal. African universities did not feature in the Times list's 200 top ranking universities, and only the University of Cape Town (r=251), the University of Witwatersrand (r=395), the University of KwaZulu-Natal (r=469), and the University of Pretoria (r=480) made an appearance in the top 500 of the Shanghai Jiao Tong University's list [see http://ed.sjtu.edu.cn/rank/2006/ARWU2006/ARWU2006.xls]. A webometric ranking of world universities conducted by Cybermetrics Lab (2005) lists the top African university (i.e. University of Cape Town) at number 356. The absence or low ranking of African universities from/within world rankings has raised many concerns. Although many have queried the methods and criteria of evaluation, there has been general concern as to whether or not African Universities are really ready for cybermetric studies, given the region's poor economic and technology related developments, which have led considerably to the ever increasing digital divide between African countries and the developed world. Despite this, we acknowledge the fact that there are now several link analysis studies on university web sites. The application of bibliometric/informetric approaches by library and information scientists to web related studies is growing increasingly common. Several studies have applied publications count and citation analysis techniques whilst studying patterns of web information production, organization, storage, retrieval and use, and the influence of websites on each other. These studies have led to the establishment of webometrics as a viable field of research in Library and Information Science (LIS). Examples of LIS-related webometric studies include Björneborn (2004), Björneborn & Ingwersen 2004), Candan & Li (2002), Ingwersen (1998), Jana & Chatterjee (2004), Thelwall (2002a, 2002b, 2003), and Vaughan & Hysen (2002), to name a few. University Websites have been commonly evaluated using Link Analysis in order to: measure their web impact factors (Thelwall, 2002a); find the most important web pages (Thelwall, 2003); identify link relationships between universities (Thelwall, 2002b; 2002c); classify link types in academic environments on the Web (Bar-Ilan, 2005); and finally, to rank universities (Cybermetrics Lab, 2005). To the best of our knowledge, however, there are no comprehensive studies that have been conducted to report on the performance of African universities on the Web. This study provides the preliminary findings of a broader research project aimed at measuring and comparing the visibility and influence of African universities on the Web using several web indicators. 2. Research questions The paper takes the form of an exploratory study that employs Link Analysis to measure the performance of Kenyan and South African universities on the web. The study considers a question many have raised, i.e. are African institutions of higher learning ripe for cybermetric studies? In order to respond to the above, the current study focused on the following research questions:
3. Methodology 3.1. Procedures The study investigated Kenyan and South African institutions of higher learning that had own websites. At this stage, we targeted only universities. A total of 16 and 21 universities were selected from Kenya and South Africa respectively, based on website ownership and the availability of degree programs. The list of the studied universities in each country is provided in Table 1. Sources of data consisted of the following: Using Link Analysis, this study crawled through and compared Kenyan and South African universities' websites based on several Web-based indicators, some of which include the number of pages, number of out-links (page out-links, directory out-links, domain out-links, and site out-links) and the number of in-links (page in-links, directory in-links, domain in-links, and site in-links). Two approaches were used for data collection. The SocSciBot computer software and the SocSciBot toolkit were respectively used to crawl through university websites and analyze collected data. SocSciBot is a website crawler that is specially designed for research purposes. The software was chosen based on its extensive use for site analysis and its user-friendliness. It also generates a number of reports, some of which were the subject of discussion of this study. Duplicate elimination in SocSciBot is done through comparison of the full HTML of the two pages. The crawled pages were limited to only those without question marks, as in some cases these pages are repeated, and thus crawling them would be an endless task. Data collection was conducted within the same month (April 2006) in order to limit errors commonly associated with frequent website updates. The findings of this study, therefore, are representative of the status of each university regarding web presence and impact as of the first two weeks of April 2006. As a way of triangulation, AltaVista and Google, two top ranked search engines, were used for comparison purposes, particularly in terms of the number of pages, in-links, and the WIF. The following uniform search strategy was used to collect data from the two search engines: 1. The total number of pages linking to the website
2. The total number of pages at the website
External links were examined to identify the most common Top Level Domains (TLDs) targeted by Kenyan and South African universities. We employed the Citation Impact Factor analogy to calculate the universities' Web Impact Factors (WIFs) in order to compare the universities' web influences. The following formula was used to calculate the WIFs:
Visualization networks illustrating the inter-connectivity of universities were provided using Pajek software. 3.2. Limitations of the study As already mentioned, this study was limited to a total of 21 and 16 universities in South Africa and Kenya respectively. Presently, a study that will measure the performance of other universities in the continent is underway. A further limitation of the study relates to the search strategy that was employed, i.e.: site:(domain name). The approach meant that all links, including self-links, were analyzed, which in turn exaggerates the performance of a university with a large number of self-links, hence distorting the WIF of a given university. The citation IF has been faulted on the same premise that in its calculation, all citations (including self-citations) are used. Nevertheless, it is widely acknowledged that self-citations - just as citations from other documents - are indicators of the use made of a given document and thus they (i.e. self-citations) can be included in the calculation of IFs. After all, it has been observed that the "self-citation rate shows only a weak correlation with the impact and subject of a journal. There is also a weak correlation between self-citation rate and the size or specificity of the category (categories) assigned to a journal". (from Journal self-citation in the Journal Citation Reports). It is worth noting, however, that there are limitations associated with the use of self-citations, especially in the calculation of IFs as observed by Seglen (1997). Table 1: List of Kenyan & S. African universities that were the focus of this study
4. Results This section provides and discusses results under five sub-headings, namely:
4.1. Number of pages and out-links Table 1 provides the total number of pages and out-links (links from the universities' web pages) as crawled by SocSciBot. From South Africa, the University of Cape Town produced 119,454 pages and 1,118,132 out-links, followed by the University of Pretoria (111,496; 1,551,541) and Rhodes University (61,418; 2,403,463). In Kenya, the United States International University (USIU) had the highest number of pages (797), as well as the largest number of out-links (11,933). Moi University came second with 352 pages and 2039 out-links, followed by Egerton University with 254 pages and 1002 out-links. Table 1: Kenyan and South African universities: ranked by number of pages
4.2. Web Influence of Kenyan and South African universities Table 2 ranks the universities according to the number of web pages, links to the websites, and WIFs in both AltaVista and Google search engines. The top ranked universities are, in descending order, the University of Witwatersrand, the University of Cape Town, the University of South Africa and the University of Pretoria. The bottom half of Table 2 and Appendix A consists largely of Kenyan universities. None of the universities appeared to rank constantly throughout the variables, i.e. number of web pages, links to the websites and WIFs. For instance, the University of Witwatersrand ranked fifth in the number of web pages in both search engines and in Google's links to the website, and twentieth in Google's WIF, whilst ranking second and twelfth in AltaVista's links to the website and WIF respectively. Table 2 also shows that most of the universities recorded high impact factors: - a situation that may be attributed to the inclusion of self-links. High impact factors were particularly recorded by universities with fewer web pages and a high pattern of in-links. Table 2: Rank distribution of Kenyan and South African Universities by number of web pages, links to websites and WIF in AltaVista and Google search engines
4.3. Universities' networks Figures 1 and 2 are visual maps demonstrating links (only site links were mapped) between the universities within South Africa and Kenya respectively. Figure 3 combines the two maps in order to check for inter-connectivity between South African and Kenyan Universities. Whereas Universities in South Africa do have a fairly well developed network, Kenyan Universities' web inter-connections are few. Unlike South African universities, it was noted that the main linking universities (i.e. Moi University and the United States International University) provided links to universities in their categories (i.e. private or public). Figure 1: University inter-linkages: South Africa
Figure 3 indicates that Kenyan universities, particularly the Jomo Kenyatta University of Agriculture and Technology, Africa Nazarene University, Moi University, University of Nairobi, Kenyatta University, United States international University, and Kiriri Women's University of Science and Technology, do have links with South Africa's Rhodes University, University of Pretoria, University of Witwatersrand, and the University of Cape Town. Furthermore, it was observed that all the links between Kenyan and South African universities emanated from the latter. The nature of these connections could not, however, be established. Figure 2: University inter-linkages: Kenya
Figure 3: University Inter-linkages: South Africa and Kenya
Table 3: Number of in- and out-links amongst Kenyan Universities: ranked by number of page in-links
4.4. In- and out-links amongst universities in Kenya and S. Africa Tables 3 and 4 provide the number of in- and out-links among and between universities in Kenya and South Africa respectively. In Kenya, most out-links were produced by the USIU and MU, which recorded five out-links each for pages, directories, domains and sites, while KU received the highest number of in-links (2 in-links each for pages, directories, domains and sites). In South Africa, the UCT received the highest number of page in-links (357), directory in-links (245), and domain in-links (101), whilst receiving 8 site in-links from other South African universities. Overall, South African universities have performed better in providing links to each other than their Kenyan counterparts, as shown in Figures 1, 2 and 3 and Tables 3 and 4. Table 4: In- and out-links amongst South African Universities: ranked by number of page in-links
4.5. Targeted external websites and/or links Table 5 provides a list of the top 15 most targeted TLD links by the Kenyan and South African universities. In South Africa, the highest ranking TLD domains include: informationweek.com (magazine published in the U.S. that offers news, features and events for technology professionals); java.sun.com (website that offers information on Java programming); boingboing.net (a blog whose main themes include technology, futurism, science fiction, gadgets, intellectual property, and political issues); and forbes.com (American business and financial magazine founded in 1917 by B.C. Forbes). The sites that recorded the highest number of links from Kenyan universities, in descending order, are: usiualumni.com (website for the United States International University's alumni); adobe.com (a company popular for the development of acrobat reader used for desktop publishing); search.freefind.com (website search engine); daystarus.org (Daystar University's parent organization); and fordfound.org (The Ford Foundation - an independent nonprofit grant-making organization). Notable appearances, in Kenya's case, include eastandard.net and nationaudio.com (local newspapers); nbnet.com (an Internet Service Provider [ISP]); google.com (global search engine); search.epnet.com (EBSCO database - offers a variety of information resources/databases) and emeraldinsight.com (Emerald company - publisher of the world's widest range of management and library information science journals, as well as a strong specialist range of engineering, applied science and technology journals); to name a few. It appears that preference has been given to sites that offer daily news. Table 5: Most commonly targeted TLD domains by Kenyan and South African Universities
5. Discussions, Conclusions and recommendations A significant observation that has already been made is that Web-based tools are rarely used within South African universities (Blewett & Singh, 2002), perhaps because in general, African universities have embraced the Internet and its features fairly recently. Consequently, their websites are still relatively new. This late launch into cyberspace could be attributed to the African governments' lack of support or active involvement in web development (Chisenga, 2004). Nevertheless, it was noted in this study that all the universities in the two countries have their own websites, although some of them were under construction at the time we crawled the Web (e.g. Maseno University - which had one page then - has since updated its website). The total number of pages produced by SocSciBot and the two search engines differed significantly, as shown in Table 1 and Appendix A. Even within Appendix A, AltaVista yielded different results to those of Google. In the first instance, the approach that was used to download data presented in Table 1 was different from that used to extract data presented in Appendix A. Appendix A contains all web pages as indexed in AltaVista and Google, while Table 1 consists of only the web pages whose links did not contain question marks. The differences between AltaVista's and Google's coverage may be attributed to differences in indexing. Previous studies have shown that search engine overlaps of web coverage are small (Lawrence & Giles, 1999 and Bar-Ilan, in Thelwall, 2004), therefore meaning that whatever is indexed in one search engine may not necessarily be covered in another. In addition, Thelwall (n.d.) observes that Google only reports a fraction of the links that Google is aware of (approx. 10%), which may explain why Google produced less links to African universities than AltaVista. There were a total of 382453 university pages for the two countries, while the number of out-links, according to SocSciBot data, totaled 5820833, thus creating an average number of 15.2 links per page. Kenyan universities recorded an average of 11.8 links per page from a total of 2457 pages and 28939 out-links, while South African universities had a total of 379996 pages and 5791894 out-links, producing an average score of 15.2 links per page. Comparatively, therefore, South African universities were the most prolific, although the mean number of links per page among them does not differ significantly as might have been expected. An analysis of each university's mean number of pages, using AltaVista and Google, showed that Kenyan Universities produced only 382 pages per university, while South Africa recorded 55,090 pages per university. As a result, South African universities seem to have made remarkable progress in developing their websites, which are at an advanced stage and can be compared to those of their counterparts in developed countries, e.g. Spain, Australia, the UK, Taiwan and New Zealand, which recorded 33187, 71749, 49000, 46754 and 39393 pages respectively per university between 2001 and 2002 (Thelwall, 2004). Impact-wise, Kenya (0.79) had a higher impact factor than South Africa (0.61), a situation that may have been caused by the few pages and several in-links (including self-links) that Kenyan universities yielded. This perhaps explains why using the impact factor as a means of ranking, evaluating or assessing individuals, institutions and countries is problematic. WIF usage with other measures such as the number of pages, number of links, website size, etc.; may yield better rank results as illustrated in this study. Overall, universities from the two countries produced an impact factor of 0.61 (AltaVista) and 0.01 (Google). It was observed that the most commonly targeted external links were largely news sites (especially magazines and newspapers on computers/computing and technology), downloadable freeware, and electronic databases. The former two's high ranking could be attributed to the persons responsible for creating links on the universities' websites. Usually, these people are computer scientists or information technologists. Links to electronic databases such as EBSCO and EMERALD may have originated from the university libraries' websites. A 'content divide' was noted, not only between South African and Kenyan universities' websites, but also between South African universities' websites. The minimal web page content in Kenyan universities is manifested in the universities' few web pages. In South Africa, it was noted that historically advantaged universities' (HAUs) web performance in terms of the number of web pages, content, out-links and in-links, was better than historically disadvantaged universities (HDUs), perhaps due to privileges the former category enjoyed during the apartheid era (Jacobs, 2000) and their subsequent growth following the 'Matthew's' principle. Nevertheless, links between South African universities were manifest. Unlike Kenyan universities, which had poor networks, South African universities exhibited fairly strong linkages/patterns. Seemingly, the broad social gap between the HAUs and the HDUs that existed in the apartheid era is slowly being narrowed. Meanwhile, Kenya's situation may be attributed to the engagement of external/internal poorly skilled webmasters, or unwillingness for collaboration, partnership or sharing among the universities. (For example, one computer science lecturer, whom we casually asked why Kenyan universities have such poor connectivity, noted that these universities are competitors and thus cannot warrant advertising each others' services, products, curricula, etc., on their websites). In conclusion, given that webometric studies are usually based on web-based indicators, which in turn largely depend on how well the website of an institution is developed and visible, it is our view that some universities in Africa may not qualify for comparative webometric studies, especially for ranking purposes, due to underdeveloped or non-existent [yet-to-be constructed] websites. Similarly, universities operate under different economic, political and social conditions, and unless these aspects are taken into consideration, comparative webometric studies meant to rank universities on the basis of quality may be subject to unprecedented criticism. Worth noting as well is that the size of an institution may also influence the quantity of web pages. Nevertheless, African institutions of higher learning have developed web sites which, in our view, should be periodically evaluated. Although weblinks have been used for ranking universities, studies have emphasized that given the constraints of web-link methods, caution should be taken when applying these indicators to rankings. Emphasis on the benefits of web-link studies, such as enabling visibility, should be viewed as strong reasons demonstrating why such webometric studies are vital. However, it should be appreciated that there is no method that is absolutely perfect and reasonably objective. For African universities to compete favorably with the rest of the world and perform better than they are currently performing on the Web there is an urgent need to invest in information technology and to popularize the Web within institutions whilst engaging the services of qualified webmasters in the design and construction of their websites. Other specific measures to be taken include: formulating minimum web development standards; revisiting link development and codification policies so as to increase links for visibility; and placing/locating institutions' products on the web through tools such as Open Access and institutional repositories. Similarly, ICT policies with adequate web development guidelines are critical. We believe that these factors would, among others, assist in improving web development, bridging both the digital and content divide between African universities' websites and those of universities elsewhere, and provide the universities with better visibility. There are a number of unresolved issues that could be on an agenda for future research. Among them are:
6. Acknowledgements A preliminary version of this paper was presented at the 8th Annual Conference on World Wide Web Applications, Bloemfontein (South Africa), 6 to 8 September 2006. We wish to acknowledge and appreciate the contributions made by the University of Zululand [Research Committee] and the University of Eastern Africa, Baraton, for funding part of the travel expenses to attend the conference, and anonymous reviewers of the paper for their input. 7. References ARWU: Shanghai Jiao Tong University. (2006). The academic ranking of World Universities. [Online]. Available http://ed.sjtu.edu.cn/ranking.htm (Accessed 10th April 2006) Björneborn, L. 2004. Small-World Link Structures across an Academic Web Space: A Library and Information Science Approach. A dissertation. [Online]. Available online: http://www.db.dk/lb/phd/phd-thesis-section7.5-LISimplications.pdf (Accessed 10 April 2006). Björneborn, L. & Ingwersen, P. 2004. Towards a basic framework of webometrics. Journal of the American Society for Information Science and Technology, 55(14): 1216-1227 Blewett, C.N. & Singh, U.G. 2002. An Investigation into the Usage of Web-Based Education Tools by University IT Departments in South Africa. In: P.A. Van Brakel, (Ed). 2003. 5th Annual Conference on WWW Applications, Durban, 10-12 September 2003. Online:http://www.dissanet.com (Accessed 10 April 2006). Candan, K. S. & Li, W.S. 2002. Reasoning for Web document associations and its applications in site map construction. Data & Knowledge Engineering, 43:121-150 Chisenga, J. 2004. Africa Governments in Cyberspace: are they Bridging the Content Divide? In: P. Birungi and M.G. Musoke. 2004. SCECSAL XVI. Towards a Knowledge Society for African. [Online]. www.dissanet.com (Accessed 10 April 2006) Cronin, B., & McKim, G. 1996. Science and scholarship on the World Wide Web: a North American perspective. Journal of Documentation, 52(2), 163-171 Cybermetrics Lab (2007). Webometrics ranking of world universities. [Online]. Available http://www.webometrics.info/ (Accessed 10th April 2006) Hertzel, D. 1987. Bibliometrics, history of the development of ideas in: statistical bibliography or bibliometrics? Encyclopaedia of Library and Information Science, 42, 144-219. Ikpaahindi, L. 1985. An overview of bibliometrics: its measurements, laws and their applications. LIBRI, 35 (2), 163-177. Ingwersen, P. 1998. The Calculation of Web impact factors. Journal of Documentation, 54(2):236-243 Jacobs, D. (2000). Institutional status, funding and grading in relation to faculty research productivity. In I. Wormell (ed.). ProLISSA 2000. Proceeding of the 1st biennial ProLISSA Conference, Pretoria, 26 - 27 October 2000. Pretoria: CID University of Pretoria, pp. 143 - 156. Jana, S. & Chatterjee, S. 2004. Quantifying Web-site visits using Web statistics: an extended cybermetrics study. Online Information Review, 28(3): 191-199 Noruzi, A. 2005. Web Impact Factors for Iranian Universities. Webology, 2(1). [Online]. Available www: http://ww.webology.ir/2005/v2n1/a11.html (Accessed 10 April 2006). Seglen, P.O. Why the impact factor of journals should not be used for evaluating research. [Online]. BMJ, 314:497. http://www.bmj.com/cgi/content/full/314/7079/497 (Accessed 21 June 2004). Thelwall, M. 2002a. A comparison of sources of links for academic web impact factor calculations. Journal of Documentation, 58(1):66-78 Thelwall, M. 2002b. An initial exploration of the link relationship between UK University Web sites. ASLIB Proceedings, 54(2):118-126 Thelwall, M. 2002c. Evidence for the existence of geographic trends in university Web site interlinking. Journal of Documentation, 58(5):563-574 Thelwall, M. 2003. Can Google's PageRank be used to find the most important academic web pages? Journal of Documentation, 59(2):205-217 Thelwall, M. 2004. Link analysis: an information science approach. Amsterdam: Elsevier. Thelwall, M. n.d. Using Commercial Search Engines and the Internet Archive. [Online]. Available www: http://linkanalysis.wlv.ac.uk/17.htm (Accessed 10th April 2006) THES: The Times Higher Education Supplement. 2005. World University rankings: who's up? Who is down? [Online]. Available http://www.thes.co.uk/worldrankings/ (Accessed 10th April 2006) Appendix A: University web pages, links to websites and WIFs
Notes 1. Omwoyo Bosire Onyancha, PhD, is a lecturer at the University of South Africa( on study leave) and also Deputy Librarian at the University of Eastern Africa, Baraton, P.O. Box 2500, ELDORET - KENYA. He completed his PhD at the University of Zululand in 2007. Dennis N. Ocholla (PhD) is professor and Head of Department of Library and Information Science, University of Zululand, South Africa. Received Feb 2nd 2007 |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Copyright
information | | | Sitemap |