The development of cloud technology and the web-based technology and serching technology is becoming important. About the web crawler technology which collects URL for serching, one of issues of the distributed crawler system is the effective URL split. Therefore, this study designed algorithm to split the URL LIST effectively collected by the web crawler and the split algorithm environment of URL LIST collected by web crawler was composed by implementing depositary in Hadoop environment. © 2011 Springer-Verlag.
CITATION STYLE
Lim, I. K., Kim, Y. H., Kang, S. G., & Lee, J. K. (2011). A study on the split algorithm of URL LIST collected by web crawler. In Communications in Computer and Information Science (Vol. 184 CCIS, pp. 492–499). https://doi.org/10.1007/978-3-642-22333-4_64
Mendeley helps you to discover research relevant for your work.