[BUCH][B] Web data mining: exploring hyperlinks, contents, and usage data

B Liu - 2011 - Springer
Liu has written a comprehensive text on Web mining, which consists of two parts. The first
part covers the data mining and machine learning foundations, where all the essential …

Focused crawling: a new approach to topic-specific Web resource discovery

S Chakrabarti, M Van den Berg, B Dom - Computer networks, 1999 - Elsevier
The rapid growth of the World-Wide Web poses unprecedented scaling challenges for
general-purpose crawlers and search engines. In this paper we describe a new hypertext …

[BUCH][B] Mining the Web: Discovering knowledge from hypertext data

S Chakrabarti - 2002 - books.google.com
Mining the Web: Discovering Knowledge from Hypertext Data is the first book devoted
entirely to techniques for producing knowledge from the vast body of unstructured Web data …

Web crawling

C Olston, M Najork - Foundations and Trends® in Information …, 2010 - nowpublishers.com
This is a survey of the science and practice of web crawling. While at first glance web
crawling may appear to be merely an application of breadth-first-search, the truth is that …

Handling long-tail content in a content delivery network (CDN)

D Fullagar, C Newton, L Lipstone - US Patent 9,762,692, 2017 - Google Patents
A content delivery network has at least a first tier of servers. A content delivery method
includes, at a first server in the first tier of servers, obtaining a request from a client for a …

Internet content delivery network

DA Farber, RE Greer, AD Swart, JA Balter - US Patent 6,654,807, 2003 - Google Patents
Resource requests made by clients of origin servers in a network are intercepted by reflector
mechanisms and selectively reflected to other servers called repeaters. The reflectors select …

Delivering resources to clients in a distributed computing environment with rendezvous based on load balancing and network conditions

DA Farber, RE Greer, AD Swart, JA Balter - US Patent 8,296,396, 2012 - Google Patents
(21) Appl. No.: 11/980,686 (22) Filed: Oct. 31, 2007 (65) Prior Publication Data US
2008/0215755A1 Sep. 4, 2008 Related US Application Data (60) Continuation of application …

Policy-based content delivery network selection

M Brady, M Yevmenkin, PE Stolorz, JK Salmon… - US Patent …, 2010 - Google Patents
In a framework wherein resources of a content provider may be delivered to clients from
different domains, a method distributes the requests based on content-provider policies. In …

Internet content delivery network

DA Farber, RE Greer, AD Swart, JA Balter - US Patent 7,054,935, 2006 - Google Patents
Resource requests made by clients of origin servers in a network are intercepted by reflector
mechanisms and selectively reflected to other servers called repeaters. The reflectors select …

Handling long-tail content in a content delivery network (CDN)

D Fullagar, C Newton, LR Lipstone - US Patent 8,930,538, 2015 - Google Patents
A content delivery network has at least a first tier of servers. A content delivery method
includes, at a first server in the first tier of servers, obtaining a request from a client for a …