GB2417104A - Searching for and indexing mobile content on a network - Google Patents

Searching for and indexing mobile content on a network Download PDF

Info

Publication number
GB2417104A
GB2417104A GB0417999A GB0417999A GB2417104A GB 2417104 A GB2417104 A GB 2417104A GB 0417999 A GB0417999 A GB 0417999A GB 0417999 A GB0417999 A GB 0417999A GB 2417104 A GB2417104 A GB 2417104A
Authority
GB
United Kingdom
Prior art keywords
internet
content
mobile
web server
indexer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
GB0417999A
Other versions
GB0417999D0 (en
Inventor
Stephen John Doyle
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to GB0417999A priority Critical patent/GB2417104A/en
Publication of GB0417999D0 publication Critical patent/GB0417999D0/en
Publication of GB2417104A publication Critical patent/GB2417104A/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A web server configured to provide an indication to an internet indexer that mobile content, such as WAP sites, is available at a certain location. An internet indexer configured to interrogate a web server as to the availability of mobile content is also provided, as is an internet indexer configured to examine internet content from a web server for the presence of a signature of a content-transcoding scheme.

Description

INDICATING MOBILE CONTENT
The invention relates to search engines, internet directories and the like and their interaction with web servers.
The internet can be accessed using many different types of device. For example, material from the internet, hereinafter termed "internet content", can be downloaded into desktop and laptop computers, personal digital assistants (PDAs) and mobile telephones. However, different types of device have different capabilities in terms of being able to present internet content to a person. For example, a web page with a complicated visual layout and animated parts is more suited to presentation to a person through a PC than through a mobile telephone. Indeed, such a web page is unlikely to display at all on a mobile telephone. Thus, it is normal to provide internet content that is tailored to the capabilities of devices such as mobile telephones, PDAs and the like that, in comparison with desktop computers, have small memories and small screen sizes (usually due to compromises between functionality and portability). Commonly, such internet content is referred to as "mobile content" and the sector of the internet that is concerned with its delivery is referred to as the "mobile internet". In this document, internet content that has not been adapted for the needs of such resource- constrained devices will be referred to as "plain" internet content.
It is widely appreciated that a person might have difficulty in locating internet content of interest and entities known as search engines have been devised to address this problem.
Broadly speaking, a search engine is a collection of software processes that a person can query through the internet in order to locate internet content of interest. in essence, a search engine is based around a database of locations of internet content and search software for interrogating the database with queries received from the internet. The locations within the database are indexed with indications of the internet content that is to be found at those locations and the search software is capable of resolving received queries into information that can be compared against the locations or their indexing information in order to formulate replies that contain locations from the database that most closely match the queries. Other facilities, similar to search engines, for locating internet content of interest, also exist. For example, there are directories of internet content in which one can select a branch of internet content of interest and peruse internet locations that provide internet content of the selected type. In this document, search engines, directories and like services will be referred to as "internet indexes".
Clearly, since the internet is dynamic, internet indexes need to be kept up to date. For this purpose, an internet index will normally include a software element such as a web crawler, a spider or a web robot. Such elements will hereinafter be referred to as "indexers". The purpose of an indexer is to visit internet locations and report back to an internet index with indications of the internet content to be found at those locations so that the internet index can be refreshed.
So prevalent at present is the use of desktop and laptop computers to access the internet that internet indexes and, consequently, indexers provide very little support for users of the mobile internet. Thus, it can be difficult for a person to locate mobile content of interest.
According to one aspect, the invention provides a web server that is configured to provide an indication to an internet indexer that mobile content is available at a certain location.
Thus, the invention provides an interaction between an internet indexer and a web server that facilitates the support of mobile content by internet indexes.
In some embodiments, the indication is provided in internet content, such as a web page, that has been requested by an indexer. The indication can be provided in a header within the internet content or in a mete tag within the internet content.
The indication may direct an internet indexer to another internet location for mobile content or the indication may indicate that mobile content is available from the same location via, e.g., a content transcoding scheme.
The indication may be in the form of a signature of a content transcoding scheme.
The indication may be available in a file that is customarily provided to give instructions to visiting internet indexers.
The invention also relates to an interact indexer configured to interrogate a web server as to the availability of mobile content.
The invention also relates to an internet indexer that is configured to examine internet content from a web server for the presence of a signature of a content transcoding scheme.
By way of example only, certain embodiments of the invention will now be described with reference to Figure 1 which shows several devices connected to the internet.
More specifically, Figure 1 shows a desktop PC 2, a PDA 4, a mobile telephone 6, a search engine 8 and a web server 10, all connected to the internet 12. The PDA 4 and the mobile telephone 6 use the mobile internet whereas PC 2 does not.
Search engine 8 has a database 14 of uniform resource locators (URLs). Each URL in the database 14 is annotated with an indication of whether the internet content at that URL is, or can be provided in the form of, mobile content and with an indication of the subject matter of the internet content to be found at that URL. The PC 2 can contact the search engine 8 to seek out plain internet content whilst the mobile telephone 6 and the PDA 4 can use the search engine to locate mobile content.
In order to keep the database 14 current, the search engine 8 also includes an internet indexer 18, which is typically a web crawler, a spider or a web robot. The internet indexer 18 visits URLs and assesses the internet content that it finds there with the aim of including the visited URLs in the database 14 with appropriate annotations.
The interaction between the internet indexer 18 and a web server will now be described in more detail using, as an example, the case where the internet indexer probes a URL "www.mobile-life.com" held at web server 10. To initiate this process, internet indexer 18 sends web server 10 a standard request for the internet content that is held at the URL "www.mobile-life.com": GET http://www.mobile-life.corr/HTTP/1.1 Host:www. mobile-life.com User-Agent: SlipStream Search Engine Indexer Accept:*/* where "SlipStream Search Engine Indexer" is the name assigned to internet indexer 18.
The nature of web server l O determines the response that is made to this request. First, let us consider the case where web server 10 is capable of content transcoding, i.e. where web server lO is capable of providing internet content in both "mobile" and "plain" forms. In such circumstances, the web server lO will usually respond by default with plain internet content that is located at the URL "www.mobile-life.com", as follows: HTTP/1.1 200 OK Set-Cookie: JSESSIONID=E5B3F22A404C9D5DB81D70E73C351E16C; Path=/ X-Mobile-Compatible: generator=SlipStream&vendor=www.mobile-llfe.com Content-Type: text/html Date: Mon. 29 Mar 2004 16:59:21 GMT Server: Apache-Coyote/1.1 Connection: close Content-Length: 1435 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> <head> <title>Mobile Life - Leading The Way With Mobile Web Solutions</title> <meta http-equiv="Content-Type" content="text/htmli charset=iso-8859 1"/> <link href="styles/regweb.cas" rel="stylesheet" media="screen" type="text/css"/> </head> <body> <p>Welcome to SlipStream, the easiest way to create web sites for mobile devices and PDA's.</p> <p>This section could contain large images, frames and large content blocks that would not display on a mobile device</p> </body> </html> The internet indexer 18 evaluates this response and informs the search engine 8 that plain content is available at the URL "www.mobile-life.com" and also provides the search engine with an indication of the subject matter of that content. The search engine 8 can use this information to add or, as the case may be, update a record in database 14 concerning s the URL "www.mobile-life.com". As mentioned, the response from the web server 10 contains plain internet content that is unlikely to be presented successfully by a device requiring mobile content. However, the response from the web server lO contains the header X-Mobile-Compatible which indicates to internet indexer 18 that mobile content is also available at the same URL, "www.mobile-life.com", on account of the content transcoding capabilities of the web server.
Upon detection of the header X-Mobile-Compatible, the interact indexer 18 sends a request to web server 10 for mobile content from the URL "www. mobile-life.com": GET htEp://www.mobile-life.com/HTTP/1.1 Host:www.mobilelife.com User-Agent: Slipstream Search Engine Indexer X-Confirm-MobileContent:* Accept:*/* This request contains the header X-Confirm-MobileContent which specifies a list of MIME types to check for. In this instance, an asterisk is specified as a "wild card", indicating that all MIME types should be checked for. The web server 10 now responds with mobile content: HTTP/1.1 200 OK X-Mobile-Compatible: generator=SlipStream&vendor=www.mobile-life.com Content-Type: application/vnd.wap.xhtml+xml X-Confirm-Mobile-Content Reply:text/vnd.wap. wml;application/xhtml+xml;application/vOd.wap.xhtml+ xml;text/html Date: Mon. 29 March 2004 18:19:02 GMT Server: Apache=Coyote/1.1 Connection: close Content=Length: 1435 <?xml version-"1.0"?><!DOCTYPE html PUBLIC "-//OPENWAVE//DTD XHTML Mobile 1.0//EN" "http://www.openwave.com/DTD/xhtml-mobilel0.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xmlns:wml="htEp://www.wapforum. org/2001/wml" xml:lang="en"> <head> <title>Welcome To Mobile Life</title> </head> <body> <p>Welcome to Slipstream, the easiest way to create web sites for mobile devices and PDA's.</p> <p>This section is for mobile devices and has limitations regarding image size, no frames support and small content blocks. </p> </body> </html> This time the response contains the header X-Confirm-Mobile-Content-Reply listing the formats in which the web server 10 can deliver mobile content. The internet indexer 18 informs the search engine 8 that mobile content is available from the URL "www.mobile-life.com" in the stated formats and also provides the search engine with an indication of the subject matter of that content. The search engine 8 uses this information to update its record in database 14 for the URL "www.mobile-life.com".
It should be noted that the wild card in the header X-Confirm-MobileContent sent out by internet indexer 18 could be replaced with a set of one or more desired types of mobile content, e.g.: X-Confirm-MobileContent:text/vnd.wap.wml;application/xhtml+xml in the case where the web server 10 can handle all of the requested content types, then the header X-Confirm-Mobile-Content-Reply returned by web server 10 becomes: XConfirm-Mobile-Content-Reply:text/vnd.wap.wml;application/xhtml+xml It is thus clear that a mechanism is provided by which an internet indexer can unambigiously determine whether a web server supports specific types of mobile content.
Let us now examine the case where web server 10 is not capable of content transcoding.
In these circumstances, web server 10 responds to the initial request from the indexer 18 with plain internet content as follows: HTTP/1.1 200 OK Set-Cookie: JSESSIONID=E5B3F22A404C9D5DB8lD70F73C35E16C; Path=/ Content-Type: text/html Date: Mon. 29 Mar 2004 16:59:21 GMT Server: Apache-Coyote/1.1 Connection: close Content-Length: 1435 <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> <head> <title>Mobile Life - I.eading The Way With Mobile Web Solutions</title> <meta name="Mobile-Compatible" content="location=xhtmlmpindex.html, wmlindex.wml generator=SlipStream&URL=www.mobile-life.com"/> <meta htEp-equiv="Content-Type" content="text/htmli charset=iso-8859- 1"/> <link href="styles/regweb.css" rel="stylesheet" media="screen" type="text/css"/> </head> <body> <p>Welcome to Slipstream, the easiest way to create web sites for mobile devices and PDA's.</p> <p>This section could contain large images, frames and large content blocks that would not display on a mobile device</p> </body> </html> The internet indexer 18 evaluates this response and informs the search engine 8 that plain content is available from the URL "www.mobile-life.com" and also provides the search engine with an indication of the subject matter of that content. The search engine 8 uses this information to add, or as the case may be, update a record in database 14 concerning the URL "www. mobile-life.com". The response from web server 10 to the query from the internet indexer 18 contains the mete tag "Mobile-Compatible" which indicates to the indexer 18 that internet content is available from two alternative URLs, "www.mobile-life.com/htmlmpindex.html" and "www.mobile-life.com/wmlindex. wml", one for XHTML Mobile Profile (XHTMLMP) content and one for WAP Markup Language (WML) content, respectively.
The internet indexer 18 recognises the mete tag "Mobile-Compatible" and proceeds to retrieve the mobile content from the specified URLs. Consider the case where the internet indexer 18 requests mobile content from the URL "www.mobile-life.com/htmlmpindex.html" by issuing a request: GET http://www.mobile-life.com/xhtmlmpindex.html HTTP/1.1 Host:www.mobilelife.com User-Agent: Slipstream Search Engine Indexer Accept:*/* The web server 10 responds by returning mobile content as follows: HTTP/1.1 200 OK X-Mobile-Compatible: generator=SlipStream&vendor=www.mobile-life.com Content-Type: appllcation/vnd.wap.xhtml+xml Date: Mon. 29 Mar 2004 18:19:02 GMT Server: Apache-Coyote/1.1 Connection: close Content-Length: 1435 <?xml version="1.0"?><,DOCTYPE html PUBLIC "-//OPENWAVE//DTD XHTML Mobile 1.0//EN" "http://www.openwave.com/DTD/xhtml-mobilelO.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xmlns:wml="http://www.wapforum. org/2001/xml" xml:lang="en"> <head> <title>Welcome to Mobile Life</title> </head> <body> <p>Welcome to Slipstream, the easiest way to create web sites for mobile devices and PDA's.</p> <p>This section is for mobile devices and has limitations regarding image size, no frames support and small content blocks. </p> </body> </html> The internet indexer 18 evaluates the retrieved mobile content and informs the search engine 8 that mobile content is available from the URL "www.mobile-life.com/htmlmpindex.html" and also provides the search engine 8 with an indication of the subject matter of that content. The search engine 8 uses this information to add or, as the case may be, update a record concerning the URL "www.mobile-life.com/htmlmpindex.html" in database 14. The internet indexer can, if desired, then proceed to request mobile content from any other URLs specified in the "Mobile-Compatible" mete tag, and provide information about those URLs to the search engine 8 for database 14.
Another embodiment makes use of the convention that web servers make available control files containing instructions for visiting internet indexers to obey. Such a control file can be modified to include an indication that mobile content is available from its web server or some other source. A visiting internet indexer can pick up the mobile content information from the control file and feed this back to a search engine.
In yet another embodiment, the internet indexer is configured to recognise a signature of a content transcoding system upon visiting a URL. The signature can include, for example, one or more of the product name of the transcoding scheme, the URL of the developer or publisher of the transcoding scheme and information identifying the version of the transcoding scheme. Such an internet indexer can be arranged to feed back to a search engine information about any mobile content that can be derived from signature analysis.

Claims (14)

  1. A web server configured to provide an indication to an internet indexer that mobile content is available at a certain location.
  2. 2. A web server according to claim 1, wherein said indication identifies the specific type or types of mobile content that are available.
  3. 3. A web server according to claim 1 or 2, wherein the indication is in internet content served by the server to the indexer.
  4. 4. A web server according to claim 3, wherein the indication is in a header of said internet content.
  5. 5. A web server according to claim 3, wherein the indication is in mete tag information in said internet content.
  6. 6. A web server according to any one of claims 3 to 5, wherein the location specified in the indication is different to the location of said internet content.
  7. 7. A web server according to claim 3, wherein the indication is or includes a signature of a content transcoding scheme that appears in said internet content.
  8. 8. A web server according to claim 1, wherein the web server is configured to make a control file available to a visiting internet indexer and said indication is included within said control file.
  9. 9. An internet indexer configured to interrogate a web server as to the availability of mobile content.
  10. 10. An internet indexer according to claim 9, configured to interrogate a web server as to the availability of one or more specific types of mobile content.An internet indexer configured to examine internet content from a web server for the presence of an indication that mobile content is available from the server.
  11. 11. An internet indexer according to claim 11, wherein said indication identifies the specific type or types of mobile content that are available.
  12. 12. An internet indexer according to claim 11, wherein said indication is or includes a signature of a content-transcoding scheme.
  13. 13. An internet indexer configured to examine a location specified by a web server as providing mobile content to verify that mobile content is available at said location.
  14. 14. A reference structure, such as a search engine, that can be interrogated to locate internet content of interest, the structure being configured to present to an interrogating device only search results that pertain to internet content that can be handled by said device.
GB0417999A 2004-08-12 2004-08-12 Searching for and indexing mobile content on a network Withdrawn GB2417104A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
GB0417999A GB2417104A (en) 2004-08-12 2004-08-12 Searching for and indexing mobile content on a network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB0417999A GB2417104A (en) 2004-08-12 2004-08-12 Searching for and indexing mobile content on a network

Publications (2)

Publication Number Publication Date
GB0417999D0 GB0417999D0 (en) 2004-09-15
GB2417104A true GB2417104A (en) 2006-02-15

Family

ID=33017403

Family Applications (1)

Application Number Title Priority Date Filing Date
GB0417999A Withdrawn GB2417104A (en) 2004-08-12 2004-08-12 Searching for and indexing mobile content on a network

Country Status (1)

Country Link
GB (1) GB2417104A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1124360A2 (en) * 2000-02-09 2001-08-16 Nec Corporation Data conversion system and data conversion method thereof
WO2002037867A2 (en) * 2000-10-30 2002-05-10 Nokia Corporation Content converter portal
GB2376767A (en) * 2001-06-22 2002-12-24 Hewlett Packard Co Portal to allow access to web pages using different formats
GB2392744A (en) * 2002-09-05 2004-03-10 Hewlett Packard Co Alternative web content provision

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1124360A2 (en) * 2000-02-09 2001-08-16 Nec Corporation Data conversion system and data conversion method thereof
WO2002037867A2 (en) * 2000-10-30 2002-05-10 Nokia Corporation Content converter portal
GB2376767A (en) * 2001-06-22 2002-12-24 Hewlett Packard Co Portal to allow access to web pages using different formats
GB2392744A (en) * 2002-09-05 2004-03-10 Hewlett Packard Co Alternative web content provision

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
"Mobile Search Engines White Paper", Sonera MediaLab, 15th November 2002, http://www.medialab.sonera.fi/workspace/MobileSearchEnginesWhitePape.pdf *
"Search Engine Watch", 20th February, 2002, http://searchenginewatch.com/links/article.php/2156391 *
Greg Notess, "Web Wanderings", August 2002, http://notess.com/write/archive/200008ww.html *
Jones M., et al. "Sorting Out Searching on Small Screen Devices", September 2002, Proceedings of the 4th International Symposium on Mobile HCI, pages 81-94 *
Kennedy, "Wireless Search Engines: Bigger Isn't Better", 23rd January, 2001 http://www.wirelessdevnet.com/channels/wireless/features/wireless_search.phtml *

Also Published As

Publication number Publication date
GB0417999D0 (en) 2004-09-15

Similar Documents

Publication Publication Date Title
US8429201B2 (en) Updating a database from a browser
KR101389969B1 (en) Message Catalogs for Remote Modules
AU2010201642B2 (en) Remote module incorporation into a container document
US7299298B2 (en) Web address converter for dynamic web pages
KR101059452B1 (en) Control communication within container documents
US20020143523A1 (en) System and method for providing a file in multiple languages
US20040044747A1 (en) Link navigator method and system for locating web content
US20070204010A1 (en) Remote Module Syndication System and Method
WO2005052811A1 (en) Searching in a computer network
JP2012069163A (en) Mobile site map
WO2006081143A2 (en) Technique for modifying presentation of information displayed to end users of a computer system
US20120191686A1 (en) System and Method for Assisting a User with Searching Multimedia Objects
JP2010508579A (en) Personalized search using macros
CN101523393A (en) Locally storing web-based database data
US20030204573A1 (en) Method of providing a web user with additional context-specific information
US20060101417A1 (en) Automatic generation of device and user profiles
US8990680B2 (en) Methods and systems for managing fault-tolerant webpage presentation
EP1980973A1 (en) Personal information leakage preventive device and method
US7797447B1 (en) Data detector for creating links from web content for mobile devices
GB2417104A (en) Searching for and indexing mobile content on a network
Goel et al. Making links on your web pages last longer than you
US9727650B2 (en) Method for delivering query responses
KR20020069795A (en) The method for recognize markup language to utilize universal document type definition and its readable form
Garofalakis et al. MokE: a tool for Mobile-ok evaluation of Web Content
KR20010047539A (en) System and method for generation static markup language files supporting diverse platforms

Legal Events

Date Code Title Description
WAP Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1)