JPH11306160A

JPH11306160A - Service unit extraction method and device from service application history and recording medium recording extraction program

Info

Publication number: JPH11306160A
Application number: JP11066198A
Authority: JP
Inventors: Takuya Washimi; 卓哉鷲見; Yasuhisa Sakamoto; 泰久坂本; Koji Kishi; 晃司岸; Takeshi Motohashi; 健本橋
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1998-04-21
Filing date: 1998-04-21
Publication date: 1999-11-05

Abstract

PROBLEM TO BE SOLVED: To automatically sort the services of many and unspecified servers by deciding the subsequent information address as a base point of service units if the different computer names are given to the preceding and subsequent addresses showing the information which are included in an information service application history and continuously referred to. SOLUTION: The application history data 21 are produced (S10) to include an information address consisting of a computer name, a directory name and a file name and an item that can identify whether a link is formed by hyperlink based on the information which is used right before the production of the data 21. Then the information addresses which are continuously referred to in the data 21 and preceding and subsequent to a link that is formed by hyperlink are acquired (S20). It's decided whether the computer names included in the said preceding and subsequent information addresses are coincident with each other (S30). If no coincidence is confirmed between both addresses, the computer name and the directory name which are included in the subsequent information address are extracted as a service unit 61 of the information service (S40).

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、ネットワークサー
ビスの利用調査データの分類・統計を得る方法及び装置
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and an apparatus for obtaining classification and statistics of network service usage survey data.

【０００２】[0002]

【従来の技術】近年、ＷＷＷサービス並びに電子メール
等に見られるインターネットを中心としたネットワーク
サービスが一般家庭に普及するのに伴って、ネットワー
ク通信機能の付加されたパソコンの普及も進んでいる。
例えば、ＷＷＷサービスでは、インターネット上にある
複数のＷＷＷサーバマシンがそのマシン内に文字・音声
・映像等のデータを持ち、それらのデータが情報端末の
ブラウザソフトウェアからＨＴＴＰと呼ばれるプロトコ
ルでアクセスして閲覧される。これらのネットワークサ
ービスは、広告・マーケティング媒体としての価値を持
つため、サービス毎の利用状況を客観的に知る必要性が
高まっている。2. Description of the Related Art In recent years, with the spread of WWW services and network services centered on the Internet, such as e-mail, to ordinary households, personal computers with a network communication function have been spreading.
For example, in the WWW service, a plurality of WWW server machines on the Internet have data such as characters, voices, and images in the machines, and the data is accessed and browsed from a browser software of an information terminal using a protocol called HTTP. Is done. Since these network services have value as advertising and marketing media, there is an increasing need to objectively know the usage of each service.

【０００３】しかし、ＷＷＷサービスのように分散した
多数のマシンによるサービス提供形態の場合には、テレ
ビや新聞等他のメディアとは異なり、サービス媒体の数
が多く、しかもサービス媒体の変化が激しいため、サー
ビス媒体の状況を把握することが困難である。従来、Ｗ
ＷＷサーバの利用状況を調べるため、一般的に、アクセ
スログのデータ履歴を分析する手段が用いられている。
データ履歴とは、ＷＷＷサービスで閲覧されたデータ情
報名がアクセスの時間順に並べられているものである。[0003] However, in the case of a service providing form using a large number of distributed machines, such as a WWW service, unlike other media such as televisions and newspapers, the number of service media is large, and the service media changes drastically. However, it is difficult to grasp the status of the service medium. Conventionally, W
In general, a means for analyzing the data history of the access log is used to check the usage status of the WW server.
The data history is a list in which data information names browsed by the WWW service are arranged in order of access time.

【０００４】データ情報の存在する場所及びデータ情報
名を示すために、一般にはhttp://等のプロトコル名に
続けて、ＷＷＷサーバのホスト名及びサーバ内のディレ
クトリ・ファイル名を記述したＵＲＬと称される形式が
使用されている。以下の説明では、このＵＲＬによって
示されたデータ情報を便宜上ページと呼ぶ。従って、デ
ータ履歴を分析することは、データ履歴内の夫々のペー
ジをサービス別に分類すること、即ち、各サービスがど
のサービス単位（サイト）に属するかを判定することで
ある。[0004] In order to indicate the location of data information and the name of the data information, generally, following a protocol name such as http: //, a URL describing the host name of a WWW server and the name of a directory or file in the server are provided. The so-called format is used. In the following description, the data information indicated by the URL is called a page for convenience. Therefore, analyzing the data history means classifying each page in the data history by service, that is, determining which service unit (site) each service belongs to.

【０００５】図９は、ＷＷＷサーバの概要を説明する図
である。同図に示すように、ＷＷＷサーバマシンは、ル
ートディレクトリを先頭とするツリー構造のディレクト
リを有する。各データはサーバ内のツリー型構造のディ
レクトリに情報分野毎に分類されて格納されている。例
えば、あるディレクトリＡから直下にはないディレクト
リＢのファイルｂにハイパーリンクが張られる。FIG. 9 is a diagram for explaining the outline of a WWW server. As shown in the figure, the WWW server machine has a directory having a tree structure starting from a root directory. Each data is classified and stored in a tree-type directory in the server for each information field. For example, a hyperlink is provided from a directory A to a file b in a directory B that is not immediately below.

【０００６】また、各サービスの単位をツリー構造によ
って定めることができる。図１０はサービス単位の説明
図である。同図の例によれば、一つのＷＷＷサーバに
は、ＡさんのホームページとＢさんのホームページとい
う二つのコンテンツが存在する。各コンテンツが夫々一
つのサービスに対応する。この場合、ＷＷＷサーバは情
報別整理の観点から、二つのコンテンツに対して別個の
ディレクトリを作成し、そのディレクトリに情報、即
ち、ページを置くように設定する。かくして、一つのＷ
ＷＷサーバが二つのサービス内容を持ち、各サービス毎
の切り分けはルートディレクトリの直下のディレクトリ
においてなされていることになる。[0006] The unit of each service can be determined by a tree structure. FIG. 10 is an explanatory diagram of a service unit. According to the example shown in the figure, one WWW server has two contents, Mr. A's homepage and Mr. B's homepage. Each content corresponds to one service. In this case, from the viewpoint of organizing information, the WWW server creates separate directories for the two contents, and sets the information, that is, the pages, in the directories. Thus, one W
The WW server has two service contents, and each service is separated in a directory immediately below the root directory.

【０００７】ＷＷＷサーバ内のディレクトリ構造に関す
る情報、及びＷＷＷサーバで提供されているサービスが
どのディレクトリ単位で切り分けられているかを示す情
報が事前に与えられている場合、データ履歴と、上記の
事前に与えられている情報を保持したデータベースとを
照らし合わせることにより、データ履歴のサービスの分
類方法が実現される。[0007] When information relating to the directory structure in the WWW server and information indicating in which directory unit the service provided by the WWW server is divided are given in advance, the data history and the advance By collating with a database holding given information, a method of classifying data history services is realized.

【０００８】しかし、上記の情報が事前に与えられてい
ない場合、分析者がデータ履歴の内容を吟味し、サービ
スの切り分け単位を割り出す必要がある。しかし、この
ような切り分け方法を自動化することは容易ではない。
上記の情報が事前に与えられていない状況で、サービス
の切り分けを自動化をする従来の方法は、パターンによ
って明らかに区別できる範囲の切り分け、例えば、ＷＷ
Ｗサーバ単位での切り分けを行っている。このような切
り分け方法を図Ｙの例に適用すると、同一サーバ内にあ
る「Ａさんのホームページ」と「Ｂさんのホームペー
ジ」という二つのコンテンツは、同一サービスとして分
類されてしまう。このように、従来の方法は、同一サー
バ内にある別個のサービスを同一サービスとして分類す
る。[0008] However, when the above information is not provided in advance, it is necessary for the analyst to examine the contents of the data history and to determine the unit of service separation. However, it is not easy to automate such a dividing method.
In a situation where the above information is not given in advance, a conventional method for automating the separation of services is to separate a range that can be clearly distinguished by a pattern, for example, WW
Classification is performed for each W server. When such a separation method is applied to the example in FIG. Y, two contents “Ms. A's homepage” and “Ms. B's homepage” in the same server are classified as the same service. Thus, the conventional method classifies separate services residing in the same server as the same service.

【０００９】[0009]

【発明が解決しようとする課題】上記従来のサービス分
類方法には以下のような問題点がある。１．予めＷＷＷ
サーバ内のディレクトリ構造を調査してサーバ内で提供
されているサービスを分類しておく方法の場合、分類可
能な対象は当該ＷＷＷサーバ内のサービスに限定され
る。この方法は、不特定多数のＷＷＷサーバ内のサービ
スを対象とするサービス分類を行えないという欠点あ
る。２．また、従来のサービス分類の方法で不特定多数
のＷＷＷサーバのサービスを対象としてサービス分類を
行うためには、ＷＷＷサービスの閲覧データの検証を人
手に頼るか、若しくは、分類の単位をＷＷＷサーバ別と
いった粗い単位で行うで必要がある。人手による検証は
時間的、費用的にコスト高であり、実際上、対象とする
ＷＷＷサーバの範囲が制限されるという欠点がある。ま
た、ＷＷＷサーバ別のような粗い分類は、サービスの分
類単位として有効な場合は非常に稀である。The above-mentioned conventional service classification method has the following problems. 1. WWW in advance
In the case of a method of classifying services provided in the server by examining the directory structure in the server, the classifiable targets are limited to services in the WWW server. This method has a drawback that service classification cannot be performed on services in an unspecified number of WWW servers. 2. In addition, in order to perform service classification for an unspecified number of WWW server services by the conventional service classification method, verification of browsing data of WWW services is manually performed, or the unit of classification is determined by WWW server. It is necessary to do it in such a coarse unit. The manual verification is costly in terms of time and cost, and has a disadvantage that the range of the target WWW server is practically limited. Also, coarse classification such as by WWW server is extremely rare when it is effective as a service classification unit.

【００１０】従って、本発明は、上記従来技術の問題点
に鑑み、不特定多数のＷＷＷサーバのサービスを自動的
に分類することができるサービス利用履歴からのサービ
ス単位の抽出方法及び抽出装置並びに抽出プログラムを
記録した記録媒体の提供を目的とする。[0010] Accordingly, the present invention has been made in view of the above-described problems of the prior art, and has been made in view of the above-described problems. It is intended to provide a recording medium on which a program is recorded.

【００１１】[0011]

【課題を解決するための手段】上記目的を達成するた
め、本発明のサービス利用履歴からサービス単位を抽出
する方法及び装置はデータ履歴の時間的な前後関係に着
目する。本発明は、ハイパーリンクにより結ばれた情報
を辿りながら参照する情報サービスの利用履歴（例え
ば、ＷＷＷサービスの利用履歴）に含まれる連続して参
照された情報を示す前後の情報アドレスのコンピュータ
名が異なる場合に、後の情報アドレスをサービス単位の
基点として判定することを特徴とするサービス利用履歴
からのサービス単位の抽出方法である。In order to achieve the above object, a method and apparatus for extracting a service unit from a service use history according to the present invention focuses on the temporal context of a data history. According to the present invention, the computer names of the information addresses before and after indicating the continuously referenced information included in the use history of the information service (for example, the use history of the WWW service) referred to while tracing the information linked by the hyperlink are described. In a different method, a service unit is extracted from a service usage history, wherein a later information address is determined as a base point of the service unit.

【００１２】図１は本発明の原理を説明するための図で
ある。同図に示す如く、本発明のサービス利用履歴から
サービス単位を抽出する方法は、情報サービスにおいて
利用された情報を提供するコンピュータ名、ディレクト
リ名及びファイル名からなる情報アドレスと、当該情報
が直前に利用された情報からハイパーリンクによってリ
ンクを張られたか否かを識別できる項目とを含む利用履
歴データ２１を作成し（ステップ１０）、上記利用履歴
データ２１の中で連続して参照され、かつ、ハイパーリ
ンクによってリンクを張られた前後の情報アドレスを取
得し（ステップ２０）、上記取得された前後の情報アド
レスに含まれるコンピュータ名が一致するか否かを判定
し（ステップ３０）、一致しないと判定された場合に、
後の情報アドレスに含まれるコンピュータ名及びディレ
クトリ名を上記情報サービスにおけるサービス単位６１
として抽出する（ステップ４０）。FIG. 1 is a diagram for explaining the principle of the present invention. As shown in the figure, the method of extracting a service unit from the service usage history according to the present invention employs an information address including a computer name, a directory name, and a file name that provide information used in an information service, and the information immediately before the information address. The usage history data 21 including an item that can identify whether or not a link has been established by a hyperlink is created from the used information (step 10), and is continuously referred to in the usage history data 21, and The information addresses before and after the link by the hyperlink are acquired (step 20), and it is determined whether or not the computer names included in the acquired information addresses before and after match (step 30). If it is determined,
The computer name and the directory name included in the subsequent information address are converted into the service unit 61 in the information service.
(Step 40).

【００１３】また、本発明のサービス利用履歴からサー
ビス単位を抽出する装置２は、情報を提供する情報サー
バ６と、上記情報サーバ６に接続され、ハイパーリンク
により結ばれた情報を辿りながら参照するブラウザ４と
からなる情報サービスシステムにおいて、上記情報サー
ビスシステムにおいて利用された情報の情報アドレス
と、当該情報が直前に利用された情報からハイパーリン
クによってリンクを張られたか否かを識別できる項目と
を含む利用履歴データの記憶手段２０と、上記ブラウザ
４による情報の利用状況を監視し、上記利用履歴データ
を作成し、上記記憶手段２０に格納する手段１０と、上
記利用履歴データの中で連続して参照され、かつ、ハイ
パーリンクによってリンクを張られた前後の情報アドレ
スを取得する手段３０と、上記取得された前後の情報ア
ドレスに含まれるコンピュータ名が一致するか否かを判
定するため、上記取得された前後の情報アドレスを比較
する手段４０と、上記後の情報アドレスに含まれるコン
ピュータ名及びディレクトリ名を上記情報サービスシス
テムにおけるサービス単位として抽出し、サービス単位
データ記憶手段６０に格納する手段５０とからなる。Further, the apparatus 2 for extracting a service unit from the service use history according to the present invention refers to an information server 6 for providing information, and is connected to the information server 6 and traces information linked by a hyperlink. In the information service system including the browser 4, the information address of the information used in the information service system and an item that can identify whether the information is linked by a hyperlink from the information used immediately before are described. A storage means 20 for storing usage history data, a means 10 for monitoring the usage status of the information by the browser 4, creating the usage history data, and storing it in the storage means 20; Means 3 for acquiring information addresses before and after being referred to and linked by a hyperlink Means 40 for comparing the acquired information addresses before and after to obtain whether or not the computer names included in the acquired information addresses before and after match, and a computer included in the information addresses after the acquired information addresses Means 50 for extracting a name and a directory name as a service unit in the information service system, and storing the extracted service name and directory name in the service unit data storage means 60.

【００１４】また、本発明のサービス利用履歴からサー
ビス単位を抽出するプログラムを記録した記録媒体は、
ハイパーリンクにより結ばれた情報を辿りながら参照す
る情報サービスの利用履歴に含まれる連続して参照され
た前後の情報アドレスのコンピュータ名が異なる場合
に、後の情報アドレスをサービス単位の基点として判定
するプロセスからなることを特徴とするサービス利用履
歴からサービス単位を抽出するプログラムを記録した記
録媒体である。[0014] A recording medium for recording a program for extracting a service unit from a service use history according to the present invention includes:
When the computer name of the information address before and after successively included in the use history of the information service referred to while tracing the information linked by the hyperlink is different, the subsequent information address is determined as the base point of the service unit. It is a recording medium that records a program for extracting a service unit from a service usage history, which is characterized by comprising a process.

【００１５】また、本発明の記録媒体は、情報サービス
において利用された情報を提供するコンピュータ名及び
ディレクトリ名を含む情報アドレスと、当該情報が直前
に利用された情報からハイパーリンクによってリンクを
張られたか否かを識別できる項目とを含む利用履歴デー
タの中で連続して参照され、かつ、ハイパーリンクによ
ってリンクを張られた前後の情報アドレスを取得させる
プロセスと、上記取得された前後の情報アドレスに含ま
れるコンピュータ名が一致するか否かを判定させるプロ
セスと、一致しないと判定された場合に、後の情報アド
レスに含まれるコンピュータ名及びディレクトリ名を上
記情報サービスにおけるサービス単位として抽出させる
プロセスとからなることを特徴とするサービス利用履歴
からサービス単位を抽出するプログラムを記録した記録
媒体である。In the recording medium of the present invention, an information address including a computer name and a directory name providing information used in the information service and a link from the information used immediately before by the information are hyperlinked. A process of acquiring information addresses before and after consecutively referred to in the usage history data including items that can identify whether or not the information addresses are linked by a hyperlink; and A process for determining whether or not the computer names included in the information service match; and a process for extracting the computer name and the directory name included in the subsequent information address as a service unit in the information service when it is determined that they do not match. Service unit based on service usage history A recording medium for recording a program to be extracted.

【００１６】また、本発明の記録媒体は、情報サービス
システムにおいて利用された情報の情報アドレスと、当
該情報が直前に利用された情報からハイパーリンクによ
ってリンクを張られたか否かを識別できる項目とを含む
利用履歴データの中で連続して参照され、かつ、ハイパ
ーリンクによってリンクを張られた前後の情報アドレス
を取得させるプロセスと、上記取得された前後の情報ア
ドレスに含まれるコンピュータ名が一致するか否かを判
定するため、上記取得された前後の情報アドレスを比較
させるプロセスと、上記後の情報アドレスに含まれるコ
ンピュータ名及びディレクトリ名を上記情報サービスシ
ステムにおけるサービス単位として抽出させるプロセス
とからなることを特徴とするサービス利用履歴からサー
ビス単位を抽出するプログラムを記録した記録媒体であ
る。Further, the recording medium of the present invention includes an information address of information used in the information service system, and an item capable of identifying whether or not the information is linked by a hyperlink from information used immediately before. The process of acquiring the information addresses before and after consecutively referenced in the usage history data including and linked by the hyperlink matches the computer name included in the acquired information addresses before and after In order to determine whether or not the information address before and after the obtained information address is compared, a process of extracting the computer name and directory name included in the subsequent information address as a service unit in the information service system, Extract service unit from service usage history characterized by Which is a recording medium for recording the program.

【００１７】[0017]

【発明の実施の形態】以下では、利用者毎のデータ履歴
の前後関係に着目してサービス単位の項目を自動的に抽
出する本発明の実施の形態について説明する。図３は本
発明の第１の実施例のサービス単位抽出システムの構成
図である。複数台のＷＷＷサーバ６₁，６₂，．．．が
ネットワーク９を介して複数台の利用者ＷＷＷブラウザ
４₁，４₂，．．．に接続され、利用者が情報サービス
を享受する。各利用者ＷＷＷブラウザ４₁，
４₂，．．．には、夫々、サービス単位抽出装置２₁，
２₂，．．．が接続され、サービス利用履歴からサービ
ス単位を抽出する。また、サービス単位抽出装置２₁，
２₂，．．．は、ネットワークを介して、抽出されたサ
ービス単位を格納するデータベース９を含むセンタサー
バ８に接続されている。サービス単位抽出装置及び利用
者ブラウザは、例えば、所謂情報端末の一部として実現
してもよい。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the present invention will be described below in which items of a service unit are automatically extracted by focusing on the context of data history for each user. FIG. 3 is a configuration diagram of the service unit extraction system according to the first embodiment of this invention. A plurality of WWW servers 6 ₁ , 6 ₂ ,. . . Via the network 9, a plurality of user WWW browsers 4 ₁ , 4 ₂ ,. . . And the user enjoys the information service. Each user WWW browser 4 ₁ ,
42 _,. . . Includes a service unit extraction device 2 ₁ ,
2 ₂ ,. . . Is connected, and a service unit is extracted from the service usage history. Also, the service unit extraction device 2 ₁ ,
2 ₂ ,. . . Is connected via a network to a center server 8 including a database 9 for storing the extracted service units. The service unit extraction device and the user browser may be realized, for example, as a part of a so-called information terminal.

【００１８】サービス単位抽出装置２₁，２₂，．．．
は、ハイパーリンクにより結ばれた情報を辿りながら参
照するＷＷＷサービス等の情報サービスの利用履歴か
ら、連続して参照されたＵＲＬ(Uniform Resource Loca
tor)等の情報アドレスに含まれるホスト名等のコンピュ
ータ名が異なる場合に、移動後の情報アドレスをサイト
等のサービス単位の基点とみなすことによりサービス利
用履歴からサービス単位を抽出する。The service unit extraction devices 2 ₁ , 2 ₂ ,. . .
Is a URL (Uniform Resource Locator) that is continuously referenced from the usage history of an information service such as a WWW service that refers while tracing information linked by a hyperlink.
When the computer name such as the host name included in the information address such as tor) is different, the service unit is extracted from the service use history by regarding the moved information address as the base point of the service unit such as the site.

【００１９】本例におけるＷＷＷサービスの特徴は次の
通りである。（Ａ）ＷＷＷサービスは、各データがハイパーリンクに
よって網の目状につながっている。利用者は、あるＷＷ
Ｗ上のページからハイパーリンクを辿って次々と情報を
閲覧することができる。（Ｂ）ＷＷＷサーバ上において、一般的にデータはツリ
ー型（ディレクトリ）構造を取り、同一サービスに関す
るコンテンツはそのツリー型構造において同一ノードに
収められている。The features of the WWW service in this example are as follows. (A) In the WWW service, each data is connected in a network pattern by a hyperlink. The user is a certain WW
Information can be browsed one after another by following hyperlinks from the page on W. (B) On a WWW server, data generally takes a tree-type (directory) structure, and contents related to the same service are stored in the same node in the tree-type structure.

【００２０】以下、図３の構成図に示されたＷＷＷサー
バＡ及びサーバＢのディレクトリ構造が示されている図
４を参照して、本発明の第１の実施例によるサービス単
位抽出の具体例を説明する。尚、以下の説明では、ＵＲ
Ｌのファイルは拡張子ｈｔｍｌを持つものだけを取り扱
う。また、図５は本発明の第１の実施例によるサービス
単位抽出の処理フローチャートである。Referring now to FIG. 4 showing the directory structure of WWW server A and server B shown in the configuration diagram of FIG. 3, a specific example of service unit extraction according to the first embodiment of the present invention will be described. Will be described. In the following description, UR
Only files with the extension html are handled as L files. FIG. 5 is a flowchart of a service unit extraction process according to the first embodiment of the present invention.

【００２１】ステップ１００：ユーザがＷＷＷサーバＡ
（ＵＲＬ名 http://www.a.com/)上のルートディレクト
リ(/) 上に存在するファイル(e.html)を閲覧する。ステップ１０２：その際、ユーザが http://www.a.com/
e.htmlというページを閲覧していることがＵＲＬ表記に
よる履歴データ上に記載される。ステップ１０４：ユーザがe.html上のハイパーリンクを
辿り、同じサーバＡ（http://www.a.com/)上のディレク
トリ(/job/) に存在する別のファイル(f.html)を閲覧す
る。Step 100: The user is the WWW server A
Browse the file (e.html) that exists on the root directory (/) on (URL name http://www.a.com/). Step 102: At this time, when the user enters http://www.a.com/
The fact that the user is browsing the page e.html is described in the history data in URL notation. Step 104: The user follows the hyperlink on e.html and copies another file (f.html) existing in the directory (/ job /) on the same server A (http://www.a.com/). browse.

【００２２】ステップ１０６：このとき、ユーザは ht
tp://www.a.com/job/f.html というページを閲覧したこ
とになり、履歴データ上に記録される。このとき、履歴
データは、 1. http://www.a.com/e.html 2. http://www.a.com/job/f.html hyper のように時系列に従って記録されている。ここで、hype
r という文字は、履歴１から履歴２への移動は、e.html
上のハイパーリンクによって行われたという事実を表
すため便宜上記載したものである。Step 106: At this time, the user enters ht
You have visited the page tp: //www.a.com/job/f.html and it is recorded on the history data. At this time, the history data is recorded in chronological order, such as 1. http://www.a.com/e.html 2. http://www.a.com/job/f.html hyper . Where hype
The letter r means that the transition from History 1 to History 2 is e.html
It is provided for convenience to indicate the fact that it was done by the hyperlink above.

【００２３】ステップ１０８：ここで、前の履歴１と後
の履歴２はハイパーリンクで結ばれているため、前後の
履歴のＵＲＬに含まれるサーバ名を比較する。ステップ１１０：このとき、両方のサーバ名は http:/
/www.a.com/ であり一致しているので、閲覧サーバの変
化が無いと判定される。同一サーバ内でのディレクトリ
移動であるため、履歴２のディレクトリは、サービス単
位として抽出されない。Step 108: Here, since the previous history 1 and the subsequent history 2 are connected by a hyperlink, the server names included in the URLs of the preceding and succeeding histories are compared. Step 110: At this time, both server names are http: /
Since it is /www.a.com/ and they match, it is determined that there is no change in the browsing server. Since the directory is moved within the same server, the directory of the history 2 is not extracted as a service unit.

【００２４】ステップ１１２：次に、ユーザが f.html
上のハイパーリンクを辿り、別のサーバＢ（ＵＲＬ名
http://www.b.co.jp/ ）上にあるディレクトリ(/musi
c/)上のファイル g.html を閲覧する。ステップ１１４：このとき、http://www.b.co.jp/music
/g.html というページを閲覧したことになるので、履歴
データは、 1. http://www.a.com/e.html 2. http://www.a.com/job/f.html hyper 3. http://www.b.co.jp/music/g.html hyper のように更新される。Step 112: Next, the user enters f.html
Follow the hyperlink above to another server B (URL name
http://www.b.co.jp/) directory (/ musi
Browse the file g.html on c /). Step 114: At this time, http://www.b.co.jp/music
Since you have viewed the page /g.html, the history data is 1. http://www.a.com/e.html 2. http://www.a.com/job/f.html hyper 3. Updated as http://www.b.co.jp/music/g.html hyper.

【００２５】ステップ１１６：前の履歴２と後の履歴３
はハイパーリンクで結ばれているため、前後の履歴のＵ
ＲＬに含まれるサーバ名を比較する。ステップ１１８：このとき、前の履歴２のサーバ名は
http://www.a.com/ であり、後の履歴３のサーバ名は
http://www.b.co.jp/ で両者は一致しないので、閲覧サ
ーバの変化有りと判定される。Step 116: History 2 before and History 3 after
Is linked by a hyperlink, so U
The server names included in the RL are compared. Step 118: At this time, the server name of the previous history 2 is
http://www.a.com/ and the server name of History 3 is
Since the two do not match at http://www.b.co.jp/, it is determined that the browsing server has changed.

【００２６】ステップ１２０：異なるサーバ間でのディ
レクトリ移動であるため、履歴３の情報アドレス http:
//www.b.co.jp/music/g.html からファイル名 g.html
を除いた部分のＵＲＬ http://www.b.co.jp/music/
より下位のページをサービス単位として抽出し、センタ
サーバに送信する。ステップ１２２：センタサーバは、 http://www.b.co.j
p/music/ をサービス単位データベースに登録する。Step 120: Since the directory is moved between different servers, the information address of the history 3 http:
//www.b.co.jp/music/g.html to file name g.html
URL except for http://www.b.co.jp/music/
The lower page is extracted as a service unit and transmitted to the center server. Step 122: The center server is connected to http: //www.b.co.j
Register p / music / in the service unit database.

【００２７】同様の処理が履歴３から履歴４、履歴４か
ら履歴５及び履歴５から履歴６に対して繰り返される。ステップ１２４：次に、ユーザがファイル g.html上の
ハイパーリンクを辿り、同じサーバＢ（http://www.b.c
o.jp/)上のディレクトリ(/suzuki/books/)に存在する別
のファイル(h.html)を閲覧する。A similar process is repeated for histories 3 to 4, histories 4 to 5 and histories 5 to 6. Step 124: Next, the user follows the hyperlink on the file g.html and the same server B (http: //www.bc
Browse another file (h.html) in the directory (/ suzuki / books /) on o.jp/).

【００２８】ステップ１２６：このとき、ユーザは ht
tp://www.b.co.jp/suzuki/books/h.html というページ
を閲覧したことになり、履歴データ上に記録される。こ
のとき、履歴データは、 1. http://www.a.com/e.html 2. http://www.a.com/job/f.html hyper 3. http://www.b.co.jp/music/g.html hyper 4. http://www.b.co.jp/suzuki/books/h.html hyper のように更新される。Step 126: At this time, the user enters ht
This means that you have visited the page tp: //www.b.co.jp/suzuki/books/h.html and it is recorded on the history data. At this time, the history data is: 1.http: //www.a.com/e.html 2.http: //www.a.com/job/f.html hyper 3.http: //www.b. co.jp/music/g.html hyper 4. Updated as http://www.b.co.jp/suzuki/books/h.html hyper.

【００２９】ステップ１２８：ここで、前の履歴３と後
の履歴４はハイパーリンクで結ばれているため、前後の
履歴のＵＲＬに含まれるサーバ名を比較する。ステップ１３０：このとき、両方のサーバ名は http:/
/www.b.co.jp/ であり一致しているので、閲覧サーバの
変化が無いと判定される。同一サーバ内でのディレクト
リ移動であるため、履歴４のディレクトリは、サービス
単位として抽出されない。Step 128: Here, since the preceding history 3 and the following history 4 are connected by a hyperlink, the server names included in the URLs of the preceding and succeeding histories are compared. Step 130: At this time, both server names are http: /
/www.b.co.jp/, which matches, it is determined that there is no change in the browsing server. Since the directory is moved within the same server, the directory of the history 4 is not extracted as a service unit.

【００３０】ステップ１３２：次に、ユーザがファイル
h.html上のハイパーリンクを辿り、同じサーバＢ（ht
tp://www.b.co.jp/)上のディレクトリ(/suzuki/)に存在
する別のファイル(i.html)を閲覧する。ステップ１３４：このとき、ユーザは http://www.b.c
o.jp/suzuki/i.htmlというページを閲覧したことにな
り、履歴データ上に記録される。このとき、履歴データ
は、 1. http://www.a.com/e.html 2. http://www.a.com/job/f.html hyper 3. http://www.b.co.jp/music/g.html hyper 4. http://www.b.co.jp/suzuki/books/h.html hyper 5. http://www.b.co.jp/suzuki/i.html hyper のように更新される。Step 132: Next, the user enters the file
Following the hyperlink on h.html, the same server B (ht
Browse another file (i.html) in the directory (/ suzuki /) on tp: //www.b.co.jp/). Step 134: At this time, the user goes to http: //www.bc
The page "o.jp/suzuki/i.html" is browsed and recorded on the history data. At this time, the history data is: 1.http: //www.a.com/e.html 2.http: //www.a.com/job/f.html hyper 3.http: //www.b. co.jp/music/g.html hyper 4.http: //www.b.co.jp/suzuki/books/h.html hyper 5.http: //www.b.co.jp/suzuki/i. Updated like html hyper.

【００３１】ステップ１３６：ここで、前の履歴４と後
の履歴５はハイパーリンクで結ばれているため、前後の
履歴のＵＲＬに含まれるサーバ名を比較する。ステップ１３８：このとき、両方のサーバ名は http:/
/www.b.co.jp/ であり一致しているので、閲覧サーバの
変化が無いと判定される。同一サーバ内でのディレクト
リ移動であるため、履歴５のディレクトリは、サービス
単位として抽出されない。Step 136: Here, since the preceding history 4 and the following history 5 are connected by a hyperlink, the server names included in the URLs of the preceding and succeeding histories are compared. Step 138: At this time, both server names are http: /
/www.b.co.jp/, which matches, it is determined that there is no change in the browsing server. Since the directory is moved within the same server, the directory of the history 5 is not extracted as a service unit.

【００３２】ステップ１４０：次に、ユーザが i.html
上のハイパーリンクを辿り、別のサーバＡ（ＵＲＬ名
http://www.a.com/ ）上にあるディレクトリ(/sports
/baseball)上のファイル j.htmlを閲覧する。ステップ１４２：このとき、http://www.a.com/sports/
baseball/j.html というページを閲覧したことになるの
で、履歴データは、 1. http://www.a.com/e.html 2. http://www.a.com/job/f.html hyper 3. http://www.b.co.jp/music/g.html hyper 4. http://www.b.co.jp/suzuki/books/h.html hyper 5. http://www.b.co.jp/suzuki/i.html hyper 6. http://www.a.com/j.html hyper のように更新される。Step 140: Next, the user enters i.html
Follow the above hyperlink to another server A (URL name
http://www.a.com/) directory (/ sports
Browse the file j.html on / baseball). Step 142: At this time, http://www.a.com/sports/
Since you have visited the page called baseball / j.html, the history data is 1. http://www.a.com/e.html 2. http://www.a.com/job/f. html hyper 3.http: //www.b.co.jp/music/g.html hyper 4.http: //www.b.co.jp/suzuki/books/h.html hyper 5.http: // www.b.co.jp/suzuki/i.html hyper 6. Updated as http://www.a.com/j.html hyper.

【００３３】ステップ１４４：前の履歴５と後の履歴６
はハイパーリンクで結ばれているため、前後の履歴のＵ
ＲＬに含まれるサーバ名を比較する。ステップ１４６：このとき、前の履歴５のサーバ名は
http://www.b.co.jp/であり、後の履歴６のサーバ名は
http://www.a.com/ で両者は一致しないので、閲覧サ
ーバの変化有りと判定される。Step 144: History 5 before and History 6 after
Is linked by a hyperlink, so U
The server names included in the RL are compared. Step 146: At this time, the server name of the previous history 5 is
http://www.b.co.jp/
Since the two do not match at http://www.a.com/, it is determined that the browsing server has changed.

【００３４】ステップ１４８：異なるサーバ間でのディ
レクトリ移動であるため、履歴６の情報アドレス http:
//www.a.com/sports/baseball/j.html からファイル名
j.html を除いた部分のＵＲＬ http://www.a.com/sp
orts/baseball/ より下位のページをサービス単位とし
て抽出し、センタサーバのサービス単位データベースに
登録する。Step 148: Since the directory is moved between different servers, the information address of the history 6 http:
// File name from www.a.com/sports/baseball/j.html
URL except for j.html http://www.a.com/sp
Extract pages lower than orts / baseball / as service units and register them in the service unit database of the center server.

【００３５】以上の処理によって、センタサーバのデー
タベースに登録されたサービス単位は、 http://www.b.co.jp/music/ http://www.a.com/sports/baseball/ である。With the above processing, the service unit registered in the database of the center server is http://www.b.co.jp/music/ http://www.a.com/sports/baseball/. .

【００３６】上記のサービス単位として抽出するか否か
を判定するためのサーバ名の比較は、前のＵＲＬのサー
バ名と後のＵＲＬのサーバ名との文字列比較により実現
することができる。また、ＵＲＬからサービス単位を抽
出するためのファイル名の除去操作は、文字列の一部を
除去することにより実現される。本例のサービス単位の
抽出処理の例についてまとめると、以下の二つの条件：１．直前の閲覧ページからハイパーリンクで現閲覧ペー
ジに移動している。２．直前閲覧ページと現閲覧ページとではＷＷＷサーバ
が異なる。が充たされる場合、現閲覧ページのディレクトリがサー
ビス単位として抽出される。The comparison of the server name for determining whether or not to extract as a service unit can be realized by a character string comparison between the server name of the preceding URL and the server name of the subsequent URL. The operation of removing the file name for extracting the service unit from the URL is realized by removing a part of the character string. To summarize an example of the service unit extraction processing of the present example, the following two conditions are satisfied. A hyperlink has been taken to the current browse page from the previous browse page. 2. The WWW server is different between the last browse page and the current browse page. Is satisfied, the directory of the current browse page is extracted as a service unit.

【００３７】本実施例において、現閲覧ページが直前閲
覧ページと同一サーバ内の別のノードに収められている
場合、現閲覧ページはサービス単位として抽出されな
い。これは、一般のＷＷＷサービスにおいて、「同一サ
ービスのコンテンツが別ノードに収められている場合が
無視できない程度に存在する」という実情を考慮してい
るためである。現閲覧ページが直前閲覧ページと同一サ
ーバ内の別のノードに収められている場合に現閲覧ペー
ジをサービス単位として抽出すると、切り分けが必要な
コンテンツ単位が抽出されるのみならず、切り分け不要
なコンテンツ単位も同時に抽出され、かつ、切り分け不
要なコンテンツの方が切り分けが必要なコンテンツより
も多くなる傾向がある。しかし、必要に応じて、現閲覧
ページが直前閲覧ページと同一サーバ内の別のノードに
収められている場合に、現閲覧ページをサービス単位と
して抽出するように変形することは容易である。In this embodiment, if the current browse page is stored in another node in the same server as the immediately preceding browse page, the current browse page is not extracted as a service unit. This is because a general WWW service takes into account the fact that "a case where contents of the same service are stored in different nodes exists to an extent that cannot be ignored". If the current browse page is stored in another node on the same server as the previous browse page, extracting the current browse page as a service unit not only extracts content units that need to be separated, but also content that does not need to be separated. Units are also extracted at the same time, and contents that do not need to be divided tend to be more than contents that need to be divided. However, if necessary, when the current browse page is stored in another node in the same server as the previous browse page, it is easy to change the current browse page to be extracted as a service unit.

【００３８】また、直前閲覧ページと現閲覧ページとが
同一サーバかつ同一ノード内に在るときは、同一コンテ
ンツである可能性が高いため、現閲覧ページはサービス
単位として抽出されない。尚、上記本発明の第１の実施
例の構成において、サービス単位抽出装置が各利用者ブ
ラウザ毎に設けられている。しかし、サービス単位抽出
装置をサービス単位抽出システムに１台だけ設けるよう
に構成することも可能である。図６は、サービス単位抽
出装置２がセンタサーバ８内に設けられている本発明の
第２の実施例によるサービス単位抽出システムの構成図
である。本発明の第２の実施例におけるサービス単位抽
出処理は、上記本発明の第１の実施例に関して説明した
サービス単位抽出処理と同様に動作する。When the immediately preceding browse page and the current browse page are in the same server and the same node, there is a high possibility that the content is the same, so the current browse page is not extracted as a service unit. In the configuration of the first embodiment of the present invention, a service unit extraction device is provided for each user browser. However, it is also possible to configure so that only one service unit extraction device is provided in the service unit extraction system. FIG. 6 is a configuration diagram of a service unit extraction system according to the second embodiment of the present invention in which the service unit extraction device 2 is provided in the center server 8. The service unit extraction processing according to the second embodiment of the present invention operates in the same manner as the service unit extraction processing described with respect to the first embodiment of the present invention.

【００３９】また、上記実施例の説明では、サービス単
位抽出処理は、ユーザがハイパーリンクにより結ばれた
情報を辿りながら情報を参照するとき、同時にその履歴
からサービス単位を抽出し、抽出されたサービス単位を
センタサーバに通知している。しかし、本発明は、かか
る例に限定されることはなく、例えば、リアルタイムに
抽出したサービス単位を、後で一括してセンタサーバに
通知してもよく、或いは、予め作成されたユーザ履歴デ
ータに対しサービス単位の抽出を行ってもよい。In the description of the above embodiment, the service unit extraction process is such that when a user refers to information while tracing information linked by a hyperlink, the service unit is simultaneously extracted from the history and the extracted service unit is extracted. The unit is notified to the center server. However, the present invention is not limited to such an example. For example, service units extracted in real time may be collectively notified to the center server later, or the user history data created in advance may be used. On the other hand, a service unit may be extracted.

【００４０】[0040]

【実施例】図７は、本発明による利用履歴からサービス
単位を抽出する方法及び装置が適用されたユーザ履歴収
集システムの概要図である。本システムにおいて、情報
端末１００は、実際のユーザのマシンであり、情報サー
バ１２１，１２２，１２３．．．からのサービス利用等
に用いるＷＷＷブラウザ１０２が設けられている。この
情報端末１００にユーザのＷＷＷブラウザ１０２を監視
する監視・履歴情報作成部１０４を設けることにより、
ＷＷＷブラウザの挙動を監視する。ＷＷＷブラウザを監
視することによって、得られたユーザ履歴がハイパーリ
ンク又はブックマーク移動のいずれによって得られた
か、等の判別が行える。FIG. 7 is a schematic diagram of a user history collection system to which a method and apparatus for extracting a service unit from a use history according to the present invention are applied. In this system, the information terminal 100 is an actual user's machine, and the information servers 121, 122, 123. . . A WWW browser 102 is provided for use of services from the Internet. By providing the information terminal 100 with a monitoring / history information creation unit 104 for monitoring the user's WWW browser 102,
Monitor the behavior of the WWW browser. By monitoring the WWW browser, it is possible to determine whether the obtained user history is obtained by a hyperlink or a bookmark move.

【００４１】この監視・履歴情報作成部１０４によって
取得されたユーザ履歴は、ネットワーク１３０を通じて
センタサーバ１１０のユーザ履歴管理部１１２に送ら
れ、ユーザ履歴データベース１１４に蓄積される。ユー
ザ履歴データベース１１４に蓄積されたユーザ履歴デー
タは、例えば、統計処理などの分析プロセス２１０を経
て、ユーザ行動の分析、ＷＷＷ上での視聴率調査等に使
用される。The user history acquired by the monitoring / history information creation unit 104 is sent to the user history management unit 112 of the center server 110 via the network 130 and stored in the user history database 114. The user history data stored in the user history database 114 is used, for example, for analysis of user behavior, audience rating survey on the WWW, and the like through an analysis process 210 such as statistical processing.

【００４２】本発明による利用履歴からサービス単位を
抽出する方法及び装置は、このようなユーザ履歴データ
の分析プロセスの前処理段階の一部２００に適用され
る。図８は、分析プロセスの前処理段階に適用されるサ
ービス分類システムの概要説明図である。データベース
１１４から取り出されたユーザ履歴（又はユーザログ履
歴）に対し、本発明による利用履歴からのサービス単位
の抽出処理を適用し、サービス単位（候補）を抽出する
（ステップ２００）。次に、ステップ２００で得られた
サービス単位の中で、不要なサービス単位を削除し（ス
テップ２０２）、ＵＲＬ別にユーザ履歴に関するルール
を作成する（ステップ２０４）。かくして作成されたル
ールをユーザ履歴に対して適用し、ユーザ履歴にサービ
ス単位という項目を付加する（ステップ２０６）。最後
に、サービス単位という項目が付加されたユーザ履歴を
データベース１１４に書き戻すことによりユーザ履歴デ
ータベースが更新され、更なる分析プロセスに利用され
る。The method and apparatus for extracting the service unit from the usage history according to the present invention are applied to a part 200 of the preprocessing stage of the analysis process of the user history data. FIG. 8 is a schematic explanatory diagram of a service classification system applied to the preprocessing stage of the analysis process. The service unit (candidate) is extracted by applying the service unit extraction process from the usage history according to the present invention to the user history (or user log history) extracted from the database 114 (step 200). Next, of the service units obtained in step 200, unnecessary service units are deleted (step 202), and rules relating to the user history are created for each URL (step 204). The rules thus created are applied to the user history, and an item called a service unit is added to the user history (step 206). Finally, the user history to which the item of the service unit is added is written back to the database 114, so that the user history database is updated and used for a further analysis process.

【００４３】図７及び図８を参照して説明したシステム
は、予め収集されたユーザ履歴を処理するように構成さ
れているが、ユーザの情報端末に監視プロセスと共にサ
ービス単位を抽出するプロセスを組み込み、情報端末側
でリアルタイムに抽出処理されたサービス単位の情報を
後でセンタサーバに集めてもよい。さらに、リアルタイ
ムに得られたサービス単位を分類プロセスと組み合わせ
て、ユーザに対しフィードバックするように構成するこ
とも可能である。これにより、例えば、ユーザが現時に
ブラウジングしようとしているサービスが、今までブラ
ウジングしたことのない新しいサービス単位であるか否
かを画面上でユーザに通知できるようになる。The system described with reference to FIGS. 7 and 8 is configured to process a previously collected user history, but incorporates a process of extracting a service unit together with a monitoring process in a user's information terminal. Alternatively, information of service units extracted and processed in real time on the information terminal side may be collected later in the center server. Further, the service unit obtained in real time may be combined with the classification process and configured to provide feedback to the user. Thus, for example, it is possible to notify the user on the screen whether or not the service that the user is currently browsing is a new service unit that has not been browsed so far.

【００４４】また、利用履歴からのサービス単位抽出シ
ステムの構成は、上記の実施例で説明された例に限定さ
れることなく、サービス単位抽出システムの各々の構成
要件をソフトウェア（プログラム）で構築し、ディスク
装置等に格納しておき、必要に応じてサービス単位抽出
システムのコンピュータにインストールしてサービス単
位を抽出することも可能である。さらに、構築されたプ
ログラムをフロッピーディスクやＣＤ−ＲＯＭ等の可搬
記録媒体に格納し、このようなシステムを用いる場面で
汎用的に使用することも可能である。Further, the configuration of the service unit extraction system from the usage history is not limited to the example described in the above embodiment, and each component requirement of the service unit extraction system is constructed by software (program). It is also possible to extract the service unit by storing it in a disk unit or the like and installing it on a computer of the service unit extraction system as needed. Further, the constructed program can be stored in a portable recording medium such as a floppy disk or a CD-ROM, and can be used for general purposes in a case where such a system is used.

【００４５】本発明は、上記の実施例に限定されること
なく、特許請求の範囲内で種々変更・応用が可能であ
る。The present invention is not limited to the above embodiment, but can be variously modified and applied within the scope of the claims.

【００４６】[0046]

【発明の効果】以上、説明した通り、本発明によれば、
ＷＷＷサービスのように分散した多数のマシンによりサ
ービスが提供され、サービス媒体の数が多く、サービス
媒体の変化が激しい場合であっても、ユーザのサービス
利用履歴からサービス単位が自動的に抽出されるので、
抽出されたサービス単位に基づいて、広告・マーケッテ
ィング媒体としての価値を有するサービス媒体の状況を
把握し、サービス毎の利用状況を客観的に知ることがで
きるようになる。As described above, according to the present invention,
Services are provided by a large number of distributed machines such as WWW services, the number of service media is large, and even when the service media changes drastically, service units are automatically extracted from the user's service usage history. So
Based on the extracted service units, the status of service media having value as an advertisement / marketing media can be grasped, and the usage status of each service can be objectively known.

[Brief description of the drawings]

【図１】本発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】本発明のサービス単位抽出装置の構成図であ
る。FIG. 2 is a configuration diagram of a service unit extraction device of the present invention.

【図３】本発明の第１の実施例のサービス単位抽出シス
テムの構成図である。FIG. 3 is a configuration diagram of a service unit extraction system according to the first embodiment of this invention.

【図４】本発明の第１の実施例によるサービス単位抽出
の具体例の説明図である。FIG. 4 is an explanatory diagram of a specific example of service unit extraction according to the first embodiment of the present invention.

【図５】本発明の第１の実施例によるサービス単位抽出
の処理フローチャートである。FIG. 5 is a processing flowchart of service unit extraction according to the first embodiment of the present invention.

【図６】本発明の第２の実施例のサービス単位抽出シス
テムの構成図である。FIG. 6 is a configuration diagram of a service unit extraction system according to a second embodiment of this invention.

【図７】ユーザ履歴収集システムの概要図である。FIG. 7 is a schematic diagram of a user history collection system.

【図８】サービス単位分類システムの概要説明図であ
る。FIG. 8 is a schematic explanatory diagram of a service unit classification system.

【図９】ＷＷＷサービスの概要説明図である。FIG. 9 is a schematic explanatory diagram of a WWW service.

【図１０】サービス単位の説明図である。FIG. 10 is an explanatory diagram of a service unit.

[Explanation of symbols]

４ブラウザ６情報サーバ１０利用履歴データ作成手段２０利用履歴データ記憶手段３０情報アドレス取得手段４０コンピュータ名取得手段５０サービス単位抽出手段６０サービス単位データ記憶手段 4 Browser 6 Information Server 10 Usage History Data Creation Means 20 Usage History Data Storage Means 30 Information Address Acquisition Means 40 Computer Name Acquisition Means 50 Service Unit Extraction Means 60 Service Unit Data Storage Means

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号ＦＩＧ０６Ｆ 15/419 ３２０ (72)発明者本橋健東京都新宿区西新宿三丁目19番２号日本電信電話株式会社内────────────────────────────────────────────────── ─── Continued on the front page (51) Int.Cl. ⁶ Identification code FIG06F 15/419 320 (72) Inventor Ken Motobashi 3-19-2 Nishishinjuku, Shinjuku-ku, Tokyo Nippon Telegraph and Telephone Corporation

Claims

[Claims]

When the computer name of an information address before and after consecutively referenced information included in a use history of an information service referred to while tracing information linked by a hyperlink is different, the subsequent information address is assigned to a service unit. A method of extracting a service unit from a service usage history, characterized in that the service unit is determined as a base point of the service.

2. An information address including a computer name and a directory name for providing information used in the information service, and an item for identifying whether or not the information was linked by a hyperlink from information used immediately before. The information addresses before and after that are successively referenced in the usage history data including and linked by a hyperlink are acquired, and the computer names included in the acquired information addresses before and after match. It is determined whether or not they match, and if they do not match, the computer name and directory name included in the subsequent information address are extracted as service units in the information service, and the service unit is extracted from the service use history. how to.

3. Usage history data including an information address of information used in the information service system and an item that can identify whether the information has been linked by a hyperlink from information used immediately before is stored. Means for acquiring information addresses before and after successively referred to in the usage history data and linked by a hyperlink, and included in the acquired information addresses before and after Means for comparing the acquired information addresses before and after to determine whether or not the computer names match, and using the computer name and the directory name included in the information address after the above as a service unit in the information service system Means for extracting and storing the service from the service usage history. A unit that extracts unit.

4. The apparatus for extracting a service unit from a service use history according to claim 3, further comprising means for monitoring a state in which information is used, creating the use history data, and storing the data in the storage means.

5. When the computer name of an information address before and after a continuous reference included in the use history of an information service referred to while tracing information linked by a hyperlink is different, a subsequent information address is used as a service unit. A recording medium for recording a program for extracting a service unit from a service use history, characterized by comprising a process of determining the service unit as a base point.

6. An information address including a computer name and a directory name for providing information used in the information service, and an item for identifying whether the information was linked by a hyperlink from information used immediately before. The process of acquiring information addresses before and after consecutively referenced in the usage history data including and linked by a hyperlink, and the computer name included in the acquired information addresses before and after match And a process of extracting a computer name and a directory name included in the subsequent information address as a service unit in the information service when it is determined that they do not match. Write a program to extract service units from service usage history The recording medium.

7. Usage history data including an information address of information used in an information service system and an item for identifying whether or not the information has been linked by a hyperlink from information used immediately before. A process of acquiring information addresses before and after successively referred to and linked by a hyperlink, and to determine whether or not the computer names included in the acquired information addresses before and after match. Using the service, characterized by comprising: a process of comparing the obtained information addresses before and after the obtained information address; and a process of extracting a computer name and a directory name included in the obtained information address as a service unit in the information service system. A record of a program that extracts service units from the history Media.