CN102622358A - Method and system for information searching - Google Patents

Method and system for information searching

Publication number
CN102622358A
Authority
CN
China
Prior art keywords
label
information
labels
client
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011100297758A
Other languages
Chinese (zh)
Inventor
伍昕
吴鹏
高晓光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TVMining Beijing Media Technology Co Ltd
Original Assignee
TVMining Beijing Media Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2011-01-27
Filing date
2011-01-27
Publication date
2012-08-01
Application filed by TVMining Beijing Media Technology Co Ltd
Priority to CN2011100297758A
Publication of CN102622358A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method and system for information searching. The method comprises the steps of: collecting information at first, and setting at least one label for each piece of information; dividing any two labels of each piece of information into a group, storing two labels of each group and a corresponding relationship of the two labels into a database, and setting counting values to count the frequency of occurrence of each group of labels; storing information corresponding to each label into the database; inputting keywords for information searching at a client end; searching a corresponding label in the database, obtaining all labels in the group with the corresponding label, using the obtained labels as first-stage labels, and sequencing the first-stage labels according to the counting values in a descending manner; and providing feedback of the corresponding label and the first-stage labels to the client end, and searching corresponding information via the client end according to the obtained labels. By adopting the technical scheme, the time cost for information acquisition can be saved, and the recognition of information from different perspectives can be strengthened.

Description

Method and system for searching information
Technical field
The present invention relates to the field of massive-information retrieval, and in particular to a method and system for searching information.
Background art
With the development of Internet technology, a vast amount of information appears every day in the world we live in, and it grows at an alarming rate. Such a mass of information can be overwhelming: faced with it, we often do not know where to start or where to stop, and by the time we have finished reading, new information has already arrived. We must also spend considerable effort identifying the internal connections between pieces of information, and the time this costs cannot be ignored.
In systems holding massive information, traditional search engines provide a one-way mode of search: finding, within the mass of information, the items that contain particular keywords. The process by which a search engine organizes information is called "building an index". A search engine must not only store the information it collects but also arrange it according to certain rules, so that it can find the required data quickly without scanning everything it has stored. The user sends a query to the search engine, which accepts the query and returns data to the user. A search engine constantly receives queries from a large number of users almost simultaneously; for each user's request it checks its own index, finds the data the user needs in a very short time, and returns it to the user.
The above technical solution relies on the knowledge network that the user of the system has built through his or her own exploration. When a user searches for related knowledge by entering specific labels or keywords and wants to look at a knowledge point from other angles, the information obtained is bound to be incomplete, because it is limited by the user's own knowledge network. When the user then tries other keywords that come to mind, the search system returns a large amount of useless information and causes information overload, so the user cannot systematically form a knowledge network of his or her own.
Summary of the invention
The object of the present invention is to provide a method and system for searching information that reduce the time cost of obtaining information and strengthen the understanding of information from different angles.
To achieve this object, the present invention adopts the following technical solution:
A method for searching information comprises the following steps:
A. collecting information and setting at least one label for each piece of information, the labels being used to identify the information;
B. dividing any two labels of each piece of information into one group, storing the two labels of each group and the correspondence between them in a database, and setting a count value that counts the number of times each group of labels occurs in the database;
C. storing the information corresponding to each label in the database;
D. entering, at a client, a keyword for searching information;
E. searching the database for the label corresponding to the keyword, obtaining all labels that share a group with the corresponding label as first-level labels of the corresponding label, and sorting the first-level labels by count value in descending order;
F. feeding the corresponding label and the first-level labels back to the client, the client searching for the corresponding information according to the labels obtained.
In step E, for all first-level labels, all labels that share a group with each first-level label are obtained as second-level labels of the corresponding label, and the second-level labels are sorted by count value in descending order.
The client presets a value as the number of label levels to obtain for the corresponding label.
All labels obtained are formed into a label network centered on the corresponding label, and the label network is fed back to the client.
In addition, the client selects a label, and steps E and F are repeated.
A system for searching information comprises a label indexing unit, a database, a label mining unit and a client. The database is connected to the label indexing unit and to the label mining unit, and the client is connected to the label mining unit. The label indexing unit collects the labels of information; the database stores the label groups, the information corresponding to each label, and the number of times each label group occurs; the label mining unit obtains the corresponding labels from the database according to the keyword entered at the client, organizes them into a label network and feeds the network back to the client; the client is used to enter keywords, select keywords, and receive the label network fed back by the label mining unit.
With the technical solution of the present invention, information that was originally separate, scattered and unrelated is reorganized in a meaningful way across a large body of information by establishing the internal relationships among its labels. When a user enters a keyword into the system, the knowledge network related to that keyword is obtained automatically. Every knowledge point related to the keyword reflects a relationship mined from the massive information, and the larger the amount of information, the more accurate the relationships. Each knowledge point can be explored further without limit, which gives users a more valuable reference for building a more comprehensive knowledge network of their own and also improves the accuracy of the related information obtained.
Description of the drawings
Fig. 1 is a schematic diagram of the system architecture for searching information in an embodiment of the present invention.
Fig. 2 is a flow chart of searching information in an embodiment of the present invention.
Embodiment
The technical solution of the present invention is further described below with reference to the accompanying drawings and through an embodiment.
The main idea of the technical solution is to label each piece of information: the essence of the information is uniquely identified by several brief phrases, so that after processing, massive information forms a huge label-based knowledge network. Each node in the network is a label, and every pair of labels has a weighted relationship that is used to judge their similarity. Each label also carries the specific information associated with it, so that a three-dimensional knowledge network is formed in which labels and labels, labels and information, and information and information are closely connected. This huge network is also dynamic: as each new label is added, the relationships between nodes are adjusted accordingly, so the network grows by itself.
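The label network described above can be sketched as a small in-memory structure. This is only an illustration under stated assumptions: the class name `LabelNetwork`, its method names, and the use of Python dictionaries in place of the database are hypothetical and do not come from the patent. The co-occurrence count of a pair of labels stands in for the weight between two nodes.

```python
from collections import Counter, defaultdict
from itertools import combinations


class LabelNetwork:
    """Hypothetical in-memory stand-in for the label database."""

    def __init__(self):
        # frozenset({label_a, label_b}) -> number of items carrying both labels
        self.pair_counts = Counter()
        # label -> list of items carrying that label
        self.items_by_label = defaultdict(list)

    def add_item(self, item, labels):
        """Register one piece of information and update the network."""
        unique = sorted(set(labels))
        for label in unique:
            self.items_by_label[label].append(item)
        # Every two labels of the same item form one group; each
        # occurrence of a group raises its count value by 1.
        for a, b in combinations(unique, 2):
            self.pair_counts[frozenset((a, b))] += 1

    def weight(self, a, b):
        """Count value of the group {a, b}; 0 if the two never co-occur."""
        return self.pair_counts[frozenset((a, b))]


net = LabelNetwork()
net.add_item("doc1", ["search", "index", "label"])
net.add_item("doc2", ["search", "label"])
print(net.weight("search", "label"))
```

Because `add_item` only increments counters, the structure can keep growing as new information arrives, which matches the dynamic behaviour described in the paragraph above.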
Fig. 1 is a schematic diagram of the system architecture for searching information in an embodiment of the present invention. As shown in Fig. 1, the system comprises a label indexing unit 101, a database 102, a label mining unit 103 and a client 104. The database is connected to the label indexing unit and to the label mining unit, and the client is connected to the label mining unit.
The label indexing unit collects the labels of information. The database stores the label groups, the information corresponding to each label, and the number of times each label group occurs. The label mining unit obtains the corresponding labels from the database according to the keyword entered at the client, organizes them into a label network and feeds the network back to the client. The client is used to enter keywords, select keywords, and receive the label network fed back by the label mining unit.
Fig. 2 is a flow chart of searching information in an embodiment of the present invention. As shown in Fig. 2, the flow comprises the following steps:
Step 201: collect massive information and set multiple labels for each piece of information to identify it.
Step 202: divide any two labels of each piece of information into one group, store the two labels of each group and the correspondence formed between them in the database, and set a count value that counts the number of times each group of labels occurs in the database; each occurrence increases the count value by 1.
Step 203: store the information corresponding to each label in the database as well.
Step 204: the client enters a keyword for searching information.
Step 205: search the database for the label corresponding to the keyword, obtain all labels that share a group with the corresponding label as first-level labels of the corresponding label, and sort the first-level labels by count value in descending order. The larger the count value, the closer the relationship between the two labels.
The client can preset a value as the number of label levels to obtain for the corresponding label. For example, if the value is 2, then for all first-level labels, all labels that share a group with each first-level label are obtained as second-level labels of the corresponding label, and the second-level labels are sorted by count value in descending order.
If the value is 3, all labels that share a group with each second-level label are obtained in the same way as third-level labels of the corresponding label, and the third-level labels are sorted by count value in descending order.
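The level-by-level expansion of steps 202 and 205 can be sketched as follows. The function names `build_pair_counts`, `neighbours` and `expand` are hypothetical, as is the sample data. The text does not say what to do when a label is reachable at more than one level, so this sketch assumes each label is reported only once, at the first level where it appears, and that ties in count value are broken alphabetically.

```python
from collections import Counter
from itertools import combinations


def build_pair_counts(labelled_items):
    """Step 202: count how often each group of two labels occurs."""
    counts = Counter()
    for labels in labelled_items:
        for a, b in combinations(sorted(set(labels)), 2):
            counts[(a, b)] += 1
    return counts


def neighbours(counts, label):
    """Labels sharing a group with `label`, sorted by count, descending."""
    found = {}
    for (a, b), n in counts.items():
        if a == label:
            found[b] = n
        elif b == label:
            found[a] = n
    return sorted(found.items(), key=lambda kv: (-kv[1], kv[0]))


def expand(counts, root, levels):
    """Return {level: [(parent, label, count), ...]} for levels 1..levels."""
    seen = {root}
    frontier = [root]
    result = {}
    for level in range(1, levels + 1):
        candidates = []
        for parent in frontier:
            for label, n in neighbours(counts, parent):
                if label not in seen:
                    candidates.append((parent, label, n))
        # Sort the whole level by count value, largest first.
        candidates.sort(key=lambda c: (-c[2], c[1], c[0]))
        entries = []
        for parent, label, n in candidates:
            if label not in seen:  # assumption: keep first appearance only
                seen.add(label)
                entries.append((parent, label, n))
        if not entries:
            break
        result[level] = entries
        frontier = [label for _, label, _ in entries]
    return result


items = [
    ["tv", "news", "sports"],
    ["tv", "news"],
    ["news", "politics"],
    ["sports", "football"],
    ["sports", "football"],
]
counts = build_pair_counts(items)
print(expand(counts, "tv", 2))
```

With the preset value 2, the sample data yields "news" and "sports" as first-level labels of "tv", and "football" and "politics" as second-level labels.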
Step 206: feed the corresponding label and the first-level labels back to the client, or form all the labels obtained into a label network centered on the corresponding label and feed that network back to the client. The client then searches for the corresponding information according to the labels obtained.
In addition, a label can be selected at the client, and steps 205 and 206 are repeated for that label to obtain the knowledge network related to it. As long as there is enough information, every label can be explored in this way without limit.
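The feedback and re-selection loop of steps 205 and 206 might look like the sketch below. The names `PAIR_COUNTS`, `ITEMS_BY_LABEL` and `search`, and the sample data, are hypothetical. The sketch also assumes that a keyword corresponds to a label by exact match, which the text does not specify.

```python
# Hypothetical sample data standing in for the database contents.
PAIR_COUNTS = {
    frozenset(("tv", "news")): 2,
    frozenset(("tv", "sports")): 1,
    frozenset(("news", "politics")): 3,
}
ITEMS_BY_LABEL = {
    "tv": ["item-1", "item-2"],
    "news": ["item-1", "item-2", "item-3"],
    "sports": ["item-1"],
    "politics": ["item-3"],
}


def search(keyword):
    """Return the matched label, its ranked first-level labels and its items.

    Assumption: the keyword matches a label only if it is identical to it.
    Returns None when no label corresponds to the keyword.
    """
    if keyword not in ITEMS_BY_LABEL:
        return None
    related = {}
    for pair, count in PAIR_COUNTS.items():
        if keyword in pair:
            (other,) = pair - {keyword}
            related[other] = count
    ranked = sorted(related, key=lambda label: (-related[label], label))
    return {
        "label": keyword,
        "first_level": ranked,
        "items": list(ITEMS_BY_LABEL[keyword]),
    }


# The client enters a keyword, then selects one of the returned labels
# and the same lookup is repeated with that label as the new centre.
first = search("tv")
second = search(first["first_level"][0])
print(first["first_level"], second["first_level"])
```

Each call re-centres the result on the selected label, so repeated selection walks through the network one neighbourhood at a time.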
The above is merely a preferred embodiment of the present invention, but the scope of protection of the present invention is not limited to it. Any variation or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the scope of protection of the present invention. The scope of protection of the present invention shall therefore be determined by the scope of protection of the claims.

Claims (6)

1. A method for searching information, characterized by comprising the following steps:
A. collecting information and setting at least one label for each piece of information, the labels being used to identify the information;
B. dividing any two labels of each piece of information into one group, storing the two labels of each group and the correspondence between them in a database, and setting a count value that counts the number of times each group of labels occurs in the database;
C. storing the information corresponding to each label in the database;
D. entering, at a client, a keyword for searching information;
E. searching the database for the label corresponding to the keyword, obtaining all labels that share a group with the corresponding label as first-level labels of the corresponding label, and sorting the first-level labels by count value in descending order;
F. feeding the corresponding label and the first-level labels back to the client, the client searching for the corresponding information according to the labels obtained.
2. The method for searching information according to claim 1, characterized in that, in step E, for all first-level labels, all labels that share a group with each first-level label are obtained as second-level labels of the corresponding label, and the second-level labels are sorted by count value in descending order.
3. The method for searching information according to claim 2, characterized in that the client presets a value as the number of label levels to obtain for the corresponding label.
4. The method for searching information according to claim 2 or 3, characterized in that all labels obtained are formed into a label network centered on the corresponding label, and the label network is fed back to the client.
5. The method for searching information according to any one of claims 1 to 3, characterized in that the client selects a label, and steps E and F are repeated.
6. A system for searching information, characterized by comprising a label indexing unit, a database, a label mining unit and a client, the database being connected to the label indexing unit and to the label mining unit, and the client being connected to the label mining unit; wherein the label indexing unit is used to collect the labels of information; the database is used to store the label groups, the information corresponding to each label, and the number of times each label group occurs; the label mining unit is used to obtain the corresponding labels from the database according to the keyword entered at the client, organize them into a label network and feed the network back to the client; and the client is used to enter keywords, select keywords, and receive the label network fed back by the label mining unit.
CN2011100297758A 2011-01-27 2011-01-27 Method and system for information searching Pending CN102622358A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011100297758A CN102622358A (en) 2011-01-27 2011-01-27 Method and system for information searching

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011100297758A CN102622358A (en) 2011-01-27 2011-01-27 Method and system for information searching

Publications (1)

Publication Number Publication Date
CN102622358A true CN102622358A (en) 2012-08-01

Family

ID=46562281

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011100297758A Pending CN102622358A (en) 2011-01-27 2011-01-27 Method and system for information searching

Country Status (1)

Country Link
CN (1) CN102622358A (en)

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4024906B2 (en) * 1997-09-08 2007-12-19 株式会社東芝 Tagged document search system
CN1648901A (en) * 2005-02-03 2005-08-03 中国科学院计算技术研究所 Method and system for large scale keyboard matching
CN101192220A (en) * 2006-11-21 2008-06-04 财团法人资讯工业策进会 Label construction method and system
CN101114295A (en) * 2007-08-11 2008-01-30 腾讯科技(深圳)有限公司 Method for searching on-line advertisement resource and device thereof
CA2666016A1 (en) * 2008-05-15 2009-11-15 Mathieu Audet Method for building a search algorithm and method for linking documents with an object
CN101458708A (en) * 2008-12-05 2009-06-17 北京大学 Searching result clustering method and device
KR20100071359A (en) * 2008-12-19 2010-06-29 한국전자통신연구원 Apparatus and method for information search on the basis of tag and method for tag management
CN101876999A (en) * 2009-12-04 2010-11-03 中国人民解放军信息工程大学 Method for generating fax indexes, message analysis device and fax retrieval system

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103810544A (en) * 2012-11-06 2014-05-21 金蝶软件(中国)有限公司 Method and correlative apparatus for acquiring skill label
CN103279513A (en) * 2013-05-22 2013-09-04 百度在线网络技术(北京)有限公司 Method for generating content label and method and device for providing multi-media content information
CN103279513B (en) * 2013-05-22 2017-03-01 百度在线网络技术(北京)有限公司 The method of generation content tab is, provide the method and device of multimedia content information
CN104239314A (en) * 2013-06-09 2014-12-24 天津海量信息技术有限公司 Search word expanding method and system
CN104239314B (en) * 2013-06-09 2018-01-19 天津海量信息技术股份有限公司 A kind of method and system of query expansion word
CN103366060A (en) * 2013-07-10 2013-10-23 江苏省电力设计院 Method for generating three-dimensional design electrical cross-section diagram equipment material table of transformer substation
CN103366060B (en) * 2013-07-10 2016-12-28 中国能源建设集团江苏省电力设计院有限公司 The generation method of three-dimensional design electrical cross-section diagram equipment material table of transformer substation
CN105282177A (en) * 2015-11-16 2016-01-27 上海晶赞科技发展有限公司 Safe and controllable transmission method of audience data
CN107291930A (en) * 2017-06-29 2017-10-24 环球智达科技(北京)有限公司 The computational methods of weight number

Similar Documents

Publication Publication Date Title
CN101055585B (en) System and method for clustering documents
CN105701216B (en) A kind of information-pushing method and device
CN101408886B (en) Selecting tags for a document by analyzing paragraphs of the document
CN104899273B (en) A kind of Web Personalization method based on topic and relative entropy
CN102236663B (en) Query method, query system and query device based on vertical search
CN101408887B (en) Recommending terms to specify body space
CN103970871B (en) File metadata querying method and system based on information of tracing to the source in storage system
CN103226618B (en) The related term extracting method excavated based on Data Mart and system
CN104008106B (en) A kind of method and device obtaining much-talked-about topic
CN102306176B (en) On-line analytical processing (OLAP) keyword query method based on intrinsic characteristic of data warehouse
CN105335402B (en) Searching method, index data generation method and device based on static Cache
CN108170692A (en) A kind of focus incident information processing method and device
CN102622358A (en) Method and system for information searching
CN104021125B (en) A kind of method, system and a kind of search engine of search engine sequence
CN103294692B (en) A kind of information recommendation method and system
CN107291895B (en) Quick hierarchical document query method
Martin et al. A framework for business intelligence application using ontological classification
CN106503175A (en) The inquiry of Similar Text, problem extended method, device and robot
CN105787097A (en) Distributed index establishment method and system based on text clustering
CN104699786A (en) Communication network complaint system for semantic intelligent search
CN104050235A (en) Distributed information retrieval method based on set selection
KR20150018880A (en) Information aggregation, classification and display method and system
CN110795613B (en) Commodity searching method, device and system and electronic equipment
CN104615734B (en) A kind of community management service big data processing system and its processing method
CN111144831B (en) Accurate selection screening system and method suitable for recruitment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20120801