CN102622358A - Method and system for information searching - Google Patents
Method and system for information searching Download PDFInfo
- Publication number
- CN102622358A CN102622358A CN2011100297758A CN201110029775A CN102622358A CN 102622358 A CN102622358 A CN 102622358A CN 2011100297758 A CN2011100297758 A CN 2011100297758A CN 201110029775 A CN201110029775 A CN 201110029775A CN 102622358 A CN102622358 A CN 102622358A
- Authority
- CN
- China
- Prior art keywords
- label
- information
- labels
- client
- database
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a method and system for information searching. The method comprises the steps of: collecting information at first, and setting at least one label for each piece of information; dividing any two labels of each piece of information into a group, storing two labels of each group and a corresponding relationship of the two labels into a database, and setting counting values to count the frequency of occurrence of each group of labels; storing information corresponding to each label into the database; inputting keywords for information searching at a client end; searching a corresponding label in the database, obtaining all labels in the group with the corresponding label, using the obtained labels as first-stage labels, and sequencing the first-stage labels according to the counting values in a descending manner; and providing feedback of the corresponding label and the first-stage labels to the client end, and searching corresponding information via the client end according to the obtained labels. By adopting the technical scheme, the time cost for information acquisition can be saved, and the recognition of information from different perspectives can be strengthened.
Description
Technical field
The present invention relates to magnanimity information retrieval technique field, relate in particular to a kind of method and system of search information.
Background technology
By the development of Internet technology, great deal of information occurred in this world that we lived in every day, and the growth rate of information definitely is one and is close to terrified thing.Magnanimity information can let us has the fidgets, because in the face of great deal of information, we do not know to begin from what to finish from what sometimes, and when these information are finished by reading, new information has been come again.We also will spend bigger energy simultaneously and come the identifying information inner link, and consumption also can not be ignored to time cost.
In the system of magnanimity information, traditional search engine has solved people include particular keywords at magnanimity information unidirectional information searching mode.The process of search engine organize your messages is called " setting up index ".Search engine not only will be preserved and collect the information of getting up, and also will they be carried out layout according to certain rule.Like this, search engine does not find desired data rapidly with thumbing the information of its all preservation again.The user sends inquiry to search engine, and search engine is accepted inquiry and returned data to the user.Search engine all to receive all the time from a large number of users almost be the inquiry of sending simultaneously, it finds the data of user's needs at the utmost point, and returns to the user according to the own index of each user's requirement inspection in the short time.
Technique scheme is to be based upon on the knowledge point network that the people of using system itself formed to exploration of knowledge; When the people of using system searches for relevant knowledge information through importing some specific labels or keyword; When he seeks out when other angles are treated this knowledge point; Owing to be subject to own knowledge network, his getable information must be incomplete.When he attempts the keyword with other conception, face search system again and feed back to a large amount of garbages and cause information overload, also just can't system form the knowledge network of oneself.
Summary of the invention
The objective of the invention is to propose a kind of method and system of search information, can save the time cost of the information of obtaining, strengthen from different perspectives understanding information.
For reaching this purpose, the present invention adopts following technical scheme:
A kind of method of search information may further comprise the steps:
A, acquisition of information are no less than 1 label to each bar information setting, are used for identification information;
B, any two labels of each bar information are divided into one group, and with two labels of each group and between corresponding relation store in the database, and count value is set each group label occurrence number in the database is counted;
C, information stores that each label is corresponding are in database;
D, client input are used for the keyword of search information;
E, according to the corresponding label in the keyword search database, obtain and the whole labels of corresponding label branch at one group, as the first order label of corresponding label, and first order label sorted according to count value from big to small;
F, give client with corresponding label and first order tag feedback, client is according to the label that obtains, the information that search is corresponding.
In the step e,, obtain and the whole labels of each first order label branch,, and second level label sorted according to count value from big to small as the second level label of corresponding label at one group for whole first order labels.
The client default value is as the label progression that obtains corresponding label.
With the whole labels that obtain is the center with the corresponding label, forms label network, feeds back to client.
In addition, client is selected a label, repeating step E and step F.
A kind of system of search information; Comprise that tab indexes unit, database, label excavate unit and client, database excavates the unit with tab indexes unit and label respectively and is connected, and client is excavated the unit with label and is connected; Wherein, the tab indexes unit is used for the label of acquisition of information; Database is used for the information of storage tags group, label correspondence and the number of times that set of tags occurs; Label excavates the unit and is used for obtaining corresponding label according to the keyword of client input from database, and is organized into label network and feeds back to client; Client is used to import keyword, selects keyword, and receives the label network that label excavates the unit feedback.
Adopted technical scheme of the present invention, the not related information of original separate dispersion, through foundation to the information labels internal relation; Thereby organize information is significant again in big information aspect; When people import keyword in system, can access the knowledge network relevant automatically with this keyword, each relevant with it knowledge point all is the relation of in magnanimity information, excavating; The big more relation of quantity of information is also just accurate more; Each knowledge point can infinitely be explored down, and providing more at the more comprehensively knowledge network of setting up oneself for people, valuable reference also makes the accuracy that obtains relevant information higher simultaneously.
Description of drawings
Fig. 1 is the system architecture synoptic diagram of search information in the specific embodiment of the invention.
Fig. 2 is the process flow diagram of search information in the specific embodiment of the invention.
Embodiment
Further specify technical scheme of the present invention below in conjunction with accompanying drawing and through embodiment.
The main thought of technical scheme of the present invention is through each information is carried out the label processing; Carry out unique identification to the part of the elite of this information through a plurality of brief phrases, will form a huge knowledge network after treatment when magnanimity information like this based on label.Each node is exactly a label in the network, exists the relation of a weight to judge similarity between them between every pair of label.Each label also comprises related specifying information, formed the three-dimensional knowledge network that a label and label, label and information, information and information are closely connected mutually like this.This giant grid also is a dynamic network simultaneously, and along with each new label adds, the relation between the node also can correspondingly be adjusted, and network is from growth and dynamic.
Fig. 1 is the system architecture synoptic diagram of search information in the specific embodiment of the invention.As shown in Figure 1, the system of this search information comprises that tab indexes unit 101, database 102, label excavate unit 103 and client 104, and database excavates the unit with tab indexes unit and label respectively and is connected, and client is excavated the unit with label and is connected.
Wherein, The label of tab indexes unit acquisition of information; The number of times that information that database storing set of tags, label are corresponding and set of tags occur, label excavate the unit and obtain corresponding label according to the keyword of client input from database, and are organized into label network and feed back to client; Client input keyword, selection keyword, and receive the label network that label excavates the unit feedback.
Fig. 2 is the process flow diagram of search information in the specific embodiment of the invention.As shown in Figure 2, the flow process of this search information may further comprise the steps:
The information of step 201, collection magnanimity to a plurality of labels of each bar information setting, is used to identify this information.
Step 202, any two labels of each bar information are divided into one group; And with two labels of each group and between the corresponding relation that forms store in the database; And count value is set each group label occurrence number in the database is counted, promptly occurring once, count value adds 1.
Step 203, the information that each label is corresponding also store in the database.
Step 204, client input are used for the keyword of search information.
Step 205, according to the corresponding label in the keyword search database; Obtain and the whole labels of corresponding label branch,, and first order label sorted according to count value from big to small as the first order label of corresponding label at one group; Count value is big more, representes that the relation between two labels is close more.
Client can be preset a numerical value, as the label progression that obtains corresponding label.For example this numerical value is 2, so for whole first order labels, obtains and the whole labels of each first order label branch at one group again, as the second level label of corresponding label, and second level label is sorted according to count value from big to small.
If this numerical value is 3, can also continue to the second season label remove to obtain branch at whole labels of one group, as the third level label of corresponding label, and third level label sorted according to count value from big to small.
Step 206, give client with corresponding label and first order tag feedback, whole labels that perhaps will obtain are the center with the corresponding label, form label network, feed back to client, and client is according to the label that obtains, the information that search is corresponding.
In addition, also can remove to select a label,, as long as information is abundant, just can ad infinitum explore down, like this to obtain the knowledge network relevant with this keyword to each label to this label repeating step 205 and step 206 through client.
The above; Be merely the preferable embodiment of the present invention, but protection scope of the present invention is not limited thereto, anyly is familiar with this technological people in the technical scope that the present invention disclosed; The variation that can expect easily or replacement all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion with the protection domain of claim.
Claims (6)
1. the method for a search information is characterized in that, may further comprise the steps:
A, acquisition of information are no less than 1 label to each bar information setting, are used for identification information;
B, any two labels of each bar information are divided into one group, and with two labels of each group and between corresponding relation store in the database, and count value is set each group label occurrence number in the database is counted;
C, information stores that each label is corresponding are in database;
D, client input are used for the keyword of search information;
E, according to the corresponding label in the keyword search database, obtain and the whole labels of corresponding label branch at one group, as the first order label of corresponding label, and first order label sorted according to count value from big to small;
F, give client with corresponding label and first order tag feedback, client is according to the label that obtains, the information that search is corresponding.
2. the method for a kind of search information according to claim 1; It is characterized in that; In the step e,, obtain and the whole labels of each first order label branch at one group for whole first order labels; As the second level label of corresponding label, and second level label sorted according to count value from big to small.
3. the method for a kind of search information according to claim 2 is characterized in that, the client default value is as the label progression that obtains corresponding label.
4. according to the method for claim 2 or 3 described a kind of search information, it is characterized in that, is the center with the corresponding label with the whole labels that obtain, and forms label network, feeds back to client.
5. according to the method for the described a kind of search information of arbitrary claim among the claim 1-3, it is characterized in that client is selected a label, repeating step E and step F.
6. the system of a search information; It is characterized in that; Comprise that tab indexes unit, database, label excavate unit and client, database excavates the unit with tab indexes unit and label respectively and is connected, and client is excavated the unit with label and is connected; Wherein, the tab indexes unit is used for the label of acquisition of information; Database is used for the information of storage tags group, label correspondence and the number of times that set of tags occurs; Label excavates the unit and is used for obtaining corresponding label according to the keyword of client input from database, and is organized into label network and feeds back to client; Client is used to import keyword, selects keyword, and receives the label network that label excavates the unit feedback.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100297758A CN102622358A (en) | 2011-01-27 | 2011-01-27 | Method and system for information searching |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2011100297758A CN102622358A (en) | 2011-01-27 | 2011-01-27 | Method and system for information searching |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102622358A true CN102622358A (en) | 2012-08-01 |
Family
ID=46562281
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2011100297758A Pending CN102622358A (en) | 2011-01-27 | 2011-01-27 | Method and system for information searching |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102622358A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103279513A (en) * | 2013-05-22 | 2013-09-04 | 百度在线网络技术(北京)有限公司 | Method for generating content label and method and device for providing multi-media content information |
CN103366060A (en) * | 2013-07-10 | 2013-10-23 | 江苏省电力设计院 | Method for generating three-dimensional design electrical cross-section diagram equipment material table of transformer substation |
CN103810544A (en) * | 2012-11-06 | 2014-05-21 | 金蝶软件(中国)有限公司 | Method and correlative apparatus for acquiring skill label |
CN104239314A (en) * | 2013-06-09 | 2014-12-24 | 天津海量信息技术有限公司 | Search word expanding method and system |
CN105282177A (en) * | 2015-11-16 | 2016-01-27 | 上海晶赞科技发展有限公司 | Safe and controllable transmission method of audience data |
CN107291930A (en) * | 2017-06-29 | 2017-10-24 | 环球智达科技(北京)有限公司 | The computational methods of weight number |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1648901A (en) * | 2005-02-03 | 2005-08-03 | 中国科学院计算技术研究所 | Method and system for large scale keyboard matching |
JP4024906B2 (en) * | 1997-09-08 | 2007-12-19 | 株式会社東芝 | Tagged document search system |
CN101114295A (en) * | 2007-08-11 | 2008-01-30 | 腾讯科技(深圳)有限公司 | Method for searching on-line advertisement resource and device thereof |
CN101192220A (en) * | 2006-11-21 | 2008-06-04 | 财团法人资讯工业策进会 | Label construction method and system |
CN101458708A (en) * | 2008-12-05 | 2009-06-17 | 北京大学 | Searching result clustering method and device |
CA2666016A1 (en) * | 2008-05-15 | 2009-11-15 | Mathieu Audet | Method for building a search algorithm and method for linking documents with an object |
KR20100071359A (en) * | 2008-12-19 | 2010-06-29 | 한국전자통신연구원 | Apparatus and method for information search on the basis of tag and method for tag management |
CN101876999A (en) * | 2009-12-04 | 2010-11-03 | 中国人民解放军信息工程大学 | Method for generating fax indexes, message analysis device and fax retrieval system |
-
2011
- 2011-01-27 CN CN2011100297758A patent/CN102622358A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4024906B2 (en) * | 1997-09-08 | 2007-12-19 | 株式会社東芝 | Tagged document search system |
CN1648901A (en) * | 2005-02-03 | 2005-08-03 | 中国科学院计算技术研究所 | Method and system for large scale keyboard matching |
CN101192220A (en) * | 2006-11-21 | 2008-06-04 | 财团法人资讯工业策进会 | Label construction method and system |
CN101114295A (en) * | 2007-08-11 | 2008-01-30 | 腾讯科技(深圳)有限公司 | Method for searching on-line advertisement resource and device thereof |
CA2666016A1 (en) * | 2008-05-15 | 2009-11-15 | Mathieu Audet | Method for building a search algorithm and method for linking documents with an object |
CN101458708A (en) * | 2008-12-05 | 2009-06-17 | 北京大学 | Searching result clustering method and device |
KR20100071359A (en) * | 2008-12-19 | 2010-06-29 | 한국전자통신연구원 | Apparatus and method for information search on the basis of tag and method for tag management |
CN101876999A (en) * | 2009-12-04 | 2010-11-03 | 中国人民解放军信息工程大学 | Method for generating fax indexes, message analysis device and fax retrieval system |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103810544A (en) * | 2012-11-06 | 2014-05-21 | 金蝶软件(中国)有限公司 | Method and correlative apparatus for acquiring skill label |
CN103279513A (en) * | 2013-05-22 | 2013-09-04 | 百度在线网络技术(北京)有限公司 | Method for generating content label and method and device for providing multi-media content information |
CN103279513B (en) * | 2013-05-22 | 2017-03-01 | 百度在线网络技术(北京)有限公司 | The method of generation content tab is, provide the method and device of multimedia content information |
CN104239314A (en) * | 2013-06-09 | 2014-12-24 | 天津海量信息技术有限公司 | Search word expanding method and system |
CN104239314B (en) * | 2013-06-09 | 2018-01-19 | 天津海量信息技术股份有限公司 | A kind of method and system of query expansion word |
CN103366060A (en) * | 2013-07-10 | 2013-10-23 | 江苏省电力设计院 | Method for generating three-dimensional design electrical cross-section diagram equipment material table of transformer substation |
CN103366060B (en) * | 2013-07-10 | 2016-12-28 | 中国能源建设集团江苏省电力设计院有限公司 | The generation method of three-dimensional design electrical cross-section diagram equipment material table of transformer substation |
CN105282177A (en) * | 2015-11-16 | 2016-01-27 | 上海晶赞科技发展有限公司 | Safe and controllable transmission method of audience data |
CN107291930A (en) * | 2017-06-29 | 2017-10-24 | 环球智达科技(北京)有限公司 | The computational methods of weight number |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101055585B (en) | System and method for clustering documents | |
CN105701216B (en) | A kind of information-pushing method and device | |
CN101408886B (en) | Selecting tags for a document by analyzing paragraphs of the document | |
CN104899273B (en) | A kind of Web Personalization method based on topic and relative entropy | |
CN102236663B (en) | Query method, query system and query device based on vertical search | |
CN101408887B (en) | Recommending terms to specify body space | |
CN103970871B (en) | File metadata querying method and system based on information of tracing to the source in storage system | |
CN103226618B (en) | The related term extracting method excavated based on Data Mart and system | |
CN104008106B (en) | A kind of method and device obtaining much-talked-about topic | |
CN102306176B (en) | On-line analytical processing (OLAP) keyword query method based on intrinsic characteristic of data warehouse | |
CN105335402B (en) | Searching method, index data generation method and device based on static Cache | |
CN108170692A (en) | A kind of focus incident information processing method and device | |
CN102622358A (en) | Method and system for information searching | |
CN104021125B (en) | A kind of method, system and a kind of search engine of search engine sequence | |
CN103294692B (en) | A kind of information recommendation method and system | |
CN107291895B (en) | Quick hierarchical document query method | |
Martin et al. | A framework for business intelligence application using ontological classification | |
CN106503175A (en) | The inquiry of Similar Text, problem extended method, device and robot | |
CN105787097A (en) | Distributed index establishment method and system based on text clustering | |
CN104699786A (en) | Communication network complaint system for semantic intelligent search | |
CN104050235A (en) | Distributed information retrieval method based on set selection | |
KR20150018880A (en) | Information aggregation, classification and display method and system | |
CN110795613B (en) | Commodity searching method, device and system and electronic equipment | |
CN104615734B (en) | A kind of community management service big data processing system and its processing method | |
CN111144831B (en) | Accurate selection screening system and method suitable for recruitment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20120801 |