版權(quán)說(shuō)明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
1、<p><b> 英文材料</b></p><p> Information management system</p><p> Wiliam K.Thomson U.S.A</p><p><b> Abstract:</b></p><p> An information
2、 storage, searching and retrieval system for large (gigabytes) domains of archived textual dam. The system includes multiple query generation processes, a search process, and a presentation of search results that is sort
3、ed by category or type and that may be customized based on the professional discipline(or analogous personal characteristic of the user), thereby reducing the amount of time and cost required to retrieve relevant results
4、. </p><p> Keyword:Information management Retrieval system Object-Oriented </p><p> 1.INTRUDUCTION</p><p> This invention relates to an information storage, searching and retri
5、eval system that incorporates a novel organization for presentation of search results from large (gigabytes) domains of archived textual data. </p><p> 2.BACKGROUDN OF THE INVENTION</p><p> On
6、-line information retrieval systems are utilized for searching and retrieving many kinds of information. Most systems used today work in essentially the same manner; that is, users log on (through a computer terminal or
7、personal microcomputer, and typically from a remote location), select a source of information (i.e., a particular database) which is usually something less than the complete domain, formulate a query, launch the search,
8、and then review the search results displayed on the termin</p><p> In many cases end users have been forced to use an intermediary (i.e., a professional searcher) because the current collections of sources
9、are both complex and extensive, and effective search strategies often vary significantly from one source to another. Even with such guidance, potential relevant answers are missed because all potentially relevant databas
10、es or information sources are not searched on every query. Much effort has been expended on refining and improving source selection by groupi</p><p> 3.SUMMARY OF THE INVENTION </p><p> The in
11、vention provides an information storage, searching and retrieval system for a large domain of archived data of various types, in which the results of a search are organized into discrete types of documents and groups of
12、document types so that users may easily identify relevant information more efficiently and more conveniently than systems currently in use. The system of the invention includes means for storing a large domain of data co
13、ntained in multiple source records, at least some of t</p><p> The query generation process may contain a knowledge base including a thesaurus that has predetermined and embedded complex search queries, or
14、use natural language processing, or fuzzy logic, or tree structures, or hierarchical relationship or a set of commands that allow persons seeking information to formulate their queries. </p><p> The search
15、process can utilize any index and search engine techniques including Boolean, vector, and probabilistic as long as a substantial portion of the entire domain of archived textual data is searched for each query and all do
16、cuments found are returned to the organizing process. </p><p> The sorting/categorization process prepares the search results for presentation by assembling the various document types retrieved by the searc
17、h engine and then arranging these basic document types into sometimes broader categories that are readily understood by and relevant to the user.The search results are then presented to the user and arranged by category
18、along with an indication as to the number of relevant documents found in each category. The user may then examine search results in multi</p><p> 4.BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagr
19、am illustrating an information retrieval system of the invention; </p><p> FIG. 2 is a diagram illustrating a query formulation and search process utilized in the invention; </p><p> FIG. 3 i
20、s a diagram illustrating a sorting process for organizing and presenting search results.</p><p> 5.BEST MODE FOR CARRYING OUT THE INVENTION </p><p> As is illustrated in the block diagram of
21、FIG. 1 , the information retrieval system of the invention includes an input/output process ,a query generation process, a search process that involves a large domain of textual data (typically in the multiple gigabyte r
22、ange), an organizing process, presentation of the information to the user, and a process to identify and characterize the types of documents contained in the large domain of data.</p><p> Turning now to FIG
23、. 2, the query generation process preferably includes a knowledge base containing a thesaurus and a note pad, and preferably utilizes embedded predefined complex Boolean strategies. Such a system allows the user to enter
24、 their description of the information needed using simple words/phrases made up of "natural" language and to rely on the system to assist in generating the full search query, which would include, e.g., synonyms
25、 and alternate phraseology. The user can then reques</p><p> FIG. 3 illustrates how five typical sources of information (i.e., source records) can be sorted into many document types and then subsequently in
26、to categories. For example, a typical trade magazine may contain several types of information such as editorials, regular columns, feature articles, news, product announcements, and a calendar of events. Thus, the trade
27、magazine (i.e., the source record) may be sorted into these various document types, and these document types in turn may be categorized</p><p> The information storage, searching and retrieval system of the
28、 invention resolves the common difficulties in typical on-line information retrieval systems that operate on large (e.g., 2 gigabytes or more) domains of textual data, query generation, source selection, and organizing s
29、earch results. The information base with the thesaurus and embedded search strategies allows users to generate expert search queries in their own "natural" language. Source (i.e., database) selection is not an
30、issue be</p><p> While a preferred embodiment of the present invention has been described, it should be understood that various changes, adaptations and modifications may be made therein without departing f
31、rom the spirit of the invention and the scope of the appended claims.</p><p><b> 信息管理系統(tǒng)</b></p><p> Wiliam K.Thomson U.S.A</p><p><b> 摘要:</b></p>
32、<p> 一個(gè)信息存儲(chǔ),查詢和檢索系統(tǒng)主要應(yīng)用于大(千兆字節(jié))的需要存檔的文字領(lǐng)域。該系統(tǒng)包括多個(gè)查詢產(chǎn)生過(guò)程和一個(gè)搜索過(guò)程。而查詢的結(jié)果一般是按類別和類型進(jìn)行排序的,檢索字段是由個(gè)人決定的,在查詢的過(guò)程中,可能基于這個(gè)搜索結(jié)果查看到多個(gè)相關(guān)的信息(或類似的用戶個(gè)人特點(diǎn)介紹),從而減少了搜索結(jié)果是所需的時(shí)間和費(fèi)用。</p><p> 關(guān)鍵詞:信息管理;檢索系統(tǒng);面向?qū)ο?lt;/p><
33、;p><b> 簡(jiǎn)介</b></p><p> 信息的存儲(chǔ),查詢和檢索系統(tǒng),主要應(yīng)用原文檔數(shù)據(jù)比較大的文檔,利用搜索條件和索引字段可以快速查詢結(jié)果。</p><p><b> 開(kāi)發(fā)背景</b></p><p> 網(wǎng)上查詢系統(tǒng)主要用于查詢和檢索在線的各種各樣的信息。今天所使用的多數(shù)系統(tǒng)實(shí)際上采用的是同一方式。也
34、就是說(shuō),用戶登錄(通過(guò)計(jì)算機(jī)終端或個(gè)人微機(jī),或者是遠(yuǎn)程登錄),選擇一個(gè)信息源(比如一個(gè)特定的數(shù)據(jù)庫(kù)),通常是一些不完整的檢索條件,開(kāi)始查詢,啟動(dòng)搜索,然后查詢結(jié)果將顯示在計(jì)算機(jī)終端或個(gè)人微機(jī)上,且查詢結(jié)果一般按照時(shí)間的順序顯示。在查詢過(guò)程中,會(huì)不斷的重復(fù)查詢每一個(gè)數(shù)據(jù)來(lái)源或一組數(shù)據(jù)源,為了確保搜索出所有相關(guān)的文件,這個(gè)重復(fù)是非常必要的。另外,這個(gè)查詢過(guò)程也給用戶帶來(lái)一定的負(fù)擔(dān),他要根據(jù)從同一個(gè)數(shù)據(jù)源查詢出的多個(gè)結(jié)果,進(jìn)行歸納和總結(jié)。而目
35、前的系統(tǒng)可以搜尋大的數(shù)據(jù),在這過(guò)程中要求人們尋求信息或試圖修改他們的查詢條件,以減少不必要的搜索結(jié)果(消滅潛在的相關(guān)結(jié)果),使用戶查詢到真正要查的數(shù)據(jù)。在許多情況下,用戶被迫使用中介(例如專業(yè)的搜索引擎),因?yàn)楫?dāng)前收藏的來(lái)源是復(fù)雜和廣泛的,并且有效的搜索策略經(jīng)常從一個(gè)數(shù)據(jù)來(lái)源變化到另一個(gè)。即使你按照這樣操作,也有可能錯(cuò)過(guò)相關(guān)的答案,因?yàn)樗锌赡芟嚓P(guān)的數(shù)據(jù)庫(kù)或信息來(lái)源并不在每一次搜索查詢中。所以就要付出很大的努力改善和提高數(shù)據(jù)源的選擇,更
36、大的努力在操作查詢時(shí)所</p><p><b> 系統(tǒng)概要</b></p><p> 該系統(tǒng)主要應(yīng)用于對(duì)大量數(shù)據(jù)進(jìn)行信息存儲(chǔ),查詢和檢索,查詢的結(jié)果將被導(dǎo)出成文件類型,比目前的系統(tǒng)更方面,容易的找到用戶想要查詢的有關(guān)數(shù)據(jù)。該系統(tǒng)不僅包括存儲(chǔ)廣泛數(shù)據(jù)領(lǐng)域的復(fù)合數(shù)據(jù)源記錄,還包括多個(gè)文件類型的某些原始記錄。該方式提供了搜索大數(shù)據(jù)領(lǐng)域所進(jìn)行的一次唯一辨認(rèn)文件的重要查詢部
37、分;還提供了文件重要部分的查詢,以及包括對(duì)文件數(shù)量的統(tǒng)計(jì)和屬于各種各樣的預(yù)先確定類別的文件查詢。</p><p> 查詢創(chuàng)建過(guò)程包含一個(gè)知識(shí)庫(kù),該知識(shí)庫(kù)包括被預(yù)先確定和嵌入復(fù)雜查詢的分類詞典,或者是自然語(yǔ)言的處理,或者模糊邏輯,或者樹(shù)型結(jié)構(gòu),或者等級(jí)關(guān)系,或者是一套尋求信息的公式化查詢命令。</p><p> 搜索的過(guò)程可能利用到所有的索引和搜索引擎技術(shù),包括布爾,傳播媒介,機(jī)率查詢。
38、只要每次查詢到一個(gè)原文歸檔數(shù)據(jù)的固有部分,所有建立的文檔就能返回到其組織過(guò)程。</p><p> 排序或分類的過(guò)程是通過(guò)調(diào)用搜索引擎檢索查詢的結(jié)果,從而為引入各種各樣的基本文件類型做準(zhǔn)備,然后組織安排這些容易被理解且與用戶密切相關(guān)的基本文件類型。然后提供給相對(duì)于用戶相關(guān)查詢的結(jié)果與在該查詢結(jié)果中的每個(gè)類別相關(guān)文檔數(shù)量的統(tǒng)計(jì)。用戶可以以多種形式來(lái)檢查查詢的結(jié)果,并且用戶可以根據(jù)自己的需要來(lái)查看相關(guān)的文件。<
39、/p><p><b> 圖例簡(jiǎn)要說(shuō)明</b></p><p> 圖1是信息查詢系統(tǒng)總流程圖;</p><p> 圖2是系統(tǒng)制定查詢和搜索過(guò)程圖;</p><p> 圖3是查詢排序過(guò)程中組織和顯示結(jié)果</p><p><b> 該系統(tǒng)的最佳模式</b></p>
40、<p> 正如圖1所說(shuō)明的那樣,信息檢索系統(tǒng)的開(kāi)發(fā)包括一個(gè)輸入、輸出過(guò)程,一個(gè)查詢創(chuàng)建過(guò)程,一個(gè)大量數(shù)據(jù)范圍的查詢過(guò)程(典型地在多個(gè)千兆字節(jié)范圍),一個(gè)用戶信息的組織過(guò)程,以及一個(gè)辨認(rèn)和描繪在大數(shù)據(jù)領(lǐng)域中文件的類型。</p><p> 如圖2,查詢生成過(guò)程包括分類詞詞典和筆記的一個(gè)知識(shí)庫(kù)和運(yùn)用嵌入被定定義的復(fù)雜戰(zhàn)略。這樣系統(tǒng)允許用戶輸入簡(jiǎn)單的詞或詞組,并且需要的他們的信息的描述由“自然”語(yǔ)言組成
41、和依靠系統(tǒng)協(xié)助引起充分的查詢,將包括同義詞和供選擇文詞。用戶發(fā)出一個(gè)命令然后請(qǐng)求,例如“VI CO 1”,查驗(yàn)從名單挑選的完全文件,在這種情況下,給關(guān)于身分專家的完全信息和證件。</p><p> 圖3說(shuō)明了五種一般的信息源(即原始記錄)可以被寫入多數(shù)類型的文檔,隨后被寫入類。例如,一本典型的商業(yè)雜志也許包含信息的幾個(gè)類型,例如社論、規(guī)則專欄、特寫、新聞、產(chǎn)品公告和事件日歷。因此,商業(yè)雜志(即原始記錄)也許被排
42、序入各種各樣的文件類型和這些文件類型也許反過(guò)來(lái)被分類或被編組入一個(gè)或更多套包含的類別, 每個(gè)文件類型在一套將典型地被排序入一個(gè)類別之內(nèi),但各自的類別在每個(gè)集合之內(nèi)從一個(gè)集合將變化到另一個(gè)。例如,一套類別為用戶的第一個(gè)典型類型建立,并且不同的套類別也許為用戶的第二個(gè)典型類型建立。當(dāng)對(duì)應(yīng)類型#1的用戶執(zhí)行一次查詢時(shí),系統(tǒng)為回顧自動(dòng)地運(yùn)用集合#1類別,對(duì)應(yīng)于用戶的那個(gè)特殊類型,在由用戶組織查詢的結(jié)果。當(dāng)一名用戶從類型#2執(zhí)行一次查詢時(shí),系統(tǒng)提
43、出查詢結(jié)果自動(dòng)地運(yùn)用集合#2類別對(duì)用戶。</p><p> 信息存儲(chǔ)、搜索和檢索系統(tǒng)的開(kāi)發(fā)解決了原文數(shù)據(jù)、查詢方案、資源選擇和組織查詢結(jié)果等大容量數(shù)據(jù)范圍 (即二十億字節(jié)或更多)的在線信息檢索系統(tǒng)的基本難題?;诜诸愒~典和嵌入搜索策略的信息庫(kù),允許用戶使用“自然”語(yǔ)言來(lái)進(jìn)行專業(yè)的信息查詢。數(shù)據(jù)來(lái)源(如數(shù)據(jù)庫(kù))的選擇已不再是個(gè)問(wèn)題,因?yàn)樗阉饕婺軌蛟诿看嗡阉鲿r(shí)可以搜索到整個(gè)數(shù)據(jù)域。查詢結(jié)果的獨(dú)特類設(shè)置介紹不但極大
溫馨提示
- 1. 本站所有資源如無(wú)特殊說(shuō)明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁(yè)內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒(méi)有圖紙預(yù)覽就沒(méi)有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 眾賞文庫(kù)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 外文翻譯建立信息管理系統(tǒng)
- 建立信息管理系統(tǒng)外文翻譯
- 信息管理系統(tǒng)外文資料翻譯
- 信息管理【外文翻譯】
- 信息管理外文翻譯
- 信息管理系統(tǒng)外文翻譯--電子文件麻醉信息管理系統(tǒng)我們還等什么
- 存貨信息管理【外文翻譯】
- 企業(yè)信息管理[外文翻譯]
- 信息管理與信息系統(tǒng)畢業(yè)設(shè)計(jì)外文翻譯
- 信息管理畢業(yè)論文--藥房信息系統(tǒng)(含外文翻譯)
- 信息管理與信息系統(tǒng)畢業(yè)設(shè)計(jì)(含外文翻譯)
- 學(xué)生成績(jī)信息管理系統(tǒng)(源碼+論文+文獻(xiàn)綜述+外文翻譯)
- 企業(yè)信息管理系統(tǒng)改造經(jīng)驗(yàn)教訓(xùn)和挑戰(zhàn)[外文翻譯]
- 計(jì)算機(jī)外文翻譯--學(xué)生信息管理系統(tǒng)的設(shè)計(jì)與發(fā)展
- 計(jì)算機(jī)專業(yè)外文翻譯---建立一個(gè)信息管理系統(tǒng)
- 教學(xué)樓信息管理系統(tǒng)的設(shè)計(jì)與實(shí)現(xiàn)外文翻譯.doc
- 信息管理系統(tǒng)
- 圖書(shū)信息管理畢業(yè)設(shè)計(jì)(含外文翻譯)
- 信息管理系統(tǒng)
- 小型物流企業(yè)信息管理系統(tǒng)設(shè)計(jì)畢業(yè)設(shè)計(jì)(含外文翻譯)
評(píng)論
0/150
提交評(píng)論