海量數(shù)據(jù)存儲與全文檢索.pdf_第1頁
已閱讀1頁,還剩73頁未讀 繼續(xù)免費閱讀

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進行舉報或認領(lǐng)

文檔簡介

1、江蘇科技大學碩士學位論文海量數(shù)據(jù)存儲與全文檢索姓名:苗帥申請學位級別:碩士專業(yè):模式識別與智能系統(tǒng)指導教師:王衛(wèi)東2010-03-13Abstract III Abstract With the rapid development of China's naval vessels, the development of weapon systems and logistics support generated a lot of

2、 technical information and project management documents. Because of poor management of these documents, resulting in duplication of information about equipment and a huge waste of human and material resources. Establish

3、a secure, high availability, integrated management system of technical publications, which is to establish a sound mechanism for logistics support. In order to meet the actual needs for technical staff of naval vessels,

4、the thesis made more in-depth and systematic study on integrated management system of technical publications, so as to provide various, more accurate information. This thesis presents mass data storage environment, anal

5、yzes the current storage mode, and puts focus on object-oriented storage technology. Storage mode of the object has good scalability, high performance, cross-platform and secures data sharing capabilities, which makes it

6、 an ideal mass data storage choice. Second, according to software engineering development processes and user needs, the thesis described in detail feasibility analysis and demand analysis of integrated management system

7、 for technical publications. On the basis, the thesis design outline of the logic architecture and physical architecture of the system, meanwhile, detailed design the logical structure of function modules. Finally, each

8、module of the system was realized. In this thesis, based on the realization of the basic functions of the system, also full-text search of the system was optimized. These include: ⑴ For the full-text search technology, t

9、he thesis found the lack of existing technology about Chinese segmentation and made better the maximum matching algorithm; ⑵ With the base of inverted index, a full-text index model which is concerning incremental B+-Lis

10、ts is adopted. ⑶ In order to improve both precision and recall rate in information retrieva, This thesis proposes a new query optimization method based on analysis of local classification and genetic algorithm. The thesi

11、s uses analysis of local classification which is query expansion method to expand the query, and then uses genetic algorithms to reweight the query vector which is expanded and does experimental verification on the effec

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 眾賞文庫僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負責。
  • 6. 下載文件中如有侵權(quán)或不適當內(nèi)容,請與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準確性、安全性和完整性, 同時也不承擔用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

評論

0/150

提交評論