多通道機會訪問.pdf_第1頁
已閱讀1頁,還剩101頁未讀 繼續(xù)免費閱讀

下載本文檔

版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請進行舉報或認(rèn)領(lǐng)

文檔簡介

1、Formally, we provide a generic analysis on the opportunistic spectrum access problem by casting the problem into the restless multi-armed bandit (RMAB) problem, one of the most well-known generalizations of the classic m

2、ulti-armed bandit (MAB) problem, which is of fun- damental importance in stochastic decision theory. Despite the significant research efforts in the field, the RMAB problem in its generic form still remains open. Until t

3、oday,very little result is reported on the structure of the optimal policy. Obtaining the optimal policy for a general RMAB prCognitive radio, first envisioned by Mitola, is the key enabling technology for future genera-

4、 tions of wireless systems that addresses critical challenges in spectrum efficiency, interference management, and coexistence of heterogeneous networks. The core concept in c:ognitive radio networks is opportunistic spe

5、ctrum access, whose objective is to solve the imbalance between spectrum scarcity and spectrum under-utilization.
   In the thesis, we address the fundamental problem of opportunistic spectrum access in a multi-chann

6、el communication system. Specifically, we consider a conununication system in which a user has access to multiple channels, but is limited to sensing and transmitting only on part of them at a given time. We explore how

7、the smart user should exploit past observations and the knowledge of the stochastic properties of these channels to maximize its transmission rate by switching channels opportunistically.oblem is often intractable due to

8、 the exponential computation complexity. Hence, a natural alternative is to seek a simple myopic policy maximizing the short-term reward.
   We start by conducting a generic analysis in Chapter 3 on the optimality of

9、 the myopic sensing policy where the user senses more than one channel each time and gets one unit of reward if at least one of the sensed channels is in the good state. Through mathematical analysis, we show that the my

10、opic sensing policy is optimal only for a small subset of cases where the user is allowed to sense two channels each slot. In the general case, we give counterexamples to illustrate that the myopic sensing policy is not

11、optimal.
   Motivated by the above analysis, we then study the following natural while fundamentally important, question in Chapter 4 (for the homogeneous system consisting of i.i.d. channels) and Chapter 5 (for the

12、heterogeneous system consisting of non i.i.d. channels): under what conditions is the myopic policy guaranteed to be optimal? We answer the above posed question by performing an axiomatic study. More specifically, we dev

13、elop three axioms characterizing a family of functions which we refer to as regular functions, which are generic and practically important. We then establish the optimality of the myopic policy when the reward function c

14、an be expressed as a regular function and the discount factor is bounded by a closed-form threshold determined by the reward function. We also illustrate how the derived results, generic in nature, are applied to analyze

15、 a class of RMAB problems arising from multi-channel opportunistic access.
   In Chapter 6, we further investigate the more challenging problem where the user has to decide the number of channels to sense iri each sl

16、ot in order to maximize its utility (e.g., throughput). We formulate t.he corresponding optimization problem which hinges on the fol- lowing tradeoff between exploit.ation and explorat.ion: sensing more channels can help

17、 learn and predict the future channel state, thus increasing the long-term reward, but at the price of sacrificing the reward at current slot as sensing more channels reduces the time for data t.ransmission, thus decreas

18、ing the throughput in the current slot. Therefore, to find the optimal uumber of channels to sense consists of striking a balance between the above exploitation and exploration. After showing the exponential complexity o

19、f the problem, we develop a heuristic v-step look-ahead strategy which consists of sensing channels in a myopic way and stopping sensing when the expected aggregated utility from the current slot t to slot t+v begins to

20、de- crease. In the developed strategy, the parameter v allows to achieve a desired tradeoff between social efficiency and computation complexity. We demonstrate the benefits of the proposed strategy via numerical experim

21、ents on several typical settings.
   Finally, Chapter 7 concludes the thesis and outlines several important future research di- rections in this field. Note that despite the focus of this thesis in the domain of oppo

22、rtunistic communication, the problem formulation is applicable in many other engineering fields such as communication jamming, scheduling and object tracking. Hence the results presented in t,his t.hesis are generically

溫馨提示

  • 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請下載最新的WinRAR軟件解壓。
  • 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
  • 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
  • 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
  • 5. 眾賞文庫僅提供信息存儲空間,僅對用戶上傳內(nèi)容的表現(xiàn)方式做保護處理,對用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對任何下載內(nèi)容負(fù)責(zé)。
  • 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請與我們聯(lián)系,我們立即糾正。
  • 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時也不承擔(dān)用戶因使用這些下載資源對自己和他人造成任何形式的傷害或損失。

評論

0/150

提交評論