• DocumentCode
    2292182
  • Title

    Tuning of reinforcement learning parameters applied to OLSR using a cognitive network design tool

  • Author

    McAuley, A. ; Sinkar, K. ; Kant, L. ; Graff, C. ; Patel, M.

  • Author_Institution
    Telcordia Technol. Inc., Piscataway, NJ, USA
  • fYear
    2012
  • fDate
    1-4 April 2012
  • Firstpage
    2786
  • Lastpage
    2791
  • Abstract
    In wireless mesh networks, with the standard Optimized Link State Routing (OLSR) metric (i.e. hop count), traffic is routed on the shortest path without considering factors such as traffic distribution and link capacities. Consequently, some nodes may get overloaded from the uneven utilization of network resources. OLSR can be modified to use other link cost metrics, with route selection based on lowest cost path. With delay as the metric, OLSR reduces average round trip time but the load-aware routes may cause wide variance in delay and packet reordering due to route oscillations. We describe a new hybrid routing approach that combines the strength of a) link state routing (e.g. fast convergence), b) load-aware routing (e.g., avoiding congested paths) and c) cognitive routing (e.g. learning to avoid path oscillations). In particular, we investigate the use of Q-learning with OLSR to increase network capacity and reduce congestion delay. We present simulation results for a 36 node dynamic mobile ad hoc network, with standard OLSR, a non-cognitive load-aware OLSR (OLSR-D) and our new hybrid cognitive load-aware OLSR (OLSR-Q). We show that OLSR-Q >; OLSR-D >; OLSR in terms of reducing delay and increasing network capacity. Furthermore, we show that, unlike conventional cognitive Q-routing protocols, our hybrid approach does not reduce performance at low load. Although OLSR-Q can significantly reduce delay and improve capacity, the learning time can reduce connectivity and the distribution of more link state information can reduce raw link capacity. We show how adding OLSR, OLSR-D and OLSR-Q as routing options into the Cognitive Network Engineering Design Analytic Toolset (C-NEDAT), we can select the best routing protocol and parameters (e.g., learning rate) for a given network and its mission. We verify simulation performance improvements by implementing the OLSR-Q in on a 9 node wireless testbed.
  • Keywords
    cognitive radio; delays; learning (artificial intelligence); performance evaluation; radio links; routing protocols; telecommunication computing; telecommunication congestion control; telecommunication traffic; wireless mesh networks; C-NEDAT; OLSR metric; OLSR-D; OLSR-Q; Q-learning; cognitive network design tool; cognitive network engineering design analytic toolset; cognitive routing; congestion delay; conventional cognitive Q-routing protocols; dynamic mobile ad hoc network; hop count; hybrid cognitive load-aware OLSR; hybrid routing approach; link cost metrics; link state information; load-aware routes; load-aware routing; lowest cost path; network capacity; network resources; noncognitive load-aware OLSR; packet reordering; raw link capacity; reinforcement learning parameters tuning; route oscillations; route selection; routing options; shortest path; simulation performance improvements; standard OLSR; standard optimized link state routing metric; traffic distribution; uneven utilization; wireless mesh networks; Delay; Load modeling; Logic gates; Routing; Routing protocols; Wireless communication;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless Communications and Networking Conference (WCNC), 2012 IEEE
  • Conference_Location
    Shanghai
  • ISSN
    1525-3511
  • Print_ISBN
    978-1-4673-0436-8
  • Type

    conf

  • DOI
    10.1109/WCNC.2012.6214275
  • Filename
    6214275