• DocumentCode
    3588482
  • Title

    Achieving high performance with TCP over 40GbE on NUMA architectures for CMS data acquisition

  • Author

    Bawej, Tomasz ; Behrens, Ulf ; Branson, James ; Chaze, Olivier ; Cittolin, Sergio ; Darlea, Georgiana-Lavinia ; Deldicque, Christian ; Dobson, Marc ; Dupont, Aymeric ; Erhan, Samim ; Forrest, Andrew ; Gigi, Dominique ; Glege, Frank ; Gomez-Ceballos, Guill

  • Author_Institution
    CERN, Geneva, Switzerland
  • fYear
    2014
  • Firstpage
    1
  • Lastpage
    1
  • Abstract
    TCP and the socket abstraction have barely changed over the last two decades, but at the network layer there has been a giant leap from a few megabits to 100 gigabits in bandwidth. At the same time, CPU architectures have evolved into the multicore era and applications are expected to make full use of all available resources. Applications in the data acquisition domain based on the standard socket library running in a Non-Uniform Memory Access (NUMA) architecture are unable to reach full efficiency and scalability without the software being adequately aware about the IRQ (Interrupt Request), CPU and memory affinities. During the first long shutdown of LHC, the CMS DAQ system is going to be upgraded for operation from 2015 onwards and a new software component has been designed and developed in the CMS online framework for transferring data with sockets. This software attempts to wrap the low-level socket library to ease higher-level programming with an API based on an asynchronous event driven model similar to the DAT uDAPL API. It is an event-based application with NUMA optimizations, that allows for a high throughput of data across a large distributed system. This paper describes the architecture, the technologies involved and the performance measurements of the software in the context of the CMS distributed event building.
  • Keywords
    application program interfaces; data acquisition; distributed processing; high energy physics instrumentation computing; memory architecture; software libraries; API; CMS DAQ system; CMS data acquisition; CMS distributed event building; CPU; CPU architectures; IRQ; LHC; NUMA architectures; NUMA optimizations; TCP; asynchronous event driven model; data transfer; distributed system; event-based application; high throughput; higher-level programming; interrupt request; low-level socket library; memory affinities; network layer; nonuniform memory access architecture; online CMS framework; performance measurements; socket abstraction; software component; standard socket library; Bandwidth; Data acquisition; Libraries; Memory management; Sockets; Software; Data acquisition systems; data communication; distributed computing; fast networks; high energy physics computing; software performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Real Time Conference (RT), 2014 19th IEEE-NPSS
  • Print_ISBN
    978-1-4799-3658-8
  • Type

    conf

  • DOI
    10.1109/RTC.2014.7097439
  • Filename
    7097439