summaryrefslogtreecommitdiffstats
path: root/src/spdk/dpdk/doc/guides/nics/ena.rst
blob: bec97c3326a72a5d9017bcc921b5d0e30df5458d (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
..  SPDX-License-Identifier: BSD-3-Clause
    Copyright (c) 2015-2020 Amazon.com, Inc. or its affiliates.
    All rights reserved.

ENA Poll Mode Driver
====================

The ENA PMD is a DPDK poll-mode driver for the Amazon Elastic
Network Adapter (ENA) family.

Overview
--------

The ENA driver exposes a lightweight management interface with a
minimal set of memory mapped registers and an extendable command set
through an Admin Queue.

The driver supports a wide range of ENA adapters, is link-speed
independent (i.e., the same driver is used for 10GbE, 25GbE, 40GbE,
etc.), and it negotiates and supports an extendable feature set.

ENA adapters allow high speed and low overhead Ethernet traffic
processing by providing a dedicated Tx/Rx queue pair per CPU core.

The ENA driver supports industry standard TCP/IP offload features such
as checksum offload and TCP transmit segmentation offload (TSO).

Receive-side scaling (RSS) is supported for multi-core scaling.

Some of the ENA devices support a working mode called Low-latency
Queue (LLQ), which saves several more microseconds.

Management Interface
--------------------

ENA management interface is exposed by means of:

* Device Registers
* Admin Queue (AQ) and Admin Completion Queue (ACQ)

ENA device memory-mapped PCIe space for registers (MMIO registers)
are accessed only during driver initialization and are not involved
in further normal device operation.

AQ is used for submitting management commands, and the
results/responses are reported asynchronously through ACQ.

ENA introduces a very small set of management commands with room for
vendor-specific extensions. Most of the management operations are
framed in a generic Get/Set feature command.

The following admin queue commands are supported:

* Create I/O submission queue
* Create I/O completion queue
* Destroy I/O submission queue
* Destroy I/O completion queue
* Get feature
* Set feature
* Get statistics

Refer to ``ena_admin_defs.h`` for the list of supported Get/Set Feature
properties.

Data Path Interface
-------------------

I/O operations are based on Tx and Rx Submission Queues (Tx SQ and Rx
SQ correspondingly). Each SQ has a completion queue (CQ) associated
with it.

The SQs and CQs are implemented as descriptor rings in contiguous
physical memory.

Refer to ``ena_eth_io_defs.h`` for the detailed structure of the descriptor

The driver supports multi-queue for both Tx and Rx.

Configuration information
-------------------------

**DPDK Configuration Parameters**

  The following configuration options are available for the ENA PMD:

   * **CONFIG_RTE_LIBRTE_ENA_PMD** (default y): Enables or disables inclusion
     of the ENA PMD driver in the DPDK compilation.

   * **CONFIG_RTE_LIBRTE_ENA_DEBUG_RX** (default n): Enables or disables debug
     logging of RX logic within the ENA PMD driver.

   * **CONFIG_RTE_LIBRTE_ENA_DEBUG_TX** (default n): Enables or disables debug
     logging of TX logic within the ENA PMD driver.

   * **CONFIG_RTE_LIBRTE_ENA_COM_DEBUG** (default n): Enables or disables debug
     logging of low level tx/rx logic in ena_com(base) within the ENA PMD driver.

**Runtime Configuration Parameters**

   * **large_llq_hdr** (default 0)

     Enables or disables usage of large LLQ headers. This option will have
     effect only if the device also supports large LLQ headers. Otherwise, the
     default value will be used.

**ENA Configuration Parameters**

   * **Number of Queues**

     This is the requested number of queues upon initialization, however, the actual
     number of receive and transmit queues to be created will be the minimum between
     the maximal number supported by the device and number of queues requested.

   * **Size of Queues**

     This is the requested size of receive/transmit queues, while the actual size
     will be the minimum between the requested size and the maximal receive/transmit
     supported by the device.

Building DPDK
-------------

See the :ref:`DPDK Getting Started Guide for Linux <linux_gsg>` for
instructions on how to build DPDK.

By default the ENA PMD library will be built into the DPDK library.

For configuring and using UIO and VFIO frameworks, please also refer :ref:`the
documentation that comes with DPDK suite <linux_gsg>`.

Supported ENA adapters
----------------------

Current ENA PMD supports the following ENA adapters including:

* ``1d0f:ec20`` - ENA VF
* ``1d0f:ec21`` - ENA VF with LLQ support

Supported Operating Systems
---------------------------

Any Linux distribution fulfilling the conditions described in ``System Requirements``
section of :ref:`the DPDK documentation <linux_gsg>` or refer to *DPDK Release Notes*.

Supported features
------------------

* MTU configuration
* Jumbo frames up to 9K
* IPv4/TCP/UDP checksum offload
* TSO offload
* Multiple receive and transmit queues
* RSS hash
* RSS indirection table configuration
* Low Latency Queue for Tx
* Basic and extended statistics
* LSC event notification
* Watchdog (requires handling of timers in the application)
* Device reset upon failure

Prerequisites
-------------

#. Prepare the system as recommended by DPDK suite.  This includes environment
   variables, hugepages configuration, tool-chains and configuration.

#. ENA PMD can operate with ``vfio-pci``(*) or ``igb_uio`` driver.

   (*) ENAv2 hardware supports Low Latency Queue v2 (LLQv2). This feature
   reduces the latency of the packets by pushing the header directly through
   the PCI to the device, before the DMA is even triggered. For proper work
   kernel PCI driver must support write combining (WC). In mainline version of
   ``igb_uio`` (in DPDK repo) it must be enabled by loading module with
   ``wc_activate=1`` flag (example below). However, mainline's vfio-pci
   driver in kernel doesn't have WC support yet (planed to be added).
   If vfio-pci used user should be either turn off ENAv2 (to avoid performance
   impact) or recompile vfio-pci driver with patch provided in
   `amzn-github <https://github.com/amzn/amzn-drivers/tree/master/userspace/dpdk/enav2-vfio-patch>`_.

#. Insert ``vfio-pci`` or ``igb_uio`` kernel module using the command
   ``modprobe vfio-pci`` or ``modprobe uio; insmod igb_uio.ko wc_activate=1``
   respectively.

#. For ``vfio-pci`` users only:
   Please make sure that ``IOMMU`` is enabled in your system,
   or use ``vfio`` driver in ``noiommu`` mode::

     echo 1 > /sys/module/vfio/parameters/enable_unsafe_noiommu_mode

   To use ``noiommu`` mode, the ``vfio-pci`` must be built with flag
   ``CONFIG_VFIO_NOIOMMU``.

#. Bind the intended ENA device to ``vfio-pci`` or ``igb_uio`` module.

At this point the system should be ready to run DPDK applications. Once the
application runs to completion, the ENA can be detached from attached module if
necessary.

**Note about usage on \*.metal instances**

On AWS, the metal instances are supporting IOMMU for both arm64 and x86_64
hosts.

* x86_64 (e.g. c5.metal, i3.metal):
   IOMMU should be disabled by default. In that situation, the ``igb_uio`` can
   be used as it is but ``vfio-pci`` should be working in no-IOMMU mode (please
   see above).

   When IOMMU is enabled, ``igb_uio`` cannot be used as it's not supporting this
   feature, while ``vfio-pci`` should work without any changes.
   To enable IOMMU on those hosts, please update ``GRUB_CMDLINE_LINUX`` in file
   ``/etc/default/grub`` with the below extra boot arguments::

    iommu=1 intel_iommu=on

   Then, make the changes live by executing as a root::

    # grub2-mkconfig > /boot/grub2/grub.cfg

   Finally, reboot should result in IOMMU being enabled.

* arm64 (a1.metal):
   IOMMU should be enabled by default. Unfortunately, ``vfio-pci`` isn't
   supporting SMMU, which is implementation of IOMMU for arm64 architecture and
   ``igb_uio`` isn't supporting IOMMU at all, so to use DPDK with ENA on those
   hosts, one must disable IOMMU. This can be done by updating
   ``GRUB_CMDLINE_LINUX`` in file ``/etc/default/grub`` with the extra boot
   argument::

    iommu.passthrough=1

   Then, make the changes live by executing as a root::

    # grub2-mkconfig > /boot/grub2/grub.cfg

   Finally, reboot should result in IOMMU being disabled.
   Without IOMMU, ``igb_uio`` can be used as it is but ``vfio-pci`` should be
   working in no-IOMMU mode (please see above).

Usage example
-------------

Follow instructions available in the document
:ref:`compiling and testing a PMD for a NIC <pmd_build_and_test>` to launch
**testpmd** with Amazon ENA devices managed by librte_pmd_ena.

Example output:

.. code-block:: console

   [...]
   EAL: PCI device 0000:00:06.0 on NUMA socket -1
   EAL:   Invalid NUMA socket, default to 0
   EAL:   probe driver: 1d0f:ec20 net_ena

   Interactive-mode selected
   testpmd: create a new mbuf pool <mbuf_pool_socket_0>: n=171456, size=2176, socket=0
   testpmd: preferred mempool ops selected: ring_mp_mc
   Warning! port-topology=paired and odd forward ports number, the last port will pair with itself.
   Configuring Port 0 (socket 0)
   Port 0: 00:00:00:11:00:01
   Checking link statuses...

   Done
   testpmd>