1Raw TCP/IP interface for lwIP
2
3Authors: Adam Dunkels, Leon Woestenberg, Christiaan Simons
4
5lwIP provides three Application Program's Interfaces (APIs) for programs
6to use for communication with the TCP/IP code:
7* low-level "core" / "callback" or "raw" API.
8* higher-level "sequential" API.
9* BSD-style socket API.
10
11The sequential API provides a way for ordinary, sequential, programs
12to use the lwIP stack. It is quite similar to the BSD socket API. The
13model of execution is based on the blocking open-read-write-close
14paradigm. Since the TCP/IP stack is event based by nature, the TCP/IP
15code and the application program must reside in different execution
16contexts (threads).
17
18The socket API is a compatibility API for existing applications,
19currently it is built on top of the sequential API. It is meant to
20provide all functions needed to run socket API applications running
21on other platforms (e.g. unix / windows etc.). However, due to limitations
22in the specification of this API, there might be incompatibilities
23that require small modifications of existing programs.
24
25** Threading
26
27lwIP started targeting single-threaded environments. When adding multi-
28threading support, instead of making the core thread-safe, another
29approach was chosen: there is one main thread running the lwIP core
30(also known as the "tcpip_thread"). The raw API may only be used from
31this thread! Application threads using the sequential- or socket API
32communicate with this main thread through message passing.
33
34      As such, the list of functions that may be called from
35      other threads or an ISR is very limited! Only functions
36      from these API header files are thread-safe:
37      - api.h
38      - netbuf.h
39      - netdb.h
40      - netifapi.h
41      - sockets.h
42      - sys.h
43
44      Additionaly, memory (de-)allocation functions may be
45      called from multiple threads (not ISR!) with NO_SYS=0
46      since they are protected by SYS_LIGHTWEIGHT_PROT and/or
47      semaphores.
48
49      Only since 1.3.0, if SYS_LIGHTWEIGHT_PROT is set to 1
50      and LWIP_ALLOW_MEM_FREE_FROM_OTHER_CONTEXT is set to 1,
51      pbuf_free() may also be called from another thread or
52      an ISR (since only then, mem_free - for PBUF_RAM - may
53      be called from an ISR: otherwise, the HEAP is only
54      protected by semaphores).
55
56
57** The remainder of this document discusses the "raw" API. **
58
59The raw TCP/IP interface allows the application program to integrate
60better with the TCP/IP code. Program execution is event based by
61having callback functions being called from within the TCP/IP
62code. The TCP/IP code and the application program both run in the same
63thread. The sequential API has a much higher overhead and is not very
64well suited for small systems since it forces a multithreaded paradigm
65on the application.
66
67The raw TCP/IP interface is not only faster in terms of code execution
68time but is also less memory intensive. The drawback is that program
69development is somewhat harder and application programs written for
70the raw TCP/IP interface are more difficult to understand. Still, this
71is the preferred way of writing applications that should be small in
72code size and memory usage.
73
74Both APIs can be used simultaneously by different application
75programs. In fact, the sequential API is implemented as an application
76program using the raw TCP/IP interface.
77
78--- Callbacks
79
80Program execution is driven by callbacks. Each callback is an ordinary
81C function that is called from within the TCP/IP code. Every callback
82function is passed the current TCP or UDP connection state as an
83argument. Also, in order to be able to keep program specific state,
84the callback functions are called with a program specified argument
85that is independent of the TCP/IP state.
86
87The function for setting the application connection state is:
88
89- void tcp_arg(struct tcp_pcb *pcb, void *arg)
90
91  Specifies the program specific state that should be passed to all
92  other callback functions. The "pcb" argument is the current TCP
93  connection control block, and the "arg" argument is the argument
94  that will be passed to the callbacks.
95
96
97--- TCP connection setup
98
99The functions used for setting up connections is similar to that of
100the sequential API and of the BSD socket API. A new TCP connection
101identifier (i.e., a protocol control block - PCB) is created with the
102tcp_new() function. This PCB can then be either set to listen for new
103incoming connections or be explicitly connected to another host.
104
105- struct tcp_pcb *tcp_new(void)
106
107  Creates a new connection identifier (PCB). If memory is not
108  available for creating the new pcb, NULL is returned.
109
110- err_t tcp_bind(struct tcp_pcb *pcb, struct ip_addr *ipaddr,
111                 u16_t port)
112
113  Binds the pcb to a local IP address and port number. The IP address
114  can be specified as IP_ADDR_ANY in order to bind the connection to
115  all local IP addresses.
116
117  If another connection is bound to the same port, the function will
118  return ERR_USE, otherwise ERR_OK is returned.
119
120- struct tcp_pcb *tcp_listen(struct tcp_pcb *pcb)
121
122  Commands a pcb to start listening for incoming connections. When an
123  incoming connection is accepted, the function specified with the
124  tcp_accept() function will be called. The pcb will have to be bound
125  to a local port with the tcp_bind() function.
126
127  The tcp_listen() function returns a new connection identifier, and
128  the one passed as an argument to the function will be
129  deallocated. The reason for this behavior is that less memory is
130  needed for a connection that is listening, so tcp_listen() will
131  reclaim the memory needed for the original connection and allocate a
132  new smaller memory block for the listening connection.
133
134  tcp_listen() may return NULL if no memory was available for the
135  listening connection. If so, the memory associated with the pcb
136  passed as an argument to tcp_listen() will not be deallocated.
137
138- struct tcp_pcb *tcp_listen_with_backlog(struct tcp_pcb *pcb, u8_t backlog)
139
140  Same as tcp_listen, but limits the number of outstanding connections
141  in the listen queue to the value specified by the backlog argument.
142  To use it, your need to set TCP_LISTEN_BACKLOG=1 in your lwipopts.h.
143
144- void tcp_accepted(struct tcp_pcb *pcb)
145
146  Inform lwIP that an incoming connection has been accepted. This would
147  usually be called from the accept callback. This allows lwIP to perform
148  housekeeping tasks, such as allowing further incoming connections to be
149  queued in the listen backlog.
150
151- void tcp_accept(struct tcp_pcb *pcb,
152                  err_t (* accept)(void *arg, struct tcp_pcb *newpcb,
153                                   err_t err))
154
155  Specified the callback function that should be called when a new
156  connection arrives on a listening connection.
157
158- err_t tcp_connect(struct tcp_pcb *pcb, struct ip_addr *ipaddr,
159                    u16_t port, err_t (* connected)(void *arg,
160                                                    struct tcp_pcb *tpcb,
161                                                    err_t err));
162
163  Sets up the pcb to connect to the remote host and sends the
164  initial SYN segment which opens the connection.
165
166  The tcp_connect() function returns immediately; it does not wait for
167  the connection to be properly setup. Instead, it will call the
168  function specified as the fourth argument (the "connected" argument)
169  when the connection is established. If the connection could not be
170  properly established, either because the other host refused the
171  connection or because the other host didn't answer, the "err"
172  callback function of this pcb (registered with tcp_err, see below)
173  will be called.
174
175  The tcp_connect() function can return ERR_MEM if no memory is
176  available for enqueueing the SYN segment. If the SYN indeed was
177  enqueued successfully, the tcp_connect() function returns ERR_OK.
178
179
180--- Sending TCP data
181
182TCP data is sent by enqueueing the data with a call to
183tcp_write(). When the data is successfully transmitted to the remote
184host, the application will be notified with a call to a specified
185callback function.
186
187- err_t tcp_write(struct tcp_pcb *pcb, void *dataptr, u16_t len,
188                  u8_t copy)
189
190  Enqueues the data pointed to by the argument dataptr. The length of
191  the data is passed as the len parameter. The copy argument is either
192  0 or 1 and indicates whether the new memory should be allocated for
193  the data to be copied into. If the argument is 0, no new memory
194  should be allocated and the data should only be referenced by
195  pointer.
196
197  The tcp_write() function will fail and return ERR_MEM if the length
198  of the data exceeds the current send buffer size or if the length of
199  the queue of outgoing segment is larger than the upper limit defined
200  in lwipopts.h. The number of bytes available in the output queue can
201  be retrieved with the tcp_sndbuf() function.
202
203  The proper way to use this function is to call the function with at
204  most tcp_sndbuf() bytes of data. If the function returns ERR_MEM,
205  the application should wait until some of the currently enqueued
206  data has been successfully received by the other host and try again.
207
208- void tcp_sent(struct tcp_pcb *pcb,
209                err_t (* sent)(void *arg, struct tcp_pcb *tpcb,
210                u16_t len))
211
212  Specifies the callback function that should be called when data has
213  successfully been received (i.e., acknowledged) by the remote
214  host. The len argument passed to the callback function gives the
215  amount bytes that was acknowledged by the last acknowledgment.
216
217
218--- Receiving TCP data
219
220TCP data reception is callback based - an application specified
221callback function is called when new data arrives. When the
222application has taken the data, it has to call the tcp_recved()
223function to indicate that TCP can advertise increase the receive
224window.
225
226- void tcp_recv(struct tcp_pcb *pcb,
227                err_t (* recv)(void *arg, struct tcp_pcb *tpcb,
228                               struct pbuf *p, err_t err))
229
230  Sets the callback function that will be called when new data
231  arrives. The callback function will be passed a NULL pbuf to
232  indicate that the remote host has closed the connection. If
233  there are no errors and the callback function is to return
234  ERR_OK, then it must free the pbuf. Otherwise, it must not
235  free the pbuf so that lwIP core code can store it.
236
237- void tcp_recved(struct tcp_pcb *pcb, u16_t len)
238
239  Must be called when the application has received the data. The len
240  argument indicates the length of the received data.
241
242
243--- Application polling
244
245When a connection is idle (i.e., no data is either transmitted or
246received), lwIP will repeatedly poll the application by calling a
247specified callback function. This can be used either as a watchdog
248timer for killing connections that have stayed idle for too long, or
249as a method of waiting for memory to become available. For instance,
250if a call to tcp_write() has failed because memory wasn't available,
251the application may use the polling functionality to call tcp_write()
252again when the connection has been idle for a while.
253
254- void tcp_poll(struct tcp_pcb *pcb,
255                err_t (* poll)(void *arg, struct tcp_pcb *tpcb),
256				u8_t interval)
257
258  Specifies the polling interval and the callback function that should
259  be called to poll the application. The interval is specified in
260  number of TCP coarse grained timer shots, which typically occurs
261  twice a second. An interval of 10 means that the application would
262  be polled every 5 seconds.
263
264
265--- Closing and aborting connections
266
267- err_t tcp_close(struct tcp_pcb *pcb)
268
269  Closes the connection. The function may return ERR_MEM if no memory
270  was available for closing the connection. If so, the application
271  should wait and try again either by using the acknowledgment
272  callback or the polling functionality. If the close succeeds, the
273  function returns ERR_OK.
274
275  The pcb is deallocated by the TCP code after a call to tcp_close().
276
277- void tcp_abort(struct tcp_pcb *pcb)
278
279  Aborts the connection by sending a RST (reset) segment to the remote
280  host. The pcb is deallocated. This function never fails.
281
282  ATTENTION: When calling this from one of the TCP callbacks, make
283  sure you always return ERR_ABRT (and never return ERR_ABRT otherwise
284  or you will risk accessing deallocated memory or memory leaks!
285
286
287If a connection is aborted because of an error, the application is
288alerted of this event by the err callback. Errors that might abort a
289connection are when there is a shortage of memory. The callback
290function to be called is set using the tcp_err() function.
291
292- void tcp_err(struct tcp_pcb *pcb, void (* err)(void *arg,
293       err_t err))
294
295  The error callback function does not get the pcb passed to it as a
296  parameter since the pcb may already have been deallocated.
297
298
299--- Lower layer TCP interface
300
301TCP provides a simple interface to the lower layers of the
302system. During system initialization, the function tcp_init() has
303to be called before any other TCP function is called. When the system
304is running, the two timer functions tcp_fasttmr() and tcp_slowtmr()
305must be called with regular intervals. The tcp_fasttmr() should be
306called every TCP_FAST_INTERVAL milliseconds (defined in tcp.h) and
307tcp_slowtmr() should be called every TCP_SLOW_INTERVAL milliseconds.
308
309
310--- UDP interface
311
312The UDP interface is similar to that of TCP, but due to the lower
313level of complexity of UDP, the interface is significantly simpler.
314
315- struct udp_pcb *udp_new(void)
316
317  Creates a new UDP pcb which can be used for UDP communication. The
318  pcb is not active until it has either been bound to a local address
319  or connected to a remote address.
320
321- void udp_remove(struct udp_pcb *pcb)
322
323  Removes and deallocates the pcb.
324
325- err_t udp_bind(struct udp_pcb *pcb, struct ip_addr *ipaddr,
326                 u16_t port)
327
328  Binds the pcb to a local address. The IP-address argument "ipaddr"
329  can be IP_ADDR_ANY to indicate that it should listen to any local IP
330  address. The function currently always return ERR_OK.
331
332- err_t udp_connect(struct udp_pcb *pcb, struct ip_addr *ipaddr,
333                    u16_t port)
334
335  Sets the remote end of the pcb. This function does not generate any
336  network traffic, but only set the remote address of the pcb.
337
338- err_t udp_disconnect(struct udp_pcb *pcb)
339
340  Remove the remote end of the pcb. This function does not generate
341  any network traffic, but only removes the remote address of the pcb.
342
343- err_t udp_send(struct udp_pcb *pcb, struct pbuf *p)
344
345  Sends the pbuf p. The pbuf is not deallocated.
346
347- void udp_recv(struct udp_pcb *pcb,
348                void (* recv)(void *arg, struct udp_pcb *upcb,
349                                         struct pbuf *p,
350                                         struct ip_addr *addr,
351                                         u16_t port),
352                              void *recv_arg)
353
354  Specifies a callback function that should be called when a UDP
355  datagram is received.
356
357
358--- System initalization
359
360A truly complete and generic sequence for initializing the lwip stack
361cannot be given because it depends on the build configuration (lwipopts.h)
362and additional initializations for your runtime environment (e.g. timers).
363
364We can give you some idea on how to proceed when using the raw API.
365We assume a configuration using a single Ethernet netif and the
366UDP and TCP transport layers, IPv4 and the DHCP client.
367
368Call these functions in the order of appearance:
369
370- stats_init()
371
372  Clears the structure where runtime statistics are gathered.
373
374- sys_init()
375
376  Not of much use since we set the NO_SYS 1 option in lwipopts.h,
377  to be called for easy configuration changes.
378
379- lwip_mem_init()
380
381  Initializes the dynamic memory heap defined by MEM_SIZE.
382
383- memp_init()
384
385  Initializes the memory pools defined by MEMP_NUM_x.
386
387- pbuf_init()
388
389  Initializes the pbuf memory pool defined by PBUF_POOL_SIZE.
390
391- etharp_init()
392
393  Initializes the ARP table and queue.
394  Note: you must call etharp_tmr at a ARP_TMR_INTERVAL (5 seconds) regular interval
395  after this initialization.
396
397- ip_init()
398
399  Doesn't do much, it should be called to handle future changes.
400
401- udp_init()
402
403  Clears the UDP PCB list.
404
405- tcp_init()
406
407  Clears the TCP PCB list and clears some internal TCP timers.
408  Note: you must call tcp_fasttmr() and tcp_slowtmr() at the
409  predefined regular intervals after this initialization.
410
411- netif_add(struct netif *netif, struct ip_addr *ipaddr,
412            struct ip_addr *netmask, struct ip_addr *gw,
413            void *state, err_t (* init)(struct netif *netif),
414            err_t (* input)(struct pbuf *p, struct netif *netif))
415
416  Adds your network interface to the netif_list. Allocate a struct
417  netif and pass a pointer to this structure as the first argument.
418  Give pointers to cleared ip_addr structures when using DHCP,
419  or fill them with sane numbers otherwise. The state pointer may be NULL.
420
421  The init function pointer must point to a initialization function for
422  your ethernet netif interface. The following code illustrates it's use.
423
424  err_t netif_if_init(struct netif *netif)
425  {
426    u8_t i;
427
428    for(i = 0; i < ETHARP_HWADDR_LEN; i++) netif->hwaddr[i] = some_eth_addr[i];
429    init_my_eth_device();
430    return ERR_OK;
431  }
432
433  For ethernet drivers, the input function pointer must point to the lwip
434  function ethernet_input() declared in "netif/etharp.h". Other drivers
435  must use ip_input() declared in "lwip/ip.h".
436
437- netif_set_default(struct netif *netif)
438
439  Registers the default network interface.
440
441- netif_set_up(struct netif *netif)
442
443  When the netif is fully configured this function must be called.
444
445- dhcp_start(struct netif *netif)
446
447  Creates a new DHCP client for this interface on the first call.
448  Note: you must call dhcp_fine_tmr() and dhcp_coarse_tmr() at
449  the predefined regular intervals after starting the client.
450
451  You can peek in the netif->dhcp struct for the actual DHCP status.
452
453
454--- Optimalization hints
455
456The first thing you want to optimize is the lwip_standard_checksum()
457routine from src/core/inet.c. You can override this standard
458function with the #define LWIP_CHKSUM <your_checksum_routine>.
459
460There are C examples given in inet.c or you might want to
461craft an assembly function for this. RFC1071 is a good
462introduction to this subject.
463
464Other significant improvements can be made by supplying
465assembly or inline replacements for htons() and htonl()
466if you're using a little-endian architecture.
467#define LWIP_PLATFORM_BYTESWAP 1
468#define LWIP_PLATFORM_HTONS(x) <your_htons>
469#define LWIP_PLATFORM_HTONL(x) <your_htonl>
470
471Check your network interface driver if it reads at
472a higher speed than the maximum wire-speed. If the
473hardware isn't serviced frequently and fast enough
474buffer overflows are likely to occur.
475
476E.g. when using the cs8900 driver, call cs8900if_service(ethif)
477as frequently as possible. When using an RTOS let the cs8900 interrupt
478wake a high priority task that services your driver using a binary
479semaphore or event flag. Some drivers might allow additional tuning
480to match your application and network.
481
482For a production release it is recommended to set LWIP_STATS to 0.
483Note that speed performance isn't influenced much by simply setting
484high values to the memory options.
485
486For more optimization hints take a look at the lwIP wiki.
487
488--- Zero-copy MACs
489
490To achieve zero-copy on transmit, the data passed to the raw API must
491remain unchanged until sent. Because the send- (or write-)functions return
492when the packets have been enqueued for sending, data must be kept stable
493after that, too.
494
495This implies that PBUF_RAM/PBUF_POOL pbufs passed to raw-API send functions
496must *not* be reused by the application unless their ref-count is 1.
497
498For no-copy pbufs (PBUF_ROM/PBUF_REF), data must be kept unchanged, too,
499but the stack/driver will/must copy PBUF_REF'ed data when enqueueing, while
500PBUF_ROM-pbufs are just enqueued (as ROM-data is expected to never change).
501
502Also, data passed to tcp_write without the copy-flag must not be changed!
503
504Therefore, be careful which type of PBUF you use and if you copy TCP data
505or not!
506