Exim's Interfaces To Mail Filtering
Exim's Interfaces To Mail Filtering
1. Forwarding and ltering in Exim ............................................................................................ 1 1.1 1.2 1.3 1.4 1.5 1.6 Introduction .............................................................................................................................. Filter operation ........................................................................................................................ Testing a new lter le ........................................................................................................... Installing a lter le ................................................................................................................ Testing an installed lter le .................................................................................................. Details of ltering commands ............................................................................................... 1 1 1 2 3 3
2. Sieve lter les ............................................................................................................................ 4 2.1 Recognition of Sieve lters ................................................................................................... 2.2 Saving to specied folders .................................................................................................... 2.3 Strings containing header names ........................................................................................ 2.4 Exists test with empty list of headers .................................................................................. 2.5 Header test with invalid MIME encoding in header ........................................................... 2.6 Address test for multiple addresses per header ................................................................ 2.7 Semantics of keep .................................................................................................................. 2.8 Semantics of leinto ............................................................................................................... 2.9 Semantics of redirect ............................................................................................................. 2.10 String arguments .................................................................................................................. 2.11 Number units ........................................................................................................................ 2.12 RFC compliance ................................................................................................................... 4 4 4 4 4 5 5 5 5 5 5 5
3. Exim lter les ............................................................................................................................. 7 3.1 Format of Exim lter les ....................................................................................................... 7 3.2 Data values in lter commands ............................................................................................ 7 3.3 String expansion ..................................................................................................................... 7 3.4 Some useful general variables ............................................................................................. 8 3.5 Header variables .................................................................................................................... 9 3.6 User variables ....................................................................................................................... 10 3.7 Current directory .................................................................................................................. 10 3.8 Signicant deliveries ............................................................................................................ 10 3.9 Filter commands ................................................................................................................... 10 3.10 The add command ............................................................................................................. 11 3.11 The deliver command ........................................................................................................ 11 3.12 The save command ........................................................................................................... 11 3.13 The pipe command ............................................................................................................ 12 3.14 Mail commands .................................................................................................................. 14 3.15 Logging commands ........................................................................................................... 16 3.16 The nish command .......................................................................................................... 16 3.17 The testprint command ..................................................................................................... 16 3.18 The fail command .............................................................................................................. 16 3.19 The freeze command ........................................................................................................ 16 3.20 The headers command ..................................................................................................... 17 3.21 Obeying commands conditionally ................................................................................... 17 3.22 String testing conditions .................................................................................................... 17 3.23 Numeric testing conditions ............................................................................................... 19 3.24 Testing for signicant deliveries ....................................................................................... 19 3.25 Testing for error messages ............................................................................................... 19 3.26 Testing a list of addresses ................................................................................................ 19 3.27 Testing for personal mail ................................................................................................... 20 3.28 Alias addresses for the personal condition .................................................................... 21 3.29 Details of the personal condition ..................................................................................... 21 i
Testing delivery status ....................................................................................................... Multiple personal mailboxes ............................................................................................. Ignoring delivery errors ..................................................................................................... Examples of Exim lter commands .................................................................................
21 22 22 22
ii
1.1 Introduction
Most Unix mail transfer agents (programs that deliver mail) permit individual users to specify automatic forwarding of their mail, usually by placing a list of forwarding addresses in a le called .forward in their home directories. Exim extends this facility by allowing the forwarding instructions to be a set of rules rather than just a list of addresses, in effect providing .forward with conditions. Operating the set of rules is called ltering, and the le that contains them is called a lter le. Exim supports two different kinds of lter le. An Exim lter contains instructions in a format that is unique to Exim. A Sieve lter contains instructions in the Sieve format that is dened by RFC 3028. As this is a standard format, Sieve lter les may already be familiar to some users. Sieve les should also be portable between different environments. However, the Exim ltering facility contains more features (such as variable expansion), and better integration with the host environment (such as the use of external processes and pipes). The choice of which kind of lter to use can be left to the end-user, provided that the system administrator has congured Exim appropriately for both kinds of lter. However, if interoperability is important, Sieve is the only choice. The ability to use ltering or traditional forwarding has to be enabled by the system administrator, and some of the individual facilities can be separately enabled or disabled. A local document should be provided to describe exactly what has been enabled. In the absence of this, consult your system administrator. This document describes how to use a lter le and the format of its contents. It is intended for use by end-users. Both Sieve lters and Exim lters are covered. However, for Sieve lters, only issues that relate to the Exim implementation are discussed, since Sieve itself is described elsewhere. The contents of traditional .forward les are not described here. They normally contain just a list of addresses, le names, or pipe commands, separated by commas or newlines, but other types of item are also available. The full details can be found in the chapter on the redirect router in the Exim specication, which also describes how the system administrator can set up and control the use of ltering.
/usr/sbin/sendmail -bf myfilter <test-message The -bf option tells Exim that the following item on the command line is the name of a lter le that is to be tested. There is also a -bF option, which is similar, but which is used for testing system lter les, as opposed to user lter les, and which is therefore of use only to the system administrator. The test message is supplied on the standard input. If there are no message-dependent tests in the lter, an empty le (/dev/null) can be used. A supplied message must start with header lines or the From message separator line that is found in many multi-message folder les. Note that blank lines at the start terminate the header lines. A warning is given if no header lines are read. The result of running this command, provided no errors are detected in the lter le, is a list of the actions that Exim would try to take if presented with the message for real. For example, for an Exim lter, the output Deliver message to: [email protected] Save message to: /home/lemuel/mail/archive means that one copy of the message would be sent to [email protected], and another would be added to the le /home/lemuel/mail/archive, if all went well. The actions themselves are not attempted while testing a lter le in this way; there is no check, for example, that any forwarding addresses are valid. For an Exim lter, if you want to know why a particular action is being taken, add the -v option to the command. This causes Exim to output the results of any conditional tests and to indent its output according to the depth of nesting of if commands. Further additional output from a lter test can be generated by the testprint command, which is described below. When Exim is outputting a list of the actions it would take, if any text strings are included in the output, non-printing characters therein are converted to escape sequences. In particular, if any text string contains a newline character, this is shown as \n in the testing output. When testing a lter in this way, Exim makes up an envelope for the message. The recipient is by default the user running the command, and so is the sender, but the command can be run with the -f option to supply a different sender. For example, /usr/sbin/sendmail -bf myfilter \ -f [email protected] <test-message Alternatively, if the -f option is not used, but the rst line of the supplied message is a From separator from a message folder le (not the same thing as a From: header line), the sender is taken from there. If -f is present, the contents of any From line are ignored. The return path is the same as the envelope sender, unless the message contains a Return-path: header, in which case it is taken from there. You need not worry about any of this unless you want to test out features of a lter le that rely on the sender address or the return path. It is possible to change the envelope recipient by specifying further options. The -bfd option changes the domain of the recipient address, while the -bfl option changes the local part, that is, the part before the @ sign. An adviser could make use of these to test someone elses lter le. The -bfp and -bfs options specify the prex or sufx for the local part. These are relevant only when support for multiple personal mailboxes is implemented; see the description in section 3.31 below.
Exim violates RFC 2822, section 3.6.8, by accepting 8-bit header names, so this implementation repeats this violation to stay consistent with Exim. This is in preparation for UTF-8 data. Sieve scripts cannot contain NUL characters in strings, but mail headers could contain MIME encoded NUL characters, which could never be matched by Sieve scripts using exact comparisons. For that reason, this implementation extends the Sieve quoted string syntax with \0 to describe a NUL character, violating \0 being the same as 0 in RFC 3028. Even without using \0, the following tests are all true in this implementation. Implementations that use C-style strings will only evaluate the rst test as true. Subject: =?iso-8859-1?q?abc=00def header :contains "Subject" ["abc"] header :contains "Subject" ["def"] header :matches "Subject" ["abc?def"] Note that by considering Sieve to be an MUA, RFC 2047 can be interpreted in a way that NUL characters truncating strings is allowed for Sieve implementations, although not recommended. It is further allowed to use encoded NUL characters in headers, but thats not recommended either. The above example shows why. RFC 3028 states that if an implementation fails to convert a character set to UTF-8, two strings cannot be equal if one contains octets greater than 127. Assuming that all unknown character sets are one-byte character sets with the lower 128 octets being US-ASCII is not sound, so this implementation violates RFC 3028 and treats such MIME words literally. That way at least something could be matched. The folder specied by leinto must not contain the character sequence .. to avoid security problems. RFC 3028 does not specify the syntax of folders apart from keep being equivalent to fileinto "INBOX"; This implementation uses inbox instead. Sieve script errors currently cause messages to be silently led into inbox. RFC 3028 requires that the user is notied of that condition. This may be implemented in the future by adding a header line to mails that are led into inbox due to an error in the lter.
everything that Exim can do with strings, you should consult the chapter on string expansion in the Exim documentation. In lter les, by far the most common use of string expansion is the substitution of the contents of a variable. For example, the substring $reply_address is replaced by the address to which replies to the message should be sent. If such a variable name is followed by a letter or digit or underscore, it must be enclosed in curly brackets (braces), for example, ${reply_address} If a $ character is actually required in an expanded string, it must be escaped with a backslash, and because backslash is also an escape character in quoted input strings, it must be doubled in that case. The following two examples illustrate two different ways of testing for a $ character in a message: if $message_body contains \$ then ... if $message_body contains "\\$" then ... You can prevent part of a string from being expanded by enclosing it between two occurrences of \N. For example, if $message_body contains \N$$$$\N then ... tests for a run of four dollar characters.
$original_local_part: When an address that arrived with the message is being processed, this contains the same value as the variable $local_part. However, if an address generated by an alias, forward, or lter le is being processed, this variable contains the local part of the original address. $reply_address: The contents of the Reply-to: header, if the message has one; otherwise the contents of the From: header. It is the address to which normal replies to the message should be sent. $return_path: The return path that is, the sender eld that will be transmitted as part of the messages envelope if the message is sent to another host. This is the address to which delivery errors are sent. In many cases, this variable has the same value as $sender_address, but if, for example, an incoming message to a mailing list has been expanded, $return_path may have been changed to contain the address of the list maintainer. $sender_address: The sender address that was received in the envelope of the message. This is not necessarily the same as the contents of the From: or Sender: header lines. For delivery error messages (bounce messages) there is no sender address, and this variable is empty. $tod_full: A full version of the time and date, for example: Wed, 18 Oct 1995 09:51:40 +0100. The timezone is always given as a numerical offset from GMT. $tod_log: The time and date in the format used for writing Exims log les, without the timezone, for example: 1995-10-12 15:32:29. $tod_zone: The local timezone offset, for example: +0100.
If the message does not contain a header of the given name, an empty string is substituted. Thus it is important to spell the names of headers correctly. Do not use $header_Reply_to when you really mean $header_Reply-to.
pipe to a command (section 3.13) save to a le (section 3.12) print while testing (section 3.17) tailored form of mail (section 3.14)
The headers command has additional parameters that can be used only in a system lter. The fail and freeze commands are available only when Exims ltering facilities are being used as a system lter, and are therefore usable only by the system administrator and not by ordinary users. They are mentioned only briefly in this document; for more information, see the main Exim specication.
If the le name does not start with a / character, the contents of the $home variable are prepended, unless it is empty, or the system administrator has disabled this feature. In conventional congurations, this variable is normally set in a user lter to the users home directory, but the system administrator may set it to some other path. In some congurations, $home may be unset, or prepending may be disabled, in which case a non-absolute path name may be generated. Such congurations convert this to an absolute path when the delivery takes place. In a system lter, $home is never set. The user must of course have permission to write to the le, and the writing of the le takes place in a process that is running as the user, under the users primary group. Any secondary groups to which the user may belong are not normally taken into account, though the system administrator can congure Exim to set them up. In addition, the ability to use this command at all is controlled by the system administrator it may be forbidden on some systems. An optional mode value may be given after the le name. The value for the mode is interpreted as an octal number, even if it does not begin with a zero. For example: save /some/folder 640 This makes it possible for users to override the system-wide mode setting for le deliveries, which is normally 600. If an existing le does not have the correct mode, it is changed. An alternative form of delivery may be enabled on your system, in which each message is delivered into a new le in a given directory. If this is the case, this functionality can be requested by giving the directory name terminated by a slash after the save command, for example save separated/messages/ There are several different formats for such deliveries; check with your system administrator or local documentation to nd out which (if any) are available on your system. If this functionality is not enabled, the use of a path name ending in a slash causes an error.
Documentation for some programs that are normally run via this kind of pipe often suggest that the command should start with IFS=" " This is a shell command, and should not be present in Exim lter les, since it does not normally run the command under a shell. However, there is an option that the administrator can set to cause a shell to be used. In this case, the entire command is expanded as a single string and passed to the shell for interpretation. It is recommended that this be avoided if at all possible, since it can lead to problems when inserted variables contain shell metacharacters. The default PATH set up for the command is determined by the system administrator, usually containing at least /bin and /usr/bin so that common commands are available without having to specify an absolute le name. However, it is possible for the system administrator to restrict the pipe facility so that the command name must not contain any / characters, and must be found in one of the directories in the congured PATH. It is also possible for the system administrator to lock out the use of the pipe command altogether. When the command is run, a number of environment variables are set up. The complete list for pipe deliveries may be found in the Exim reference manual. Those that may be useful for pipe deliveries from user lter les are: DOMAIN HOME LOCAL_PART LOCAL_PART_PREFIX LOCAL_PART_SUFFIX LOGNAME MESSAGE_ID PATH RECIPIENT SENDER SHELL USER the domain of the address your home directory see below see below see below your login name the unique id of the message the command search path the complete recipient address the sender of the message /bin/sh see below
LOCAL_PART, LOGNAME, and USER are all set to the same value, namely, your login id. LOCAL_PART_PREFIX and LOCAL_PART_SUFFIX may be set if Exim is congured to recognize prexes or sufxes in the local parts of addresses. For example, a message addressed to [email protected] may cause the lter for user pat to be run. If this sets up a pipe delivery, LOCAL_PART_SUFFIX is -suf2 when the pipe command runs. The system administrator has to congure Exim specially for this feature to be available. If you run a command that is a shell script, be very careful in your use of data from the incoming message in the commands in your script. RFC 2822 is very generous in the characters that are permitted to appear in mail addresses, and in particular, an address may begin with a vertical bar or a slash. For this reason you should always use quotes round any arguments that involve data from the message, like this: /some/command '$SENDER' so that inserted shell meta-characters do not cause unwanted effects. Remember that, as was explained earlier, the pipe command is not run at the time the lter le is interpreted. The lter just denes what deliveries are required for one particular addressee of a message. The deliveries themselves happen later, once Exim has decided everything that needs to be done for the message. A consequence of this is that you cannot inspect the return code from the pipe command from within the lter. Nevertheless, the code returned by the command is important, because Exim uses it to decide whether the delivery has succeeded or failed. 13 Exim lter les
The command should return a zero completion code if all has gone well. Most non-zero codes are treated by Exim as indicating a failure of the pipe. This is treated as a delivery failure, causing the message to be returned to its sender. However, there are some completion codes that are treated as temporary errors. The message remains on Exims spool disk, and the delivery is tried again later, though it will ultimately time out if the delivery failures go on too long. The completion codes to which this applies can be specied by the system administrator; the default values are 73 and 75. The pipe command should not normally write anything to its standard output or standard error le descriptors. If it does, whatever is written is normally returned to the sender of the message as a delivery error, though this action can be varied by the system administrator.
from mailing lists or elsewhere. Sending an automatic response to a mailing list or a mailing list manager is an Internet Sin. For both commands, the key/value argument pairs can appear in any order. At least one of text or le must appear (except with vacation, where there is a default for le); if both are present, the text string appears rst in the message. If expand precedes le, each line of the le is subject to string expansion before it is included in the message. Several lines of text can be supplied to text by including the escape sequence \n in the string wherever a newline is required. If the command is output during lter le testing, newlines in the text are shown as \n. Note that the keyword for creating a Reply-To: header is reply_to, because Exim keywords may contain underscores, but not hyphens. If the from keyword is present and the given address does not match the user who owns the forward le, Exim normally adds a Sender: header to the message, though it can be congured not to do this. The extra_headers keyword allows you to add custom header lines to the message. The text supplied must be one or more syntactically valid RFC 2822 header lines. You can use \n within quoted text to specify newlines between headers, and also to dene continued header lines. For example: extra_headers "h1: first\nh2: second\n continued\nh3: third" No newline should appear at the end of the nal header line. If no to argument appears, the message is sent to the address in the $reply_address variable (see section 3.3 above). An In-Reply-To: header is automatically included in the created message, giving a reference to the message identication of the incoming message. If return message is specied, the incoming message that caused the lter le to be run is added to the end of the message, subject to a maximum size limitation. If a log le is specied, a line is added to it for each message sent. If a once le is specied, it is used to hold a database for remembering who has received a message, and no more than one message is ever sent to any particular address, unless once_repeat is set. This species a time interval after which another copy of the message is sent. The interval is specied as a sequence of numbers, each followed by the initial letter of one of seconds, minutes, hours, days, or weeks. For example, once_repeat 5d4h causes a new message to be sent if at least 5 days and 4 hours have elapsed since the last one was sent. There must be no white space in a time interval. Commonly, the le name specied for once is used as the base name for direct-access (DBM) le operations. There are a number of different DBM libraries in existence. Some operating systems provide one as a default, but even in this case a different one may have been used when building Exim. With some DBM libraries, specifying once results in two les being created, with the sufxes .dir and .pag being added to the given name. With some others a single le with the sufx .db is used, or the name is used unchanged. Using a DBM le for implementing the once feature means that the le grows as large as necessary. This is not usually a problem, but some system administrators want to put a limit on it. The facility can be congured not to use a DBM le, but instead, to use a regular le with a maximum size. The data in such a le is searched sequentially, and if the le lls up, the oldest entry is deleted to make way for a new one. This means that some correspondents may receive a second copy of the message after an unpredictable interval. Consult your local information to see if your system is congured this way. More than one mail or vacation command may be obeyed in a single lter run; they are all honoured, even when they are to the same recipient. 15 Exim lter les
istrator, and not enabled for use by ordinary users, it is described in more detail in the main Exim specication rather than in this document.
There may be any number of elif and then sections (including none) and the else section is also optional. Any number of commands, including nested if commands, may appear in any of the <commands> sections. Conditions can be combined by using the words and and or, and round brackets (parentheses) can be used to specify how several conditions are to combine. Without brackets, and is more binding than or. For example: if $h_subject: contains "Make money" or $h_precedence: is "junk" or ($h_sender: matches ^\\d{8}@ and not personal) or $message_body contains "this is not spam" then seen finish endif A condition can be preceded by not to negate it, and there are also some negative forms of condition that are more English-like.
<text1> begins <text2> <text1> does not begin <text2> e.g. $header_from: begins "Friend@" A begins test checks for the presence of the second string at the start of the rst, both strings having been expanded. <text1> ends <text2> <text1> does not end <text2> e.g. $header_from: ends "public.com.example" An ends test checks for the presence of the second string at the end of the rst, both strings having been expanded. <text1> is <text2> <text1> is not <text2> e.g. $local_part_suffix is "-foo" An is test does an exact match between the strings, having rst expanded both strings. <text1> contains <text2> <text1> does not contain <text2> e.g. $header_subject: contains "evolution" A contains test does a partial string match, having expanded both strings. <text1> matches <text2> <text1> does not match <text2> e.g. $sender_address matches "(bill|john)@" For a matches test, after expansion of both strings, the second one is interpreted as a regular expression. Exim uses the PCRE regular expression library, which provides regular expressions that are compatible with Perl. The match succeeds if the regular expression matches any part of the rst string. If you want a regular expression to match only at the start or end of the subject string, you must encode that requirement explicitly, using the ^ or $ metacharacters. The above example, which is not so constrained, matches all these addresses: [email protected] [email protected] [email protected] [email protected] To match only the rst two, you could use this: if $sender_address matches "^(bill|john)@" then ... Care must be taken if you need a backslash in a regular expression, because backslashes are interpreted as escape characters both by the string expansion code and by Exims normal processing of strings in quotes. For example, if you want to test the sender address for a domain ending in .com the regular expression is \.com$ The backslash and dollar sign in that expression have to be escaped when used in a lter command, as otherwise they would be interpreted by the expansion code. Thus, what you actually write is if $sender_address matches \\.com\$ An alternative way of handling this is to make use of the \N expansion flag for suppressing expansion: if $sender_address matches \N\.com$\N Everything between the two occurrences of \N is copied without change by the string expander (and in fact you do not need the nal one, because it is at the end of the string). If the regular expression is given in quotes (mandatory only if it contains white space) you have to write either 18 Exim lter les
if $sender_address matches "\\\\.com\\$" or if $sender_address matches "\\N\\.com$\\N" If the regular expression contains bracketed sub-expressions, numeric variable substitutions such as $1 can be used in the subsequent actions after a successful match. If the match fails, the values of the numeric variables remain unchanged. Previous values are not restored after endif. In other words, only one set of values is ever available. If the condition contains several sub-conditions connected by and or or, it is the strings extracted from the last successful match that are available in subsequent actions. Numeric variables from any one sub-condition are also available for use in subsequent sub-conditions, because string expansion of a condition occurs just before it is tested.
B.Simpson <[email protected]>, [email protected] (his sister) then $thisaddress would take on the values [email protected] and [email protected] in turn. If there are no valid addresses in the list, the whole condition is false. If the internal condition is true for any one address, the overall condition is true and the loop ends. If the internal condition is false for all addresses in the list, the overall condition is false. This example tests for the presence of an eight-digit local part in any address in a To: header: if foranyaddress $h_to: ( $thisaddress matches ^\\d{8}@ ) then ... When the overall condition is true, the value of $thisaddress in the commands that follow then is the last value it took on inside the loop. At the end of the if command, the value of $thisaddress is reset to what it was before. It is best to avoid the use of multiple occurrences of foranyaddress, nested or otherwise, in a single if command, if the value of $thisaddress is to be used afterwards, because it isnt always clear what the value will be. Nested if commands should be used instead. Header lines can be joined together if a check is to be applied to more than one of them. For example: if foranyaddress $h_to:,$h_cc: .... This scans through the addresses in both the To: and the Cc: headers.
delivery process successfully terminates; if there is a crash or a power failure (for example), the next delivery attempt is also a rst delivery. In a user lter le rst_delivery will be false if there was previously an error in the lter, or if a delivery for the user failed owing to, for example, a quota error, or if forwarding to a remote address was deferred for some reason. The condition manually_thawed is true if the message was frozen for some reason, and was subsequently released by the system administrator. It is unlikely to be of use in users lter les.
then save $home/mail/f+e endif Save all non-urgent messages by weekday: # Exim filter if $header_subject: does not contain "urgent" and $tod_full matches "^(...)," then save $home/mail/$1 endif Throw away all mail from one site, except from postmaster: # Exim filter if $reply_address contains "@spam.site.example" and $reply_address does not contain "postmaster@" then seen finish endif Handle multiple personal mailboxes: # Exim filter if $local_part_suffix is "-foo" then save $home/mail/foo elif $local_part_suffix is "-bar" then save $home/mail/bar endif
23