apparmor

mirror of https://gitlab.com/apparmor/apparmor.git synced 2025-03-04 16:35:02 +01:00

Author	SHA1	Message	Date
John Johansen	35d55fce81	Move state label, nodes, and permission setting into the State constructor Signed-off-by: John Johansen <john.johansen@canonical.com>	2010-11-11 16:14:12 -08:00
John Johansen	5578299445	Group dfa stats into a single structure. Move the dfa stats into a structure to provide a single access point to them. Signed-off-by: John Johansen <john.johansen@canonical.com>	2010-11-11 16:12:50 -08:00
John Johansen	99a7991664	Rename the match_count variable to duplicates The match_count variable is a sum of the number of duplicates node sets that have been encountered and discarded. Rename it to better reflect what it is doing. Signed-off-by: John Johansen <john.johansen@canonical.com>	2010-11-11 16:09:05 -08:00
John Johansen	15567a55dc	Embedded the temporary computed nodes as part of the state Embedding the nodes are part of the state gives fast back reference from the state to the nodes that created it. This is useful for the state to nodes mapping dump as it lets us output the states in order. It will also let us avoid certain nodemap lookup in the future. Overlay the nodes field (used only in dfa construction) with the partition field which is only used during dfa minimization to avoid making the state any larger. Signed-off-by: John Johansen <john.johansen@canonical.com>	2010-11-11 16:08:02 -08:00
John Johansen	5b68e0f7c4	Fix comment about what state information is being dumped Signed-off-by: John Johansen <john.johansen@canonical.com>	2010-11-11 16:06:52 -08:00
Kees Cook	eaa6a3c297	This cleans up a number of warnings that appeared after the parser rework commits were made (as well as a few other minor warnings elsewhere). The Makefile change is to avoid passing -Wstrict-prototypes and -Wnested-externs to the C++ compiler, which the compiler yells about and then ignores. Since we compile with -Wmissing-field-initializers I dropped the unreferenced zero-width fields in the header structs, and then explicitly initialized the remaining fields. I tagged several unused function parameters to silence those warnings. And finally, I dropped the unused filter_escapes() too.	2010-11-09 13:39:18 -08:00
John Johansen	d53bb7f811	Embedded the State to partition mapping into the State. Embedding the the partition mapping into the State structure significantly speeds up dfa minimization, by converting rbtree finds to straight direct references when checking for same mappings. The overall time improvement is small but it can half the time spent in minimization.	2010-11-09 11:57:43 -08:00
John Johansen	29c6f7e3ac	Re-enable the ability to invoke remove-unreachable-states. Now that removing unreachable states is not on by default re-enable the ability to turn it on.	2010-11-09 11:56:58 -08:00
John Johansen	14e7d94701	Add ability to dump unique permission sets	2010-11-09 11:56:28 -08:00
John Johansen	318351376c	Add the ability to dump NodeSet to dfa state mapping	2010-11-09 11:55:40 -08:00
John Johansen	af8b3b84ef	Use nodemap.size() to label state node The nodemap.size() increases by one with each node added, every time we add a state we label it so this provides the proper labeling without needing a separate variable.	2010-11-09 11:55:05 -08:00
John Johansen	b64921a5ec	Add tracking of the node set (proto state) max, and average size	2010-11-09 11:54:20 -08:00
John Johansen	fae7cac15c	Rename trans-XXXX transition to compress- compression trans- isn't a very good name for this phase of compilation. It is the compression phase, rename to trans- to compress- to reflect this.	2010-11-09 11:49:18 -08:00
John Johansen	0ad84d93f9	Factor out expr tree rotation into its own function	2010-11-09 11:48:29 -08:00
John Johansen	ac9553de19	Rework tests against Epsnodes to compare to the singleton Dynamic casts are slower than plain comparisons so rework epsnode comparison to use comparisons to the singleton epsnode instead of dynamic_casts.	2010-11-09 11:47:37 -08:00
John Johansen	6801346b81	Add cnode class as a base class of all expr nodes that contain character info	2010-11-09 11:46:05 -08:00
John Johansen	04d6c727e1	Add a leafnode class to clearly indicate what node types are leaf nodes	2010-11-09 11:44:26 -08:00
John Johansen	aec77cecde	Move nodes around to put one child node together and two child nodes together	2010-11-09 11:38:20 -08:00
John Johansen	0f26d8f097	Further split up innernode, to be able to better identify the types of inner nodes. This is part of a serious of patches to cleanup expr nodes, by separating out functionality and reducing the number of dynamic casts.	2010-11-09 11:36:14 -08:00
John Johansen	cb2ebc3102	Rework the depth first traversal of expr trees, to remove the use of the unneeded visited table, and give a little speed up and cleanup.	2010-11-09 11:35:38 -08:00
John Johansen	d2581332db	This is part of a serious of patches to cleanup expr nodes, by separating out functionality and reducing the number of dynamic casts.	2010-11-09 11:34:59 -08:00
John Johansen	adb0973d61	Update Makefile to pass CFLAGS into libapparmor_re	2010-11-09 11:33:40 -08:00
John Johansen	7f987f93d1	As from a library pov they should be seperately callable fns, and this will help reduce peak memory usage in some cases. Also disbale remove_unreachable, as the current dfa code isn't generating unreachable states, and minimization removes any states that are connected but redundant.	2010-11-09 11:28:56 -08:00
John Johansen	c5fa0e98b3	Reference counting of Nodes exists to shared the special accept nodes that hold permission information. We currently keep them in a table with a refcount so that they don't go away, until we delete the table. We can simulate this by getting rid of the refcount, and making dup and release virtual, and overriding it for the special accept nodes.	2010-11-09 11:28:22 -08:00
John Johansen	a84844cea5	Do not use permission hashing for minimization by default. While this improves minimization performance, it can slow down total creation time and result in larger compressed dfas. This is because it results in the dfa not being completely minimized which with the current O(n2) dfa table compression algorithm can result in slower compressed dfa generation.	2010-11-09 11:27:36 -08:00
John Johansen	51f443c7b6	Update state progress/stats output to dump the number of accepting states/partitions occur in the minimized dfa.	2010-11-09 11:26:50 -08:00
John Johansen	c2601dbd30	Cleanup the perm_map as soon as it is no longer needed. Cleaning up the map before the end of the functions reduces the peak memory of the function	2010-11-09 11:26:18 -08:00
John Johansen	2fb64fa85e	When hashing Nodes ensure that cases.otherwise == NULL is treated the same as pointing to the nonmatching state. Having this mix shouldn't currently exist but adding the extra check makes the code more robust.	2010-11-09 11:25:44 -08:00
John Johansen	4e80416a4f	Do permission accumulation in dfa minimization. This is necessary if accept states with different permissions are to ever share a partition.	2010-11-09 11:24:51 -08:00
John Johansen	a949b075b4	The dfa flags currently are a weird mix of position and negative assertions. Its cleaner just to have them all assert one way and let the cmd line options apply them correctly.	2010-11-09 11:23:45 -08:00
John Johansen	36e99af7fb	Split dfa minimizing hashing into two seperately controllable hashes. The first hash does hashing on state just state transitions, which always results in a performance improvement. The second does hashing based off of accept permissions, which can create more initial states but can result in not being able to achieve a true minimum dfa. This can also lead to slowing down total dfa creation because while minimization, compression can take longer if the dfa isn't completely minimized. permission hashing is currently required, as minimization does not accumulate redundant Node permissions.	2010-11-09 11:22:54 -08:00
John Johansen	9b99039fdb	Convert Nodemap comparision to use a hash value. This uses a little more memory than just using the NodeSet size to short circuit comparison but it improves on the case where compared sets have the same size. It is possible that this will slow down small dfa generation slightly but the trade off for large dfa's (which are the slow ones to generate) is worth it. This results in another performance bump over using the NodeSize is NodeSet comparison, and the amount of improvement increases with larger dfas	2010-11-09 11:20:08 -08:00
John Johansen	344e11a539	Use set size as part of set comparison, short circuiting comparing sets of pointers when it isn't necessary. This results in a nice little performance increase in dfa creation. This is more of a proof of concept patch, and is replaced by the next patch which does better short circuiting via hashing	2010-11-09 11:18:46 -08:00
John Johansen	ca1d891799	This patch reworks the internal structures used to compute the dfa. It is on the large side, and I experimented with different ways to split this up but in the end, anything I could do would result in a series of dependent patches that would require all of them to be applied to get meaningful functional changes. The patch structural reworks the dfa so that - there is a new State class, it takes the place of sets of nodes in the dfa, and allows storing state information within the state - removes the dfa transition table, which mapped sets of nodes to a transition table, by moving the transition into the new state class - computes dfa state permissions once (stored in the state) - expression tree nodes are independent from a created dfa. This allows computed expression trees, and sets of Nodes (used as protostates when computing the dfa). To be managed independent of the dfa life time. This will allow reducing the amount of memory used, in the future, and will also allow separating the expression tree logic out into its own file. The patch has some effect on reducing peak memory usage, and computation time. The actual amount of reduction is dependent on the number of states in the dfa with larger saving being achieved on larger dfas. Eg. for the test evince profile I was using it makes the parser about 7% faster with a peak memory usage about 12% less. This patch changes the initial partition hashing of minimization resulting in slightly smaller dfas.	2010-11-09 11:14:55 -08:00
John Johansen	291066dcbd	On certain graphs the dfa graph dump output can become messed up as it isn't properly handling non-printing characters in the case of single character output. Drop the cast to signed character which messes up the output.	2010-08-17 08:02:27 -07:00
John Johansen	6259edac38	Update and expand comments on regex tree normalization	2010-08-04 10:23:22 -07:00
John Johansen	f0220611aa	Epsnodes carry no information beyond the node type. Convert to using a single static node, which will reduce allocations and peak memory use slightly.	2010-08-04 09:53:46 -07:00
John Johansen	4be07c3265	This adds a basic debug dump for the conversion of each rule in a profile to its expression tree. It is limited in that it doesn't currently handle the permissions of a rule. conversion output presents an aare -> prce conversion followed by 1 or more expression tree rules, governed by what the rule does. eg. aare: /** -> /[^/\x00][^\x00]* rule: /[^/\x00][^\x00]* -> /[^\0000/]([^\0000])* eg. echo "/foo { / rwlkmix, } " \| ./apparmor_parser -QT -D rule-exprs -D expr-tree aare: /foo -> /foo aare: / -> /[^/\x00][^\x00]* rule: /[^/\x00][^\x00]* -> /[^\0000/]([^\0000])* rule: /[^/\x00][^\x00]\x00/[^/]. -> /[^\0000/]([^\0000])\0000/[^/](.) DFA: Expression Tree (/[^\0000/]([^\0000])(((((((((((((<513>\|<2>)\|<4>)\|<8>)\|<16>)\|<32>)\|<64>)\|<8404992>)\|<32768>)\|<65536>)\|<131072>)\|<262144>)\|<524288>)\|<1048576>)\|/[^\0000/]([^\0000])\0000/[^/](.)((<16>\|<32>)\|<262144>)) This simple example shows many things 1. The profile name under goes pcre conversion. But since no regular expressions where found it doesn't generate any expr rules 2. /* is converted into the pcre expression /[^\0000/]([^\0000])* 3. The pcre expression /[^\0000/]([^\0000])* is converted into two rules that are then converted into expression trees. The reason for this can not be seen by the output as this is actually triggered by permissions separation for the rule. In this case the link permission is separated into what is shown as the second rule: statement. 4. DFA: Expression Tree dump shows how these rules are combined together You will notice that the rule conversion statement is fairly redundant currently as it just show pcre to expression tree pcre. This will change when direct aare parsing occurs, but currently serves to verify the pcre conversion step. It is not the prettiest patch, as its touching some ugly code that is schedule to be cleaned up/replaced. eg. convert_aaregex_to_pcre is going to replaced with native parse conversion from an aare straight to the expression tree, and dfaflag passing will become part of the rule set.	2010-07-23 13:29:35 +02:00
John Johansen	837f47c921	This is the user space fix for launchpad.net/busgs/599450 It changes the table resizing so that there is always sufficient high entries in the table, preventing bounds violations from occurring. Previously the resize allocation was always based on the character set range for a state, which could be more or less than actually required, and packing would waste some space when over allocation was done. As a result this patch in general results in slightly smaller transition tables even though it enforcing the minimum required padding to avoid bounds violations.	2010-07-23 04:30:31 +02:00
John Johansen	bfb96638f6	This is a preparatory patch for the fix to launchpad.net/bugs/599450. It combines the two separate table resize code segments into a single functionally equivalent segment. It does not fix the bug.	2010-07-23 04:29:54 +02:00
John Johansen	6453a41a28	Add extra transition table labeling to help with interpretation of the dump output.	2010-07-23 04:29:29 +02:00
John Johansen	af3476afb9	The templatization of deref_less_than is unnecessary and complicates the code replace it with its none templatized version.	2010-07-10 17:53:04 -07:00
John Johansen	4f8e01ff36	expression tree node labeling is used during debugging dumps. Currently the node labels are computed and stored in a map, that is not cleaned up. This means that the labeling is retained across different dfas. Move the labeling into expr node as this takes less memory than using a map and will also separates node labeling so its per dfa instead of global. In addition this means the labeling is cleanedup/freed when the expr tree is freed without any extra work.	2010-07-10 17:52:13 -07:00
John Johansen	d0dcab10f1	Make the transition table dump easier to understand by labeling each entry with its index.	2010-07-10 17:49:32 -07:00
John Johansen	1004f039ec	When creating the dfa the sets firstpos, lastpos, and followpos are computed for each expression tree node and then used as input to create the dfa states. Currently they are not being freed until the nodes are destroyed, but the information is no longer needed once the dfa has been created. Cleaning them up early reduces peak memory usage.	2010-07-10 17:47:25 -07:00
John Johansen	9efd526f6f	Fix memory leak during dfa minimization. Dfa minimization wasn't deleting the states it eliminated during the minimization process, and hence leaking memory.	2010-03-13 02:23:23 -08:00
John Johansen	8dd795dec1	Rework the partitioning to take advantage of Partitions now being a list	2010-01-31 23:21:00 -08:00
John Johansen	8bcfa1a32f	Move partitions from using sets to lists as this is a better match for what is being done.	2010-01-31 23:19:54 -08:00
John Johansen	e984b6ff74	Seperate Partition definition for States. This is a small step to cleaning up the code	2010-01-31 23:18:14 -08:00
John Johansen	1179c1a42c	Improve partitioning performance slightly by inserting new partitions imediately after the current partition being considered, instead of at the back of the parition list. This does two things, it makes it more likely the data is in cache, and it also in general results in more partitions being created in a single pass.	2010-01-31 23:12:33 -08:00

1 2

95 commits