This makes things logically more separate! :) As an aside...
I really hate the way golang does dependencies and packages. Yes, some
people insist on nesting their code deep into a $GOPATH, which is fine
if you're a google dev and are forced to work this way, but annoying for
the rest of the world. Your code shouldn't need a git commit to switch
to a different VCS host! Gah, I hate this so much.
All resources can now set a retry limit (-1 for infinite) and a delay
between retries. This applies to both the CheckApply methods and the
Watch methods. They each have their own separate counts, but use the
same input meta params, since I decided it wouldn't be useful to have a
separate watchRetry and watchDelay set of meta parameters.
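As a rough sketch, the shared meta params could be pictured like this
(field names and types are simplified, not the exact struct):

```go
// Sketch only: both CheckApply and Watch consult these same two values,
// but each keeps its own separate retry counter.
type MetaParams struct {
	Retry int16  // number of times to retry on error; -1 for infinite
	Delay uint64 // number of milliseconds to wait between retries
}
```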
In the process, we got rid of about 15 error cases which would normally
panic.
This patch required a slight overhaul of the Event system.
The previous commit is an earlier version of this patch which I decided
to leave in to "show my work" as I used to have to do in math class.
That earlier version is slightly more correct with respect to the
current event system; this version is less correct and has a few bugs,
but that is because the event system needs a massive overhaul, and once
that's done, this should all work properly for the corner cases.
This patch extends the --converged-timeout argument so that when used
with --remote it waits for the entire set of remote mgmt agents to
converge simultaneously before exiting.
purpleidea says: This particular part of the patch probably took as much
work as all of the work required for the initial remote patches alone!
This adds a new method of marking whether a particular UUID has
converged or not. You can now Start, Stop, or Reset a convergence timer
on the individual UUIDs. This wraps the existing SetConverged calls
with a hidden goroutine. It is not recommended to use the SetConverged
calls and the Timer calls on the same UUID.
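One way to picture that hidden goroutine (the channel-based shape and
the names here are illustrative assumptions, not the actual API):

```go
package converger

import "time"

// ConvergedUUID is a simplified stand-in for the per-UUID handle.
type ConvergedUUID interface {
	SetConverged(bool) error
}

// StartTimer marks u unconverged, then flips it to converged once the
// timeout elapses without a reset. A reset marks it unconverged again
// and restarts the countdown; stop ends the goroutine.
func StartTimer(u ConvergedUUID, timeout time.Duration) (reset, stop chan struct{}) {
	reset = make(chan struct{})
	stop = make(chan struct{})
	go func() { // the "hidden goroutine" wrapping SetConverged
		u.SetConverged(false)
		t := time.NewTimer(timeout)
		defer t.Stop()
		for {
			select {
			case <-t.C: // quiet for a full timeout: we have converged
				u.SetConverged(true)
			case <-reset: // activity: unconverged, restart the countdown
				u.SetConverged(false)
				if !t.Stop() {
					select { // drain the channel if it already fired
					case <-t.C:
					default:
					}
				}
				t.Reset(timeout)
			case <-stop:
				return
			}
		}
	}()
	return reset, stop
}
```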
This is a new mode to be used for bootstrapping mgmt clusters, or in
situations with tight operational restrictions.
This includes the basics; additional functionality will follow!
This monster patch embeds the etcd server. It took a good deal of
iterative work to tweak small details, and survived a rewrite from the
initial etcd v2 API implementation to the beta version of v3.
It has a notable race, and is missing some features, but it is ready for
git master and external developer consumption.
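For reference, embedding the etcd server with the upstream embed
package looks roughly like this (a minimal sketch using today's import
path; the actual wiring in mgmt targeted the v3 beta and differs):

```go
package main

import (
	"log"
	"time"

	"go.etcd.io/etcd/server/v3/embed"
)

func main() {
	cfg := embed.NewConfig()
	cfg.Dir = "/tmp/mgmt-etcd" // throw away data dir
	e, err := embed.StartEtcd(cfg)
	if err != nil {
		log.Fatal(err)
	}
	defer e.Close()
	select {
	case <-e.Server.ReadyNotify():
		log.Println("embedded etcd server is ready")
	case <-time.After(60 * time.Second):
		e.Server.Stop() // trigger a shutdown
		log.Fatal("embedded etcd server took too long to start")
	}
	log.Fatal(<-e.Err()) // block until the server errors out
}
```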
Puppet can now be used to generate the graph, on the basis of the
ffrank-mgmtgraph module.
There are three modes available:
* fetching catalogs from the master (--puppet agent)
* compiling a manifest from a local file (--puppet /path/to/file.pp)
* compiling a manifest from the cli (--puppet "<manifest>")
Catalogs from the master are currently never refreshed. We should
add some more code to re-run the parsing function at an interval
equal to Puppet's local 'runinterval' setting.
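That future bit would presumably be a simple ticker loop; a sketch,
where the compile callback and update channel are placeholders:

```go
package puppet

import (
	"log"
	"time"
)

// refreshLoop re-runs the catalog compilation every runinterval and
// sends each new graph down the update channel. Sketch only.
func refreshLoop(runinterval time.Duration, compile func() (interface{}, error), update chan<- interface{}) {
	ticker := time.NewTicker(runinterval)
	defer ticker.Stop()
	for range ticker.C {
		graph, err := compile() // re-run the parsing function
		if err != nil {
			log.Printf("puppet: catalog refresh failed: %v", err)
			continue // keep the old graph and try again later
		}
		update <- graph
	}
}
```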
There is also still a distinct lack of tests.
Still, this fixes #8
This also adds a cleaner exit for the inner main loop. I'm not sure if
it's absolutely needed, but this will give me more confidence that we
won't exit in the middle of some action.
Use `git show -w` to inspect this commit diff since
it changes whitespace due to `gofmt`.
This commit allows environment variables to be used in place of
CLI parameters to change program behavior. This supports the
notion of [12-factor apps](http://12factor.net/)
and makes it easier to dockerize the app, as well as to create
a systemd unit file.
It establishes a pattern of `MGMT_*` naming convention
for environment variables.
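A cli library like urfave/cli (née codegangsta/cli) supports this via
an EnvVar field on each flag; a minimal sketch, with the flag name here
as an example:

```go
package main

import (
	"log"
	"os"

	"github.com/urfave/cli" // the v1 API
)

func main() {
	app := cli.NewApp()
	app.Name = "mgmt"
	app.Flags = []cli.Flag{
		cli.StringFlag{
			Name:   "seeds",
			Usage:  "default etcd client endpoints",
			EnvVar: "MGMT_SEEDS", // the MGMT_* fallback for --seeds
		},
	}
	app.Action = func(c *cli.Context) error {
		log.Printf("seeds: %s", c.String("seeds"))
		return nil
	}
	if err := app.Run(os.Args); err != nil {
		log.Fatal(err)
	}
}
```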
The old converged detection was hacked into the code, instead of being
hidden behind a nice interface. This cleans it up, splits it into a
separate file, and removes a race condition that existed in the old
code.
We also take the time to get rid of the ugly Set* methods and replace
them all with a single AssociateData method. This might be unnecessary
if we can pass in the Converger at Resource construction.
Lastly, and most interesting, we suspend the individual timeout callers
when they've already converged, thus reducing unnecessary traffic, and
avoiding fast (eg: < 5 second) timers triggering more than once if they
stay converged!
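The rough shape of the result, as an illustrative sketch (the method
sets here are guesses, not the exact interfaces):

```go
// Converger is the converged detection, split out behind an interface.
type Converger interface {
	Register() ConvergerUUID // add a participant, get back its handle
	Converged() bool         // true iff every registered UUID converged
}

// ConvergerUUID is the per-participant handle that a resource holds.
type ConvergerUUID interface {
	SetConverged(bool) error
	Unregister()
}

// Res receives the converger through one call, replacing the old Set*
// methods. Passing it at construction could remove even this.
type Res interface {
	AssociateData(converger Converger)
}
```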
A quick note on theory for any future readers... What happens if we have
--converged-timeout=0 ? Well, for this and any other positive value,
it's important to realize that deciding if something is converged is
actually a race between if the converged timer will fire and if some
random new event will get triggered. This is because there is nothing
that can actually predict if or when a new event will happen (eg the
user modifying a file). As a result, a race is always inherent, and
actually not a negative or "incorrect" algorithm.
A future improvement could be to add a global lock to each resource, and
to lock all resources when computing if we are converged or not. In
practice, this hasn't been necessary. The worst case scenario would be
(in theory, because this hasn't been tested) if an event happens
*during* the converged calculation, and starts running, the exit command
then runs, and the event finishes, but it doesn't get a chance to notify
some service to restart. A lock could probably fix this theoretical
case.
This might not be fully correct, but it seems to be accurate so far. Of
particular note, the vertex order needs to be deterministic for this
algorithm, which a map can't provide, since golang intentionally
randomizes map iteration order. As a result, this also adds a sorted
version of GetVertices called GetVerticesSorted.
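A minimal sketch of the idea (types simplified; the real pgraph
version differs in the details):

```go
package pgraph

import "sort"

type Vertex struct {
	Name string
}

type Graph struct {
	adjacency map[*Vertex]map[*Vertex]struct{}
}

// GetVerticesSorted returns the vertices in deterministic (name
// sorted) order, since ranging over the adjacency map directly is
// intentionally randomized by the golang runtime.
func (g *Graph) GetVerticesSorted() []*Vertex {
	vertices := make([]*Vertex, 0, len(g.adjacency))
	for v := range g.adjacency {
		vertices = append(vertices, v)
	}
	sort.Slice(vertices, func(i, j int) bool {
		return vertices[i].Name < vertices[j].Name
	})
	return vertices
}
```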
Sorry for the size of this patch, I was busy hacking and plumbing away
and it got out of hand! I'm allowing this because there doesn't seem to
be anyone hacking away on parts of the code that this would break, since
the resource code is fairly stable in this change. In particular, it
revisits and refreshes some areas of the code that didn't see anything
new or innovative since the project first started. I've gotten rid of a
lot of cruft, and in particular cleaned up some things that I didn't
know how to do better before! Here's hoping I'll continue to learn and
have more to improve upon in the future! (Well let's not hope _too_ hard
though!)
The goal of this patch was to make logical grouping of resources
possible. For example, it might be more efficient to group three package
installations into a single transaction, instead of having to run three
separate transactions. This is because a package installation typically
has an initial one-time per run cost which shouldn't need to be
repeated.
Another future goal would be to group file resources sharing a common
base path under a common recursive fanotify watcher. Since this depends
on fanotify capabilities that aren't available yet, it hasn't been
implemented, but it could be a useful method of reducing the number of
separate watches needed, since there is a finite limit.
It's worth mentioning that grouping resources typically _reduces_ the
parallel execution capability of a particular graph, but depending on
the cost/benefit tradeoff, this might be preferential. I'd submit it's
almost universally beneficial for pkg resources.
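As an illustrative sketch, the grouping interface boils down to
resources answering questions like these (method names are mine, not
necessarily the patch's):

```go
// GroupableRes is a sketch of what a resource must answer in order to
// participate in autogrouping.
type GroupableRes interface {
	GroupCmp(other GroupableRes) bool  // could these two be grouped?
	GroupRes(other GroupableRes) error // merge other into this resource
	IsGrouped() bool                   // was this absorbed by another?
}
```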
This monster patch includes:
* the autogroup feature
* the grouping interface
* a placeholder algorithm
* an extensive test case infrastructure to test grouping algorithms
* a refactor which moves some base resource methods into pgraph
* some config/compile clean ups to remove code duplication
* b64 encoding/decoding improvements
* a rename of the yaml "res" entries to "kind" (more logical)
* some docs
* small fixes
* and more!
Naming the resources "type" was a stupid mistake, and is a huge source
of confusion when also talking about real types. Fix this before it gets
out of hand.
* Fixup graph state readability
* Rename original SetState() to SetConvergedState() and friends...
* Add type state management for proper BackPoke() commands...
* Add better DEBUG logging
This is an important optimization that prevents running a BackPoke on a
parent which is in the process of running and will most certainly poke
the caller back in a moment. This avoids unnecessary roundtrips.
Unfortunately, there are still other algorithms required so that races
can't cause the graph to run for longer than necessary.
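A rough sketch of the guard (the state tracking is simplified; the
real version lives in the poke/BackPoke machinery):

```go
package pgraph

// State tracks what a vertex is doing (simplified for this sketch).
type State int

const (
	StateNil     State = iota
	StateRunning // currently executing its CheckApply
)

type Vertex struct {
	State   State
	Parents []*Vertex // vertices with edges pointing at us
}

func (v *Vertex) Poke() { /* deliver a poke event; elided */ }

// BackPoke asks each parent to re-check itself, but skips any parent
// that is already running: it will poke us again when it finishes, so
// poking it now would only cause an unnecessary roundtrip.
func (v *Vertex) BackPoke() {
	for _, p := range v.Parents {
		if p.State == StateRunning {
			continue
		}
		p.Poke()
	}
}
```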
* Add exec type
* Switch erroneous use of fmt to log instead
* Check for edge existence for safety before using
* Avoid recalling etcd channel maker
* Clean up logging output
This is the third main feature of this system. The code needs a bunch of
polish, but it actually all works :)
I've tested this briefly with N <= 3.
Currently you have to build your own etcd cluster. It's quite easy, just
run `etcd` and it will be ready. I usually run it in a throw away /tmp/
dir so that I can blow away the stored data easily.
This requires graphviz to be installed on your machine. If you run the
command with sudo, it will create the files with the original user
ownership to make it easier to remove them without root.
This is still a dirty prototype, so please excuse the mess. Please
excuse the fact that this is a mega patch. Once things settle down this
won't happen any more.
Some of the changes squashed into here include:
* Merge vertex loop with type loop
(The file watcher seems to cache events anyways)
* Improve pgraph library
* Add indegree, outdegree, and topological sort with tests (sketch below)
* Add reverse function for vertex list
* Tons of additional cleanup!
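For a flavour of the pgraph additions, here's a minimal Kahn's
algorithm topological sort over a plain adjacency map (simplified
types; not the pgraph code itself):

```go
package pgraph

import "fmt"

// TopologicalSort orders the vertices so that every edge points
// forward in the result. It uses the indegree counts from above.
func TopologicalSort(adj map[string][]string) ([]string, error) {
	indegree := make(map[string]int)
	for v, outs := range adj {
		if _, ok := indegree[v]; !ok {
			indegree[v] = 0
		}
		for _, w := range outs {
			indegree[w]++
		}
	}
	var queue, order []string
	for v, d := range indegree {
		if d == 0 {
			queue = append(queue, v) // vertices with no parents
		}
	}
	for len(queue) > 0 {
		v := queue[0]
		queue = queue[1:]
		order = append(order, v)
		for _, w := range adj[v] {
			indegree[w]--
			if indegree[w] == 0 {
				queue = append(queue, w)
			}
		}
	}
	if len(order) != len(indegree) {
		return nil, fmt.Errorf("not a DAG: a cycle exists")
	}
	return order, nil
}
```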
Amazingly, on my first successful compile, this seemed to run!
A special thanks to Ira Cooper who helped me talk through some of the
algorithmic decisions and for his help in finding better ones!
This is a prototype that I'm attempting to "release early". Expect a lot
of changes! It is intended to be a config management tool that will:
* be event based
* execute actions in parallel
* function as a distributed system
There are a bunch more design ideas going into this, please stay tuned!