High signal post-training datasets and tool-rich configurable sandboxes for building production grade AI agents