[compiler][amd64] Some more optimizations and changes Added branch_check_threads_etc calls that went missing. Also changed how branch_check_threads_etc is called, the code now maintains a counter on the C-stack, if adding 1 to it (as a signed byte) causes it to overflow the C-function is called, after adding 128 to the in-memory counter. This saves rather a lot of calls. Inlined F_{DUMB_,}RETURN, F_BRANCH_WHEN_{EQ,NE} F_ADD_NEG_INT, F_ADD_INT and F_ADD_INTS.