Using tcmalloc could consume more heap. Enlarging kernel heap allocation can fix this. Also print result for Python test.
Currently, only a subset of tests are enabled. Other tests will be enabled in the future.