prefer NATIVE implementations over SIMDE_VECTOR_SUBSCRIPT_OPS, remove some broken optimized implementations
Commit: a5c494c ↗ by mr-c