Show HN: I Parallelized RNN Training from O(T) to O(log T) Using CUDA