What is rational behind this design,
The only thing the timestamp is intended to be used for is calculating the new proof-of-work difficulty every 2016 blocks. A variation in timestamps of 120 minutes or so isn't a problem for that use case, and it means that the decentralized peer-to-peer system doesn't need a centralized source of time that everyone can agree on.
This could lead to cases where timestamp of block n < timestamp of block n-1
Yes. It happens all the time.