So I was right. Next step will be lowering that to 22nm or less.
Not anytime soon. Also process node alone doesn't determine efficiency. The 28nm chips by Cointerra and HashFast (just specs not yet delivered yet) have roughly 50% higher efficiency (~0.8 J/GH) than the first 28nm chip by KNC (~1.2J/GH). Bitfury 55nm design is ~ 1 J/GH. All these numbers are measured at the wall, companies often market the chip only efficiency but total power conusmption for a rig is higher.
Still the rate of improvement is going to slow dramatically. In the span of less than year we went from GPU (330 J/GH) to high process node chips (~8 J/GH) and if Cointerra or HashFast deliver before the end of the year we are looking at <0.8 J/GH.
No foundry is offering 22nm (only lines at 22nm are internal fabs like Samsung and Intel) instead all the foundries (TSMC, Global Foundries, UMC) are going to 20nm and then 16nm. However it is simply getting hard and harder to maintain Moore's law. Economical 20nm chips are probably not going to happen until at least 2015 and 16nm seems doubtful before 2016 (IMHO 2017 seems more likely). Neither provide the "massive" efficiency gains we have seen this year. TSMC marketing (real world tends to be lower than marketing claims) on 20nm, is EITHER a 30% higher clock or 20% better efficiency at the same clock so if companies opt for the high performance route (faster chips) it wouldn't offer any efficiency improvement at all, and if they keeps clocks the same (no faster performance) we are still only looking at 20% lower power consumption. 16nm looks to offer up to 30% higher clocks AND 30% higher efficiency but it is even further out.
Summary
GPU ~330 J/GH Jan 2013
First ASIC Miners ~8 J/GH March 2013
Best available today ~1 J/GH Aug 2013
28nm designs in production ~0.8 J/GH (spec) Dec 2013
20nm higher clock speed ~0.8 J/GH (est) ?2015
20nm same clock speed ~0.6 J/GH (est) ?2015
16nm in theory ~0.5 J/GH (est) ?2016-2017
As you can see the efficiency gains starts getting smaller pretty quick dropping real quick and the timelines start to get pushed out. When you consider that in theory we might not have chips offering better than double current efficiency for more than a couple years and we went from 330 J/GH to 0.8 J/GH in a single year it kinda shows that in the short term how much of the potential efficiency has already been realized. The prior poster didn't say it will never be better than 28nm simply that it is probably as good as it get for a while and it is a sentiment I agree with.