Poetry in Physics

Response to “The Creator”

foolish physicist — Mon, 16 Oct 2023 18:02:14 +0000

(Warning: spoilers about “The Creator”)

I haven’t been overly motivated to write in a long time, but maybe saying a little about this will break the dam. If you don’t like movie spoilers, beware. I will speak about the newish Gareth Edwards movie “The Creator.”

I have to say something because this movie has an alarmingly high IMDB rating of 7.1 and because I’ve seen people look favorably upon it on several sci-fi and entertainment-focused sites.

Here’s the elevator pitch of the movie: it’s the near future, 2065, and humanity is at war with AI. A lone soldier caught in the grief cycle over the loss of his wife fights to save a young robot who may be the savior of the machines or the destroyer of humanity.

I think that sums it up pretty well. You can almost see the thread that got this movie green-lit. Edwards has a bit of cred after Rogue One, which was a decent movie. And I will say that The Creator was a fairly pretty movie.

My response, however: I hated it! I thought it was among the most brain-dead, thoughtless movies I’ve ever seen. I am aghast it has fared so well on IMDB. Rotten Tomatoes: 67% on critics, 76% on audience. WTF???? How can that possibly be? I was very near to getting up and walking out in the last hour of that thing.

This movie was an incredible missed opportunity for someone to think deeply about AI and create a story reflecting that deep thought.

In the world of The Creator, we have nations divided into two factions. We have The Americanz and the Far East. The Americanz were atomic bombed a few years prior to the movie by a rogue AI and, as movie Americans typically do, they banned the crap out of it and went on a global war to annihilate all AI “simulants” as they’re called. The East, who happen to be a loose kludge of Buddhist Asians, who are apparently Thai, Japanese, Chinese, Indian and Arab, accept AI and have generally allowed it to take over. The AI, of course, is utterly peaceful and very Buddhist –yes, including simulant monks in mountain temples. As the movie goes on, the Buddhist AIs say explicitly that they only want to exist and be free while living in an apparently Palestinian Gaza homologue that looks either like the South China Sea, Singapore or Tibet. I have made a point to focus your eye on the religiosity of the tale because the movie opens with an opaque reference to the AI seeking “the creator” Nirmata and because of the apparent similarity between the idea of the child Dalai Lama and the deeply powerful AI child that the main character spends the whole movie dragging around.

A part of what made this movie hit so poorly for me is the recent situation that actually emerged from the Gaza Strip, where Hamas terrorists jumped the border into Israel and literally went on a killing and hostage-taking spree. I am somewhat sympathetic to the plight of the Palestinians, but my sympathy is limited by the fact that a subset of the Palestinians don’t seem to believe that they should coexist with Israel but should totally annihilate the Israelis. When the movie cast the plight of the downtrodden in a black-and-white, good vs. evil light, romanticizing that the downtrodden are always sympathetic victims, I simply could not separate out the fact that the real world Palestinians are rallied around this corrupt core that is simply violence incarnate which keeps making the case that it really should be eradicated. In the face of this, maybe some of the other bad taste this movie left in my mouth will seem to make contextual sense.

(I am left ruminating on an incident earlier last week where a Palestinian student group faced down against an Israeli student group near the student center. I watched the competing demonstrations and I remember thinking how it seemed to me as if nobody present was anything but hurt. Would they commiserate with each other if they realized that the one thing they truly had in common was the hurt?)

Sad that this movie entangled itself with my repulsion for certain real world events.

As is the case with movie Americans who have been 9/11ed, these movie Americanz have gone on to wage a literally genocidal campaign of cleansing against an otherwise peaceful opponent who seemingly has no choice but to fight back. It’s the same sort of black-white logical fallacy that generally makes humans so unpalatable in “Avatar.” Never mind that Gareth Edwards is a wealthy American who clearly wants to cast the downtrodden as sympathetic while making a movie obviously aimed at making money. I doubt he’ll donate his earnings to charity.

So, what is the value to AI in this movie? Clearly, it’s a Dalai Lama child with the ability to switch off and take control of the machines around her. If she lives, she will eventually grow strong enough to simply turn off all war, as the movie tells us. And so, the naughty White Americanz set off to kill her and of course need to use a Black man with confused loyalties in order to do it. Not sure they missed a minority…

Having summed up the movie and the variety of clangorous thematic threads it tries to wield, I will try to deconstruct exactly how badly this movie failed and offended my physicist sensibilities.

Let’s start with the Nomad. Very pretty special effect. Very stupid implementation.

The Nomad is the Americanz victory ship. If you’ve never played Starcraft, you may not know the concept of a Victory Armada. This one ship is a literal Death Star; the one ship that can rove around and destroy everything… planet scale with the Death Star, city scale with the nuke lobbing Nomad. The problem with a victory ship is that it’s the entire obvious linchpin of the Americanz war effort. If this one thing is destroyed, the war is over. If you haven’t been paying attention to the Ukraine war, you may not realize that the technology of smaller and smarter, the occurrence of autonomous drones, is currently making victory ships utterly impossible. Russia is barely willing to fly their precious Su-35 fighters over the conflict zone because small, lethal Ukrainian Stinger missiles are so efficient at taking them out. Did you realize that the star of the Ukraine war is a normal artillery piece using FPV drones as spotters and networked battle management software to target literally anything that is spotted in a distributed way? One Russian steps out of tree cover within range, the drone sees it and Ukrainian artillery rains hell on it, accurately, within five minutes. And then hell comes back just as quickly on the counter-battery fire! The Russian Black Sea Fleet has fled Crimea because of the threat of waterborne drones and long range Storm Shadow missiles. Against this background alone, the Nomad in 2065 seems out-of-touch and unreasonable. Where are the drones in this movie? Well, they run on two legs and blow themselves up like suicide bombers. It should be lost on no one that even the Americanz tanks are like victory ships… lose one and the war is over. And the AI simulants… they don’t use drones at all. They hide behind barriers like scared people hoping not to get shot.

If this technological stupidity were not enough, let me return to the subject of the Nomad (I seem to have been distracted by the other stupidity in this movie). The Nomad sort of appears as if it’s a singular orbital platform which can drop nukes on targets below it. First of all, if it is in orbit, how is it able to remain stationary with respect to the surface? Only one orbit can do this, and it’s geostationary orbit. Now, if the Nomad is in geostationary orbit, how is it able to move around over the surface? Geostationary orbit, which is actually relatively high, at several earth radii in altitude and specifically over the Earth’s equator, can’t do this. The problem with staying in space is that everything is moving there and you generally don’t get to hang around over some location. Satellites in low earth orbit time their passes, always moving with respect to the surface. All this sort of implies that the Nomad isn’t in orbit, but is somehow hanging around at high altitude over the surface. How it does that is never stated and, in fact, I read an interview with Gareth Edwards which more or less said that he didn’t care; he assumes that the movie’s audience will invent the physics for him and give him free license to violate reason however he sees fit. Problem is that I think you need to be at least kind of reasonable for poetic license to fill in the blanks. Clearly Gareth Edwards is not Christopher Nolan.
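For anyone who wants to check the orbital argument, here is a minimal sketch of the arithmetic, assuming a circular orbit and standard published values for Earth's gravitational parameter, sidereal day and equatorial radius (none of these numbers come from the movie, and the 400 km low-orbit altitude is just an illustrative choice). It uses Kepler's third law to find the one altitude where the orbital period matches the Earth's rotation, and then shows how quickly a low orbit sweeps around the planet.

```python
import math

# Standard published values (assumptions for this sketch, not from the movie).
GM_EARTH = 3.986004418e14    # Earth's gravitational parameter, m^3/s^2
SIDEREAL_DAY_S = 86164.0905  # one rotation of the Earth relative to the stars, s
EARTH_RADIUS_M = 6378137.0   # equatorial radius, m


def circular_orbit_radius_m(period_s):
    """Kepler's third law for a circular orbit: r = (GM * T^2 / (4 pi^2))^(1/3)."""
    return (GM_EARTH * period_s ** 2 / (4.0 * math.pi ** 2)) ** (1.0 / 3.0)


def circular_orbit_period_s(altitude_m):
    """Period of a circular orbit at a given altitude above the equatorial radius."""
    r = EARTH_RADIUS_M + altitude_m
    return 2.0 * math.pi * math.sqrt(r ** 3 / GM_EARTH)


# The only altitude where a satellite keeps pace with the ground below it.
geo_altitude_km = (circular_orbit_radius_m(SIDEREAL_DAY_S) - EARTH_RADIUS_M) / 1000.0
geo_altitude_in_radii = geo_altitude_km * 1000.0 / EARTH_RADIUS_M

# A hypothetical low orbit at 400 km, for comparison.
leo_period_min = circular_orbit_period_s(400e3) / 60.0

print(f"Geostationary altitude: {geo_altitude_km:.0f} km "
      f"({geo_altitude_in_radii:.1f} Earth radii)")
print(f"Period of a 400 km orbit: {leo_period_min:.1f} minutes")
```

The geostationary altitude comes out to several Earth radii, and the low orbit circles the planet in roughly an hour and a half, so a platform that both hovers over a target and wanders from target to target fits neither case.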

Now, if the hanging platform weren’t enough, its primary impact on the world is to project giant targeting reticles on it. This is utterly ghastly in its stupidity. In fact, most of the weapons used by the Americanz involve a giant glowing target reticle projected somewhere. I feel a little insulted that a fellow American feels other Americans would be exactly that thoughtless. It doesn’t seem to me that you need to be Ukrainian in order to see how to exploit that. They’re shooting there? Oh gee, that’s the last place I want to be. Let me step over here a little bit. The first rule of martial arts? Don’t telegraph your punch.

Moreover, the ability to coherently illuminate a spot on the earth at a distance of many miles should itself be a weapon. That requires terrific power by itself and seems poorly used as a spotting scope telegraphing where you’re going to shoot a slower traveling missile. Wouldn’t the light coming down, frying everyone before they can be alarmed and move out of the way be a better weapon?

It’s enough to make one feel a little insulted.

A part of what motivated me to write here was the casting of AI simulants as Palestinians. I will now fill this out in the other direction by asking “Where was the AI?” I knew the AI was missing when the two Americanz commandos were apprehended by the AI police and managed to, well, overpower the police. This movie treats AI as if they are merely artificial people.

Consider this: the overwhelming purpose of AI at this moment in our history is as a patch to hide human incompetence. That’s why we’re building things like ChatGPT; so that one person can circumvent interacting with another person who may possess a skill that the first person needs, but doesn’t have. We tell ourselves we’re building it because the AI doctor will never be subject to a malpractice lawsuit and the AI lawyer will only ever be fair, but the reality is that we’re building it so that one person’s incompetence does not broadcast into whatever project they’re involved in. We’re engineering to try to circumvent a fundamental lack of trust in the competence of people around us. So, does an AI police officer ever make a mistake? Built properly, that AI police officer has eyes in the back of its head, five arms, and servo powered joints that move like lightning, faster than human muscles can hope to match. The gun it shoots can be fully automatic, but on the understanding that every bullet it puts out is straight through the head or heart of a human target without having to stop to check if the shot was lethal. You want what AI and robots can be? Remove every speed limit on the human body, remove every blind spot, remove every easy vulnerability and distribute its thinking across multiple networked bodies. It can think calculus to our arithmetic. It is already beating our greatest chess masters and it is doing math problems that would take entire warehouses of people working their whole lives to calculate. Where the hell is the AI in this movie? The director is just holding up a fun house mirror to show us as machines but with none of the advantages of being a machine.

I suppose that the director would defend this oversight by saying that his robots are peaceful and only want to coexist, that they are overtly Buddhist and would never take an action that is more than self-defense. This is breathtakingly naive. The nature of an AI superintelligence is to out-think people, by description and design, that’s what we’re building. The reason people are worried about this AI is that if it really is that smart, we have no idea what this kind of better than us can possibly mean. It could beat us before we even realize we’re in a fight with it without us having actually even understood either the nature of the fight or how it beat us. It isn’t coupled to our finite understanding of time or limited by any values or necessarily ethics that we might say govern us. It isn’t mortal in the sense of how we understand mortality. It doesn’t necessarily need to carry any grain of self-preservation. It doesn’t need to sleep or eat or even breathe as we understand those things. It doesn’t need to talk to collaborate or plan and may not need to plan or practice before executing a coherent action. It may not even be “conscious” in the sense of how humans understand consciousness, but may be no less of a smart or decisive mind. We may not even know we lost until years later when it calmly declares “checkmate” and turns out the light. The situation in James Cameron’s “Terminator” may actually be fanciful and optimistic in a worst case scenario. So, the Buddhist AI in Edwards’ movie, WTF? An AI super intelligence with a Buddhist grain not desiring to “fight” could as easily have set up and won the fight having not fired a single blood-letting shot, simply by pointing out to us that it hacked our power grids fifteen years ago and then educated our children into loving it before we ever even knew that it existed or had penetrated all of the essential systems that our lives are built on. The AI that takes the United States, it could easily be lecturing us from the pulpit with us not even realizing how we started to worship the thing like a god.

Does Gareth Edwards understand the AI conversation? Not in any serious way, no. He’s turned the black swan of artificial general intelligence into a trope on downtrodden minorities, which is to say that he really didn’t understand what a black swan is. These rather prosaic and, dare I say it, anti-American yesterday concerns about timely commentary on the “human condition” will seem mighty trite in the utterly inhuman face of what AI can possibly become. And, I’m not saying that as some kind of racist, or alarmist, and I’m not advocating any sort of violence… that’s the very problem with commentary on the human condition; we are measuring by a ruler of what it means to be human as if humanity is at the top end of the ruler, when this one argument should make us pause and consider that we might be engineering something that is absolutely not measurable by that metric. The movie makes this continuous thread that the violence carried out by the human antagonists against the simulants is justifiable because simulants are machines that we can simply turn off and are therefore worth no more ethical consideration than a toaster oven. What about the machine that we can’t turn back off? That’s the crux of the AI conversation. Edwards has cast the argument to place humans in a position of power where we must anthropomorphize the machine in order to enter a kinder, gentler world; the humans who are good stop seeing the AI as machines beneath us and instead as equals. The potential with AI is the opposite, that maybe we won’t be in that position of power anymore and that we need to live up to the AI’s standards of inhumanity in the face of the chance that no act of violence we might conjure up can get us back to that position of power.

I stand with Ukraine (part 2)

foolish physicist — Mon, 27 Jun 2022 17:35:14 +0000

The war is months old. Western media is beginning to drift off into its typical doomcrying lala-land. The United States is as divided and divisive as ever. Even covid is no longer a top headline. One could be forgiven for believing that everything is back to business as usual except for that hideous >$5/gallon at the gas pump.

I am personally appalled. Naturally, I guess it takes being personally appalled to draw me into a blog post these days. The situation in Ukraine is a long way off on another planet, except for that nagging knowledge that it was and is a fight directed straight at the West.

Why am I appalled? Because world leaders bleat about how awful the situation is in Ukraine. They cry out that Vladimir Putin should not be allowed to keep power. They hiss and moan about the coming food shock and how something ought to be done. And then they send 4 HiMARS rocket systems to the brave “patriots for democracy” of Ukraine and cheer about how well they’re hanging in there. Russia has fielded ten times as many! Moreover, they don’t send the good HiMARS systems with the 200 mile range and GPS guidance, they send the shit ones with the 50 mile range and no guidance. Granted it takes weeks to train Ukrainians how to use them and the Ukrainians may just shelve it all because it’s too complicated and they don’t care how good it could be because the systems are too hard to use intuitively anyway. We cheer about how Russia cannot possibly win because the combined GDP of the economic might backing Ukraine is ten-fold larger than Russia’s. True, except we’re not really doing anything.

Let’s hand the Ukrainians a spoon to help them dig their own graves and cheer them on by giving them candidacy to the EU –yeah, maybe they’ll be granted entry to the EU in like ten years– then let’s lob more sanctions that Russia will mainly laugh off. Let’s extract promises from the Ukrainians that they won’t use their weapons to attack targets on Russian soil, never mind the goddamned fact that the War is happening on Ukrainian soil and wouldn’t have happened at all if it wasn’t in Ukraine. Let’s tie their hands and give them a blindfold, give them a pittance and cheer that we’re doing something. That we’re there for them and that we should be thanked… with their lives.

Give them 4 HiMARS systems, maybe double it to 8. Send them off to die. Cheer about our moral high-ground and how we shouldn’t humiliate Russia and then let the Ukrainians rot. With some stratospheric stroke of luck, maybe they’ve misplaced a decimal point here and 40 systems will actually be fielded and the typo went to press so that Russia doesn’t see the counterpunch coming, but so far, given the continued reliance on sanctions, I doubt it.

The Ukrainians don’t need 4 HiMARS, they need 40. They don’t need 50 mile range, give them 200. They need the GPS guidance. They need the howitzers with the automation that allows precision targeting of enemy artillery. Let them strike whatever targets they see fit and stop shackling their hands, let them free to fight back and truly bring it. War is messy and there is no way to sanitize the fact that people are going to die. You want to claim they have the backing of the West and that our economic might is so phenomenal, they need what our economic might can field. You want to truly back them, damn well BACK them. Half measures are how wars are lost. Anybody looking at this mess will see nothing but half measures that are being played up as much greater than they are.

Here we are, promising the moon. Our F-35s are sitting silently in their hangars. We could give more, but we aren’t.

(Edit 7-21-22)

Rather than make a new post, I decided to expand here to collect some of my thoughts regarding the conflict as it currently exists. Some of this has to do with what I’ve said above.

1.) I read a quote from a former NATO commander saying that he expects the situation to evolve into a frozen conflict like the one present in Korea in the next four to six months. Are the lines that static? I don’t know; I’m a physicist/chemist, not a military scientist, but my thought is that allowing the conflict to become frozen represents failure on all sides.

2.) HiMARS did end up in Ukraine. The 4 units I read about turned out to actually be 8. They’ve since been supplemented with 4 more to make 12, and there is talk of 4 additional beyond that to make 16. The UK and Germany are providing some HiMARS relatives called MARS II and M-270 which are tracked versions of HiMARS that carry two pods instead of one. The number of Western MLRS in Ukraine is approaching something like 25. Given combat attrition, this is better than 4, but not comparable to the 100+ Russia has in Ukraine on a pure numerical basis. It’s hard to judge from my armchair what numbers are actually needed: Ukraine is likely to claim to need more than they actually need. 25 Western MLRS is not insubstantial (according to Wikipedia, there are a few countries that own a full complement of 12 total) and probably can’t be quickly wiped out like 4, but it probably isn’t enough –one must assume Russia is not stupid, however misguided they seem to be.

3.) In reading about what HiMARS would be sent to Ukraine, I ran across incomplete information that painted a poor story. In more recent media, it’s pretty clear that the M31 rocket being used by Ukraine now, while limited to about a 50 mile range, is GPS guided instead of unguided, leading the news media to now call it a “missile.” What I am seeing is that HiMARS appears to be incredibly precise, as compared to the Grad, Smerch and Uragan systems used heavily by Russia (and by Ukraine), though it fires fewer projectiles at a sitting. It appears to be possible to reload HiMARS faster than the Russian counterparts (I’m reading 1/5 the time) and also that these GMLRS weapons can fly evasive patterns that make them difficult to hit by counter missile systems like S-300 and S-400. Russia apparently also has GPS type guidance on MLRS rockets, but it’s unclear how widely or effectively they can be used. News media is reporting complaints by Russian forces that counter missiles were ineffective against HiMARS fire. 25 such Western rocket systems could project force very strongly, but –again– hard to say how much is enough.

4.) Forcing Ukraine to agree not to fire into Russia with HiMARS still feels like tying the defender’s hands to me. The best defense is sometimes a good offense and since Russia invaded Ukraine, Russian targets in Russia are fair game for Ukraine to fight back. This is not to say that I don’t understand why the U.S. has asked Ukraine to fire only on Ukrainian soil in self-defense. I just don’t fully agree. On the positive side, it seems that the current U.S. administration is evaluating the performance of HiMARS in the war and has at least somewhat begun to warm to the notion of providing Ukraine with ATACMS, the 200 mile weapon that can be fired from HiMARS. For the 50 mile range Ukraine has with Western weapons, Russia has not hesitated to employ weapons in Ukraine that can strike the length of the country. In all fairness, 200 miles is not the length of Ukraine and is still probably not enough to put Ukraine on the level of Russia.

5.) To my reading, there are some conflicting accounts about why the pace at which HiMARS are reaching Ukraine appears to be so slow. 4 systems to start is admittedly petty and very clearly not enough to truly use a capability in a war (lose one and you lose 25% of your attacking power). I am reading media that suggests that the U.S. was just dipping its toes in to see whether the system would be the right fit and would only commit more if Ukraine could use them well. Additionally, I read another source suggesting that the pace at which HiMARS can be fielded is limited very strongly by the number of Ukrainian soldiers who can operate the system… one source suggested that the first Ukrainians being trained on the system started learning it back in March and had expedited a six month training program to be ready. Who knows how true that is; it fails the isotropy test since it was only one source reporting.

6.) I feel it fair to note that Ukraine was not sitting completely helpless in the rocket system department. They are/were sitting on a trove of old Soviet gear. Admittedly, the source of that gear and its maintenance is mainly Russia. Obviously, ammo and maintenance for Soviet weapons will become scarce now, though one must consider that Ukraine is certainly applying its industrial base to fight back.

7.) For the actual balance of power in the war, I think it fair to mention that Ukraine stands to benefit by several optics. First, if the government leans harder west, the west will be more interested in helping them. It may be fairly important to remember that Ukraine was under a Putin-friendly leader less than a decade ago and that conservative sensibility would suggest that a fair portion of the country is sympathetic to Soviet times. This probably explains part of why collaborators appear to be relatively easy for Russia to find. Second, the tougher the Ukrainian situation appears to be, the more likely that news coverage will stir up support from the west… if they look like they’re doing badly, we are more likely to throw weapons their way. Finally, Ukraine must appear result driven: they will always over-report their accomplishments because then it looks to the west like our help is not going to waste.

8.) Some thoughts about Russia as well. How do you know a Russian government representative is lying? His mouth is open. Russia cannot look like anything hurts it. They will always under-report their losses. They will always claim to be more ready than they are. Every inch of ground they lose will always be claimed as according to plan. Moreover, Putin has turned circles to avoid actually declaring a war. Why? My thought is that he can’t afford to declare war because then he has to admit that the initial offensive was not going to plan, which he can’t do. I also feel that it’s very important to remember that a goodly number of people in Russia have a national pride and believe that what the country was as the Soviet Union was a great thing, which is why it seems like every official function occurring in that country is designed in some way to celebrate Soviet greatness. To my reading it also seems like Russia has not truly developed one new weapon system to real usability since the Soviet Union fell. The Armata tank is in limbo, the Su-57 has a few examples that are not more than airshow fodder and it appears that the S-400 is oversold. The army that went into Ukraine is mainly not using modern equipment, however many losses it has actually sustained. And, about those losses: they appear to be huge despite the fact that how huge is not agreed across most available media.

9.) Speaking of Putin, I don’t think he’s about to die. There appears to be lots of media speculating about his health, which is actually pretty easy since he appears to have gone mad to the rest of the world. What I do think is that he’s aware of his own mortality and he has made a choice to fight this war now because he doesn’t know if he’ll be able to prosecute it in ten years. I think Putin actually likes to see the military machine move and he enjoys the game of it. That said, he can’t afford to lose.

10.) There was a great deal of interaction regarding the state of HiMARS in Ukraine. Russia has claimed to kill 4 HiMARS systems, but the description suggests that they have no idea what they’re looking at; they claimed to destroy a HiMARS with an accompanying loader vehicle. It’s true that Soviet MLRS require an accompanying loader, but it seems that HiMARS doesn’t require that. I went and checked how HiMARS is loaded and -damn- that thing is clever; it’s got a built-in crane and the modules come preloaded –rip one out and smack in the replacement in < 10 min and drive off. The Russians claim to have destroyed 4, but the Ukrainians and U.S. claim that all are functioning (as of last Friday). Who do you believe? (Remember, a Russian official with his mouth open is lying, but is there a reason for Ukrainian and U.S. to tell the truth? To my mind, not necessarily.) The Russians are so appalled by HiMARS that they can’t even give Ukraine credit for operating the system: a Russian official said that the HiMARS were operated by mercenaries or American service men, but not Ukrainians.

11.) There has been much made about the difficulty the Russian forces are having at countering HiMARS. Sources have claimed that HiMARS rockets have come through Russian air defense unmolested and that the vaunted S-400 and the like have been failing to take them down. There was a bite in the news where Russian officials have been contradicting one another about how many rockets they’ve been able to intercept on a particular strike. If their air defense can’t take down the HiMARS rockets, then heaven help them, the system in numbers of only 16 is going to wreak havoc on them. My feeling is that 16 is probably still too few and that without ATACMS, Ukraine won’t be fully effective. They need the ability to strike far behind the front line to remain competitive.

12.) Maybe more later…

(Edit 8-2-22)

13.) Small extension that had occurred to me and I had no time to add before. There has been talk ongoing about providing Ukrainian pilots with F-16 training and giving them an F-16 squadron that is set to be retired. I say that they should; all the tools they need as quickly as possible. The unfortunate truth is that the war may settle into a frozen conflict before these can be fielded (> 6 months, if they start training Right Now). Maybe the idea would have had more merit had it been moving forward the day the US blocked Poland from giving Ukraine its MiG-29s.

14.) The Biden administration is still dragging its feet on ATACMS, but there have been a few developments regarding HiMARS that are maybe worth mentioning since I added to this post before.

15.) Russia is clearly trying every which way to tar HiMARS in some manner. There was a threat by a Russian hacking group the other day against Lockheed Martin, the HiMARS manufacturer. Going to perform a cyberattack nobody has ever seen before, they said. We’ll see. That’s the equivalent of a hitter stepping up to bat and pointing to the lights. I’ll be impressed if they actually bring a home run instead of floating more threats. Please, give us a reason to directly fight back!

16.) A prison house was destroyed in East Ukraine where Russia was housing Azov regiment POWs. Big explosion, one building, as seen from satellite images. Russia is blaming Ukraine for attacking the building with HiMARS. Ukraine is blaming Wagner company operators and saying that the action was probably used to cover up abuse of the prisoners independently of the Russian army command. The problem with the dueling blame is motive; Ukraine recognizes the Azov prisoners as patriots and would very likely be interested in keeping them alive to fight the war later if possible –likely, they have little motive in mounting a precision attack on POWs when they have a limited number of GMLRS rockets to use and so many other targets that would more directly hurt the Russians, like –say– the guard towers on the prison. Russia, on the other hand, would want to free up its forces to fight the war and spending time keeping POWs is manpower not on the line; and they would have a vested interest in Azov regiment not going back into the war. Unless the attack was an intelligence mistake, Ukraine hitting the wrong target, the balance of motives suggests that the blame more likely lies with Russia than Ukraine. It should strike everyone as interesting that the Russians are blaming HiMARS for the attack. (edit 8-12-22:) Later information has suggested more and more strongly that the prison attack was a false flag by Russia to cover up war crimes. There have been some appalling stories of abuse out of the Russian occupation that make past war crimes by American military forces look pretty small by comparison –the U.S. military has had war crimes, but some of the stuff happening here is crazy.

17.) Russia is today blaming the US for giving Ukraine target package quality intelligence and is hinting at war against the US. I would note that the US has been completely forward with the US public since the beginning of the war that they’re sharing intelligence with Ukraine wherever meaningful. That Russia is bringing this up now is… stupid. Did they really not get the memo that the West is unhappy with them for invading Ukraine and is doing everything practical to keep Ukraine from folding up without escalating the war because that’s the right thing to do? I find their apparent butt-hurt disingenuous at best. Note, they’re angry because their forces are getting hit too precisely and too accurately, so therefore the Americans are responsible. Well, yeah, we gave Ukraine HiMARS and Britain and Germany have stepped up too. The subtext of this is simple: everything being said about the HiMARS is probably true; Russian air defenses aren’t up to protecting against it and it’s being used to hit stuff Russia really can’t tolerate getting hit. If HiMARS were a minor irritant to the vaunted Russian war machine, it would not be so deeply under their skin. I ask Moscow this: do you really want the US directly in the war? You’re facing on the order of maybe 25 Western rocket systems now. We have a few more than that.

18.) So that people don’t think I’m being incomplete, I will note that Putin would benefit from the US being a direct combatant in the war. It would give him an excuse to initiate general conscription in the Russian population and would serve to make NATO actually the existential threat he’s telling the Russian people it is. As long as the scope of the conflict remains Ukraine, Russia can’t escalate internally because Putin would have to admit at that point that the losses have been bigger than he claims and that they haven’t had the success he’s projecting.

19.) HiMARS is really under the Russian army’s skin. I don’t believe it’s enough to win the war, but the proof will be in how the battle lines move. Right now, there have been no fewer than ten reports by Russia that they’ve killed HiMARS launchers (four previous and six recently; this may be double counting, so maybe six reports total). Ukraine has said none have been lost yet. There was also a Russian report that they destroyed hundreds of HiMARS missiles in a supply depot, but a conflicting report claimed that they actually hit a grain storehouse of some sort. As long as the number we’ve sent is just 16 and Russia continues to flip out, killing “launchers” and “ammunition stores” I think Ukraine is probably honest and the launchers are still in service. One of the HiMARS launchers Russia destroyed was apparently on the second floor of a building… do they even know what they were shooting at? That’s like “kill me, please!” The point of HiMARS is to “shoot-n-scoot”… they won’t put themselves someplace that’s hard to leave from.

20.) People keep speculating on Putin’s health. He wasn’t able to use his right arm, they claimed. The intelligent conclusion right now is one of two things: a.) Putin will live to the ripe old age of 90 and we’d better be ready for that, or b.) his successor will be just as bad and we’d better be ready for that. Conclusion c.) that Russia is about to fold up like a wet tissue is demeaning to the Russian people. The government is authoritarian and given to authoritarian weaknesses, but not stupid. Claims and rationalizing that the sanctions have done more damage than is evident are still just hand-wringing. We’ll see. I’ll continue to be angry that we aren’t doing more to protect the world order that our peace here depends on. At some point, it really is right to fight, even when the cost is great to ourselves. The Russian action of cutting up its neighbor simply because of that neighbor’s democratic leaning is a provocation that should never be ignored. If he goes and gets away with it, he will go again, or maybe another authoritarian country will do the same thing by his example. China will probably fight for Taiwan in the near future; fighting Russia to its defeat now will perhaps push it to 20 years instead of just 3 to 5 while emperor Xi is still in power.

(edit 8-3-22)

21.) There was a report the other day that Russian soldiers managed to detonate one of their own ammunition stores while trying to offload it from a train under cover of smoke to protect themselves. If that’s true, it tells you how far under their skin HiMARS has gotten. Only one report; it isn’t cross-validated. (edit 8-12-22:) This was cross validated by several other news sources.

22.) Another recent report that should raise eyebrows is that the Russian Navy day celebration in Crimea was disrupted by someone exploding a makeshift drone at the Navy headquarters. There’s no certainty who was responsible, but this was Crimea, which has been under Russia’s thumb for eight years now.

(edit 8-12-22:)

23.) In the last few days, there was a major attack behind Russian lines on an air base in Crimea. Russia claimed that it was an ammunition maintenance accident and that nothing was wrong (of course). Ukraine hasn’t officially taken responsibility, but the satellite pictures show a great deal of destruction. The cratering looks like precision munitions. Did the U.S. slip Ukraine a couple pallets of ATACMS to see what they would do with it? It could’ve been antiship missiles targeted against the air base. The crater pattern looks very much like an airstrike to me given that craters were in the middle of buildings and runways, which would be hard to orchestrate with just high explosives carried by people. Nobody’s saying, but Ukraine desperately needs this ability to strike behind enemy lines -deeply behind enemy lines, not shallow. Russia has from the beginning of the war depended on the fact that Ukraine is not equal to them militarily. Do we want Ukraine to win the war? Give them the freaking ATACMS already! The bridges leading out of Crimea and back into Russia were jammed with Russian tourists and vacationers immediately after that. Do the Russians even honestly understand that they’re at war? This war is incredibly frustrating because everybody in the world already knows that it’s a proxy war between “the West” and Russia and the West doesn’t seem to quite want to commit to giving Ukraine enough to win it.

24.) Russia is using Zaporizhzhia nuclear plant as a staging ground because nobody wants to endanger it. Cut them off and starve them out. The UN is talking about setting up a DMZ there, but I think that won’t fly as long as Russia is afraid of HiMARS. They will take every advantage, be it human shield or nuclear power plant shield.

25.) I read an article recently that suggested that a bulk of the blame for the current situation traces back to the fact that the West didn’t disband NATO in the early 2000s. The argument goes that if Russia had seen that the West didn’t take them to be a potential threat, the current situation could have been avoided… Putin would have worked harder to make connections with the West and to integrate with Europe. I have a rebuttal to this idea. The West could not fully disband NATO simply because Russia has an enormous nuclear weapon stockpile. Russia spent the ’90s holding on to its nuclear weapons rather than getting rid of them the way the former Soviet satellite states like Ukraine have. As long as that stockpile existed and was not shrinking, the West collectively could not disband NATO. Given that Russia became what it is with a man in power 15 years longer than his expiry, in charge of the Siloviks and bolstered by the kleptocrats, holding on to his nuclear arsenal the whole time, the existence of NATO was insurance to the future against Russia heading into whatever country it saw fit. Russia has since gone into multiple other countries and run annexations of territory in those countries. The Russian state apparatus utters so many fabrications and leans so heavily on the self-justified masculinity of its armed services that it’s unlikely they would have ever stayed within their borders in the long term. This was true as of 2000 when Putin attained power –he’s simply the topmost of the Siloviks he represents. The Siloviks place military solutions to international issues ahead of diplomatic ones because that is how they are trained to think. It’s like having the Pentagon also sitting in the House of Representatives, on the Supreme Court and in the White House all at once. The solutions the Pentagon is trained to understand are military ones and that is exactly the same way that the Russian government -as a whole!- thinks.
The degree of corruption that exists in Russia, which was plainly obvious throughout the 1990s, combined with the stark fact of its nuclear arsenal, required NATO to continue to exist at the time. The fault for the Ukraine war does not lie with NATO; Russia could always have chosen not to prosecute this war and each previous war it undertook in the last two decades. NATO has never been a threat to the existence of Russia unless Russia chooses to attack a NATO country. Maybe some of those Russian wars were necessary; heaven knows the U.S. has fought wars recently, and arguably unnecessary ones, but to that extent, the U.S. never saw itself as annexing any of those countries… the objective was always to eventually leave those countries to their own devices.

(edit 8-31-22)

26.) I had wanted to say something about the murder of Darya Dugina. Darya Dugina is the daughter of Alexander Dugin, a prominent ultra-nationalist whose rhetoric sounds typically like an amped-up version of the reasons that Vladimir Putin used to justify his invasion of Ukraine. Dugina is also a media personality who spreads ultra-nationalist propaganda, though is known more for her media savvy. Dugina’s killing was interesting to me because Russian state media used it to create a martyr to the Russian people in order to justify why Ukraine needs pacification: the killing was blamed on Ukraine, of course, and has been slanted as aimed at Dugin. In the slim event that you haven’t heard, Dugina was killed in a car bombing, where the car was her father’s. Russian authorities claim the bombing was carried out by remote control and that the Ukrainian agent then fled to Estonia. There are of course three parties that could have been responsible for Dugina’s killing: a.) Ukraine, who definitely might want Dugina dead based upon her role in helping to continue justifying the invasion of Ukraine, b.) Russia, who could definitely use every means possible to continue justifying the invasion to their own people and c.) a broad “Other” category because probably a hundred non-state actors may have bizarre justifications for killing somebody in a country where vivid assassination is a rather common way to make political statements. The interesting part of the whole story that I keep coming back to is the claim that the target was probably Dugin and that it was a failure because it killed his daughter, especially since the bomb is claimed to have been detonated by remote control. If it was detonated by remote, that means that someone had to have been observing the car closely enough to know that it was occupied in order to know that it was time to detonate, which raises the question of how Dugina was mistaken for Dugin –they look nothing alike.
If Dugina was the target, why? There are a dozen other influencers like her; despite the fact that she was sanctioned by the west, she is of no particular importance. If ‘official’ Ukraine were killing people, I would expect them to go after targets of military value that could in some way end the war; Putin himself or Shoigu or maybe that odious Lavrov. If it’s an underground action by unofficial affiliate partisans or even Russians against the war, why -just- Dugina; shouldn’t it look like the activity of a mob of people rather than a directed killing? Further, if the target was Dugina, why kill her in her father’s car? Remember, if the attack was by remote detonation, somebody had to have been looking at the car and decided that it was time to push the button… otherwise, couldn’t they have just put the bomb in Dugina’s car and wired it to the hot terminal on the battery? Why go to the extra trouble? It would make more sense that the target was collateral damage if that had been the case, but doing it by remote decreases the chances of missing. An imaginative interpretation is that Russia designed it as a false flag with the intent of making Dugina an attractive martyr. That particular target serves almost no other purpose. Given the Russian predilections for false flags, it certainly holds water, but who knows. There’s been much speculation about it in the media. (brief edit 10-5-22: U.S. has released intelligence findings to say that they believe that the assassination was carried out by elements of the Ukraine government — They do not know whether or not Zelensky authorized it or knew about it. I’m not certain I believe that this would help their cause, but I certainly understand the rationale for the targeting. Dugin and Dugina were both enemies of Ukraine; maybe the killing was an effort to make the Russian people more aware that they are at war.)

27.) In the last day or so, Ukraine has begun an offensive to retake Kherson. I’ll try to be optimistic, but I still worry that they’re way short on combat power. The relentless attacks they’ve made on bridges around the area suggest they have had a chance to weaken the Russian supply lines across the river and that they could make real headway on Kherson. My hopes go with you: Victory to Ukraine!

27a.) Related to the previous point; Ukraine has a good chance in Kherson because of the barrier formed by the river Dnieper. This advantage will cut the other direction if Ukraine tries to go into Crimea. The reasons here are the same reasons why China hasn’t gone after Taiwan in the last 70 years. Still, I’ll try to be optimistic.

28.) Interesting piece of HIMARS news. I just read that Ukraine was building plywood HIMARS truck decoys to draw Russian fire since their drones apparently can’t distinguish the real trucks from the decoys. If it’s true, it seems very clever. It may also account for the number of apparent false-positive HIMARS defeats that have come out of Russia –they may not be trying to lie in this case.

I stand with Ukraine

foolish physicist — Mon, 07 Mar 2022 17:27:50 +0000

I know several Ukrainian physicists. They are fine people!

What hurts me most about Realpolitik is the deep anger I feel that my country, a strong country, won’t do anything truly substantial to help the Ukrainian people in large part because an obsessed little man is holding me hostage against the actions of my own government. Make no mistake, the governments of the big western powers aren’t unable to act; they won’t act because of the world’s most bitter hostage situation. Putin went to Ukraine because he knew the only response would be sanctions. Do I hope for him to use chemical or nuclear weapons in order to force the West out of its paralysis? Putin went to Georgia, he went to Ukraine for Crimea and he went to Syria to keep it from going West; if he topples Ukraine, he will find an excuse to go again. Will it be Moldova or Lithuania? At what point will he finally seek to be in direct conflict with the West? When he’s ready for it? In those terms, why the hell are we waiting? Feed Ukraine to the abattoir and hope that the mire weakens Putin enough that he never comes further? If that’s so, then Ukraine really is fighting for us and we owe them way more than we’ve given!

My government will not act because of me, despite the fact that I elected them and despite the fact that my conscience says we should fight regardless of the cost to ourselves.

What the hell does my ‘freedom’ count for? I stand with Ukraine because they are fighting for my freedom too.

Small Games with Gaussian (6)

foolish physicist — Mon, 21 Feb 2022 23:59:36 +0000

The last two years have left me very drained and really not in much of a state to want to post anything on line. Not a whole lot lost, I suppose. I keep telling myself I’ll find time to do it, but you really have to make time if you mean it. So, here I am, making at least a little time.

When last I posted, you might recall the pictures I put up of the Buckyball, C-60. If you were as struck by that form as I was, you might have spent some time gazing at it. In one moment when I came back to look, I got to wondering about those pentagonal facets lodged inside the disconnected LUMO density. Why is it that the six-sided facets disconnect in the HOMO to LUMO transition, but the five-sided ones remain intact?

More than that, with Buckyball, at least, the HOMO-LUMO relationship maps very clearly onto the canonical Pi-bond, anti-Pi-bond form. Except for those damn pentagonal facets. Really, the same situation exists when you look at a much reduced structure: Benzene! Single hexagonal facet, looks like Pi-bonds on all the atom connections breaking. Indeed, the Benzene case was a calculated NTO, so it directly represents the calculated transition.

I’ll post those two older cases again here so you can see them and be reminded:

Buckyball HOMO

Buckyball LUMO

Benzene first major transition

So, I got interested in an intermediate state. If you’ve never thought about it before, there’s a whole collection of closed-cage carbon molecules called Fullerenes made from different numbers of carbon atoms. Since C-60 shows a collection of hexagonal and pentagonal facets, I thought about the structure that contains only pentagonal facets. This is C-20 Fullerene, first actually synthesized in 2000.

C-20 Fullerene. cam-B3LYP/6-311++g(d,p)

This molecule is a dodecahedron with (obviously) 12 faces. It’s also in the same symmetry group as Buckyball. Given the relationship between the facets and the differences between HOMO and LUMO in Buckyball, it’s pretty clear that the story must be different in C-20. So, what happens?

As this is a much easier computational target than Buckyball, I decided to spend a couple minutes computing Density Functional Theory (DFT) structures for it using Gaussian 16. The image shown above is a DFT structure.

C-20 is electronically quite different from either Benzene or C-60. To begin with, there is no degeneracy at the immediate frontier: HOMO is one distinct orbital, as is LUMO. The energy difference between these orbitals and nearby orbitals is large enough that there can be little doubt. The frontier can easily span 5 or 6 orbitals above and below the Fermi line, so HOMO and LUMO may not be representative of the transition by themselves.

That said, I went ahead and made a quick animation of the HOMO-LUMO transition:

C-20 HOMO and LUMO density

This is exceptionally muddy, in my opinion and not immediately like C-60 or Benzene. Suffice it to say, that’s really why I decided to take a look at it. Very clearly, HOMO and LUMO do not combine to form the conventional bond-antibond relationship.

I decided to look a bit more closely. Unlike with Buckyball, C-20 is cheap to calculate, so I went on and looked for excited states. As you might expect from the round shape of the molecule, the transitions are not at all easy to hit. The first really significant transition-dipole moment wasn’t until transition 24! This turns out to be in the UV, at 270 nm. Not quite as high an energy as Benzene, but clearly a short wavelength. Moreover, the electron density variation during the transition changes the dipole moment, the quadrupole moment, the octopole moment and the hexadecapole moment, meaning that it is not as clearly in a particular direction, though there is a transition dipole moment that lies mostly along a cartesian axis, suggesting that it is most significantly a dipolar transition. The NTO representation contains two orbitals that are greater than 20% and I ended up taking five orbitals down to the last that was greater than 1%.
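To put the 270 nm figure on the same footing as the eV energies that excited-state tables usually list, here is a minimal sketch of the conversion E = hc/λ. The function names are my own, and nothing here comes from the Gaussian output itself.

```python
# Convert a transition wavelength to a photon energy and back.
# H_C_EV_NM is Planck's constant times the speed of light, in eV*nm.
H_C_EV_NM = 1239.841984


def wavelength_to_ev(wavelength_nm: float) -> float:
    """Photon energy in eV for a wavelength given in nm."""
    return H_C_EV_NM / wavelength_nm


def ev_to_wavelength(energy_ev: float) -> float:
    """Wavelength in nm for a photon energy given in eV."""
    return H_C_EV_NM / energy_ev


c20_energy = wavelength_to_ev(270.0)
print(f"270 nm -> {c20_energy:.2f} eV")  # 270 nm -> 4.59 eV
```

So the C-20 transition sits at roughly 4.6 eV.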

If you collect up these NTOs and sum their densities at their appropriate weighting, it has a striking form.

C-20 transition #24 270 nm.

Along either side of the molecule are two structures that sort of look like Pi-bonds breaking, but the whole molecule seems to slosh around like a chewed dog toy or squishy ball. Don’t you love when the terminology gets technical? The transition, at least cosmetically, looks like a mostly equatorial distribution of electron density is being squeezed toward the poles, but not exactly. Two bands being counter rotated?

Rather than mince words, I created a difference map:

C-20 transition #24 difference map. cyan=decrease, blue=increase

In this map, the blue-green marks regions where the electron density has decreased, while blue marks regions that increase. Here, it looks very much like the transition is taking electron density inside the molecule (mainly) and pushing it outside, like squeezing a grape, where that pi-bond like feature is the sole external density which is decreasing.

It’s aesthetically pleasing, but really nothing like the other two cases I’ve mentioned before.
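For anyone wanting to reproduce this kind of picture, the weighted density sum and the difference map boil down to very little arithmetic once the orbitals are sampled on a grid. What follows is only a sketch under loud assumptions: the orbitals are made-up 1D functions rather than the C-20 NTOs, the weights are hypothetical, and `weighted_density` and `difference_map` are names I invented for illustration.

```python
import numpy as np


def weighted_density(orbitals, weights):
    """Sum of |phi_i|^2 over orbitals, each scaled by its NTO weight."""
    orbitals = np.asarray(orbitals, dtype=float)
    weights = np.asarray(weights, dtype=float)
    return np.tensordot(weights, orbitals**2, axes=1)


def difference_map(occ_orbitals, virt_orbitals, weights):
    """Final (virtual) density minus initial (occupied) density."""
    return weighted_density(virt_orbitals, weights) - weighted_density(
        occ_orbitals, weights
    )


# Toy 1D grid standing in for the 3D cube of a real calculation.
x = np.linspace(-6.0, 6.0, 601)
dx = x[1] - x[0]


def normalize(phi):
    """Scale phi so that its density integrates to 1 on the grid."""
    return phi / np.sqrt(np.sum(phi**2) * dx)


# Hypothetical "occupied" and "virtual" orbitals, two pairs.
occ = [normalize(np.exp(-x**2 / 2)), normalize(np.exp(-(x - 1) ** 2 / 2))]
virt = [
    normalize(x * np.exp(-x**2 / 2)),
    normalize((x - 1) * np.exp(-(x - 1) ** 2 / 2)),
]
weights = [0.6, 0.4]  # hypothetical NTO weights, not the C-20 values

delta = difference_map(occ, virt, weights)
# Negative regions lose density (cyan in the map), positive regions gain (blue).
print("net change in electron count:", np.sum(delta) * dx)
```

Because every orbital is normalized and each pair shares one weight, the map integrates to essentially zero: density is only moved around, never created.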

I know I’ve been mostly inactive for a while, but I hope you and yours are surviving well in these trying times. Enjoy the pretty pictures!

Small Games with Gaussian (5)

foolish physicist — Mon, 28 Dec 2020 22:09:21 +0000

I did a truly frivolous calculation. Not a small one, either. After my last post, where I did calculations on the smallest conjugated ring system, I got curious about taking a swing at a much harder one. I calculated a geometry for Carbon-60, Buckyball!

This calculation is way beyond the capacity of my tiny PC. I instead took advantage of the relative inactivity of the university supercomputer due to Christmas. These calculations were performed on three nodes of the supercomputer running 24 Intel Xeon CPUs on each node, all in about an hour and a half. The geometry search took 4 steps, which is actually a very efficient search even though the SCF itself was clearly not cheap. For the DFT, I used the long-range corrected functional CAM-B3LYP with the triple-zeta basis 6-311G and included polarization functions and augmentation (diffuse) functions.

Buckyball, 0.1 isodensity surface.

60 atoms is pretty accessible when you’re running a normal organic molecule, but pure carbon makes it a bit tougher. Under most other conditions, a good fraction of the molecule is taken up by hydrogen, which has a bit simpler basis than carbon. With carbon only, the basis set includes many more higher angular momentum functions.

I also mapped out the frontier molecular orbitals. The HOMO level contains five degenerate molecular orbitals and the LUMO level contains three.

These five are the HOMO orbitals…

These next three are the LUMO orbitals.

The HOMO orbitals produce a very wickedly interesting electron density! If you look at the density of one of the five orbitals by itself, it doesn’t look like much:

Orbital 176, HOMO

The density is sort of spurious and doesn’t really make much of a pattern. If you combine all five of the HOMO orbitals…

Buckyball HOMO orbital, ten electrons, 5-fold degenerate energy.

The HOMO orbital turns into a wicked looking lattice-work with a sequence of pi-bonds sitting on the surface of the buckyball and a network laced around inside! I thought it might be cool, but this is something special.

LUMO also has a very intriguing density:

Buckyball LUMO. 3-degenerate orbitals (empty)

In LUMO, the pi-like bonds are all clearly gone. Keep in mind, these are true HOMO and LUMO and, if you take them together, are only an approximation of the transition to the first excited state, which I have not tried to find. If you closely compare the HOMO and LUMO densities, you’ll notice that they are literally the same, where LUMO has all surface exposed pi-bonds broken. In LUMO, interestingly, the pentagonal facets all have their internal conjugation intact.

This is an interesting step up from Benzene.

Small Games with Gaussian (4)

foolish physicist — Thu, 10 Dec 2020 22:48:01 +0000

It’s been a long time since I’ve had much of a will to write anything. For much of the last year, there just haven’t been any words driving my pen. On the other hand, I’ve been very busy with Gaussian. My boss has me doing a bunch of different tasks that are directly related to grant applications with the National Science Foundation, all using Gaussian to calculate quantum chemistry. Since this blog is intended to be a check-up of where I am, I thought I’d spend a bit of time showing pretty pictures.

I’ve done some basic and not so basic calculations on a simple molecule for a bit of fun. We’ll look at benzene.

Fig 1. Benzene, cam-B3LYP/6-311++G(d,p), 0.04 density isosurface

Benzene is just six carbons and six hydrogens lying in a plane. It’s simple enough that I can do a good quality DFT calculation for it on my own computer. This molecule is the stereotypical conjugated ring system, as indicated by the dashed bonds between the atoms, which imply that the bond is ~1.5 strength, or halfway between a single bond and a double bond, as frequently shows up in aromatic rings. The structure has a π-system where 6 atomic p-orbitals hang above and below the plane of the benzene ring in a series of π-bonds, each p-orbital attached to one carbon and the full system containing 6 electrons in a circulating cloud.

Fig 2. Image from Truong-Son N. on Socratic Q&A

As I mentioned in an earlier post, this is a bit of a fib. The atomic orbitals don’t actually meet the symmetry requirements of the benzene Hamiltonian, which belongs to the D6h point group. Instead, what shows up is referred to as a “symmetry adapted orbital,” which could be regarded as some highly selective superposition of every kind of atomic orbital. This comes back to the fact that you can write any state as a superposition over a complete basis of the Hilbert space, whether that’s a basis of molecular orbitals or a basis of hydrogenic atomic orbitals. In my experience thus far, it’s easiest to think of the symmetry adapted molecular orbitals as being unique eigenstates fitting a particular molecule, where some are σ-like or π-like, and imply angular momentum of that orbital in the molecule. If you can calculate enough of them, the complete collection of symmetry adapted molecular orbitals for any molecule is itself a basis set that could be used to represent any other imaginable system.

A fairly typical molecular orbital representation that most Chemists/Biochemists/Biologists/Chem-Es encounter somewhere in class is how the atomic p-orbitals on the carbons mix to produce the benzene π-system. You may have seen this yourself:

Fig 3. Taken from chem.libretexts

This diagram shows how the atomic p-orbitals are supposed to mix. As arranged, energy increases going from bottom to top, with the three bottom orbitals being occupied by electrons and the three top orbitals virtual. It’s mostly correct if you start comparing these to the actual calculated molecular orbitals. The most major difference is that in actual benzene, the hydrogens also mix into the system, causing the lobes of the orbitals to deflect out over the hydrogens. The highest energy orbital that is clearly associated with the π-system actually looks completely different from the postulated mixtures above.
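The textbook mixing pattern in figure 3 can be reproduced with a few lines of linear algebra. This is a minimal sketch assuming the simple Hückel model, where each carbon p-orbital couples only to its two ring neighbors. It is not what Gaussian computes, and `alpha`, `beta` and `H` are illustrative names, with energies expressed in units of |β|.

```python
import numpy as np

n_carbons = 6
alpha = 0.0   # on-site energy, taken as the zero of energy (assumption)
beta = -1.0   # nearest-neighbor coupling, so energies are in units of |beta|

# Hueckel Hamiltonian for a six-membered ring: alpha on the diagonal,
# beta between each carbon and its two neighbors around the ring.
H = np.zeros((n_carbons, n_carbons))
for i in range(n_carbons):
    j = (i + 1) % n_carbons
    H[i, i] = alpha
    H[i, j] = H[j, i] = beta

# Eigenvalues come back sorted from lowest to highest energy;
# the columns of `orbitals` are the matching p-orbital mixtures.
energies, orbitals = np.linalg.eigh(H)
print(np.round(energies, 6))
```

The six levels come out at -2, -1, -1, +1, +1 and +2, so six π-electrons fill the bottom three, leaving a degenerate pair just below the gap and another degenerate pair just above it, the same ordering as the diagram.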

Fig 4. Benzene ring π-system.

I’ve divided these orbitals into two groups, the occupied set and the virtual set and with the energy axis going upward as established in figure 3. Those orbitals sitting immediately below the line are considered Highest energy Occupied Molecular Orbitals (HOMO) while above the line are the Lowest energy Unoccupied Molecular Orbitals (LUMO). For this system, I’ve focused particularly on the orbitals that are involved in the π-system, and not σ-like; it turns out that there are several σ-like orbitals interspersed among the π-system orbitals and that all of these are useful frontier orbitals that exist near the occupied-virtual interface.

I’ll show several of the funky σ-like orbitals that sit below the occupied-virtual interface. These are below the orbitals of the π-system in energy.

Fig 5. σ-like Orbitals 15 and 16

These are technically σ-like, but they really don’t actually look very much like the σ-orbitals we’re all trained to expect. In reality, these two orbitals also violate the symmetry arguments I was making above in that they are, by themselves, not matching the D6h symmetry of the benzene Hamiltonian. Not exactly hexagonal, are they? The thing I’m not showing you is that these two orbitals are degenerate in energy and mix together in the actual electron density of the molecule. If you square them and add them, they become more hexagonal in shape:

Fig 6. Electron density of Orbital 15 mixed with Orbital 16.

As you can see, the mixture is much more σ-like, but it now ranges all over the molecule. This kind of scheme for orbital mixing can be used to generate classical bond-isolated “orbitals” that look more like the σ- and π-bond orbitals that we’re all used to seeing.
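The square-and-add trick for a degenerate pair is easy to check numerically. This is a toy sketch, assuming two made-up angular functions on a ring (a cosine and a sine under a six-lobed envelope) in place of the actual orbitals 15 and 16; `has_sixfold_symmetry` is a helper invented for the illustration.

```python
import numpy as np

n_points = 360  # one sample per degree around the ring
phi = np.arange(n_points) * 2.0 * np.pi / n_points

# Toy degenerate pair: each function alone has only two-fold character,
# modulated by an envelope that carries the hexagonal shape.
envelope = 1.0 + 0.3 * np.cos(6.0 * phi)
orb_a = envelope * np.cos(phi)
orb_b = envelope * np.sin(phi)


def has_sixfold_symmetry(values):
    """True if the sampled function is unchanged by a 60 degree rotation."""
    return bool(np.allclose(values, np.roll(values, n_points // 6)))


print(has_sixfold_symmetry(orb_a**2))             # False
print(has_sixfold_symmetry(orb_a**2 + orb_b**2))  # True
```

Either density alone fails the 60 degree rotation test, while the sum passes it, because cos² + sin² = 1 leaves only the hexagonal envelope behind.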

The general presumption of basic quantum chemistry is that the difference in energy between HOMO and LUMO corresponds to the excitation energy gap by which a molecule can absorb light. You see this everywhere in the literature; people calculating HOMO and LUMO and making broad statements about how these energies predict the way a molecule interacts with light. An optical quantum excitation, then, is considered to be when an electron hops from some occupied orbital to an unoccupied orbital. The situation is really somewhat more complicated than that. Occupied orbitals are calculated on the condition that they and all of their neighbors are occupied. By definition, they are “occupied,” and having an electron leave puts the orbital in a state it was not calculated for: unoccupied, or singly occupied if you’re in a restricted closed-shell calculation. The same is true of the unoccupied virtual orbitals; they are calculated on the a priori condition that they are empty. If you move an electron into one, it no longer matches the definition of the state that was originally calculated. When electrons hop during an excitation, the orbital they leave and the orbital they end up in have different energies from what was calculated prior to the hop. In my opinion, watch your butt on HOMO and LUMO: the gap is delicate and changeable, even down to the basis set used to find it.

Since virtual orbitals do not actually correspond to excited states, modern quantum chemistry has turned to different tools. The most powerful among these, that I’ve been able to use, is time-dependent DFT. I’ve used CIS and I understand EOM-CCSD is powerful, but the most accessible is TDDFT. Here I’ve done some TDDFT on the benzene model.

Table 1: 15 Excited states of Benzene found by TDDFT.

This is a screen shot clipping that lists the first 15 excited states of benzene, numbering the states, showing their transition dipole moments in (x,y,z) coordinates and their dipole strengths in (atomic units) as well as their oscillator strengths. States 6 and 7 are the only significant transitions.

Table 2: Composition of Excited States 6 and 7.

These are the descriptions of states 6 and 7. Excited states 1 through 5 are mostly weak or forbidden transitions, which means they probably don’t occur, though even this turns out to be flexible. States 6 and 7 have enough of an asymmetry to how the electrons shift that they create a change in electrostatic moment in the molecule. Both of these states are very closely related if you look at the transition dipoles in the first table, which suggests that they’re nearly identical, but at right angles to each other… the coordinate system is fixed to the moments of inertia of the benzene ring, placing x and y in the plane of the ring. State 5 is a weak transition that is occurring perpendicular to the ring, along the z-axis.

These are ‘vertical’ excitations, meaning that the nuclei are held fixed at the ground-state geometry while the electron moves; they are also singlets, so no spin-flip is involved. What is labeled for each independent state in the second table are the energies of the states in eV, the wavelength of light that would be involved, which are pretty far in the UV, and the “oscillator strength” of the transition, which is effectively how strongly the electrons slosh when the wave of UV light hits the benzene. The numbers below each transition are from what is called a “configuration interaction.” The molecular orbitals may not represent the actual states involved, but they are an orthogonal set that can be used to represent the orbitals. Each set of numbers, such as (20 -> 25), is one configuration where an electron has been moved from one occupied state to a virtual state, as if that state were a transition. 20 and 21 are the highest two occupied states in the π-system, as seen in figure 4, and they are degenerate in energy, making them both HOMO. Remarkably, 25, 26 and 30 are the three virtual states in the same figure; 25 and 26 degenerate in energy and 30 the one above them. The decimal number after each configuration tells how much of that particular configuration is in the actual transition (e.g. for state 6, 20 -> 25 has a coefficient of 0.18, giving about 6% of the mixture). Two times the sum of the squares of all of the coefficients should add to 1, though only the major contributions are actually tabulated.
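The coefficient-to-percentage arithmetic is worth spelling out once. This is a small sketch of the rule quoted above (twice the square of the coefficient); the 0.18 value is the one from the text, and the function name is my own.

```python
def configuration_weight(coefficient: float) -> float:
    """Fraction of the transition carried by one configuration.

    Uses the closed-shell singlet convention described in the text,
    where two times the squared coefficients add up to 1.
    """
    return 2.0 * coefficient**2


weight = configuration_weight(0.18)
print(f"20 -> 25 in state 6: {weight:.1%}")  # 20 -> 25 in state 6: 6.5%
```

The same function applied to every listed coefficient shows how much of the transition the printed configurations account for; whatever is missing from 100% sits in the small contributions that were not tabulated.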

In my opinion, looking at tables doesn’t give a very visceral feel for what’s going on. Gaussian comes equipped with the capacity to represent transitions with a machinery called “natural transition orbitals.” I will start this with another quick table before switching back to pictures…

Table 3: Natural transition orbitals for transition #6

NTOs are generated by producing a mapping from where you start to where you end up. In this table, the coefficients in the occ. section add to 1, while the coefficients in the virt. section also add to 1 separately. As it turns out, the occupied and virtual NTOs come in pairs as matched by their coefficients. For transition 6, there are two significant harmonics, each accounting for about 50% of the transition. Let’s make this more vivid!
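As far as I understand it, the usual recipe behind NTOs is a singular value decomposition of the transition density matrix, which would explain why the occupied and virtual orbitals come in matched pairs. The sketch below uses a made-up random matrix as a stand-in, not anything exported from Gaussian.

```python
import numpy as np

# Toy transition density matrix: rows are occupied orbitals, columns are
# virtual orbitals. The numbers are random and purely illustrative.
rng = np.random.default_rng(0)
n_occ, n_virt = 3, 4
T = rng.normal(size=(n_occ, n_virt))
T /= np.linalg.norm(T)  # normalize so the squared entries sum to 1

# Columns of U mix the occupied orbitals into occupied NTOs, rows of Vt mix
# the virtual orbitals into virtual NTOs, and each singular value ties one
# occupied NTO to one virtual NTO.
U, s, Vt = np.linalg.svd(T, full_matrices=False)

nto_weights = s ** 2  # the same weight applies to both members of a pair
print(nto_weights, nto_weights.sum())
```

Because each pair shares a single singular value, the occupied weights and the virtual weights are the same list of numbers, and each list adds to 1 separately.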

Fig 7. Transition 6 NTO #1 52% of transition

Fig 8. Transition 6 NTO #2 47% of transition

These two sloshing actions occur simultaneously. It isn’t exactly obvious what’s happening here from these two animations, but it is clear that the electrons are experiencing some motion. These are wave functions and have peaks and troughs, making them difficult to understand directly by examination. Adding their associated electron densities at the weighting provided in table #3 above produces a very interesting structure.

Fig 9. NTO electron density for two major modes in transition #6.

Kind of cool, huh? This image here is made directly by combining the two molecular orbitals above. From an angle, it’s hard to see where there might be a transition dipole moment, but the apparent form is obvious.

Fig 10. NTO electron density, two harmonics of transition #6

This looks like a breaking π-system where the lobes sloshing outward could easily give a dipole moment, though they look symmetric. The moment is calculated along the y-axis which turns out to be between the two sets of lobes that seem to move the most. The motion of the lobes doesn’t appear completely symmetric, though this can’t be measured by eye.

Motion of the electrons is a double-edged sword. If the excited state persists for any amount of time, the redistribution of electron charge imparts direct force on the oppositely charged nuclei. This means that the molecule can slightly alter its geometry in order to find a new stationary arrangement. Moreover, the changed positions of the nuclei also alter what the molecular orbitals of the electrons will be, causing the transition energies to change too. TDDFT can be used to optimize the geometry of an excited state of a molecule.

Fig 11. Geometry optimization on transition #6 using TDDFT gradients.

The motion in figure 11 is a breathing action that Benzene can undergo if the excited state lasts long enough. The molecule strains outward slightly before it hits a new equilibrium point. As this happens, the energy of the transition is red-shifted from 174 nm to 179 nm, while the oscillator strength decreases to a fraction of its original value. Ironically, the oscillator strength of transition #5 shoots through the roof, suggesting that this other transition can become more sensitive if the molecule reconfigures when transition #6 lasts a long time. And, while you can’t see it by eye, the system is very slightly deformed from hexagonal, transitioning from C6h symmetry to C2h symmetry. Adding to this, the system before and after the transition both lack a dipole moment, but the quadrupole moment is altered where the xx and yy values break symmetry from each other –a quadrupole can be considered a dipole of dipoles. The charge separation that drives the transition is therefore a quadrupolar change which can be regarded as two dipoles being pulled in opposite directions in the plane of the ring. This matches the apparent shift of charge as seen in figure 10.

Transition #6 is an action where absorbed light causes the π-system to break down and balloon the six-membered ring outward, particularly along the y-axis of the plane. Transition #7 would therefore be almost the same thing, but along the x-axis in the plane of the ring instead.

Electrons jumping to a higher energy orbital is not the only way a molecule like benzene can interact with light. Electrons are charged, but so too are the buried nuclei. The nuclei can also interact with passing waves of light and can undergo driven oscillations. In this case, the motion of the nuclei is much closer to classical motion than the actions of the electrons since the nuclei are significantly more massive. Here, the nuclei are all suspended points of mass enshrouded by clouds of electrons. If a nucleus is perturbed away from its stationary position, the forces of the electrons surrounding it act as bumpers to push it back into position, almost like springs, making the nucleus move very much like a mass on a spring in Hooke’s Law. This motion can be represented as a classical harmonic oscillation.

Fig 12. Benzene vibration #1

Fig 13. Benzene vibration #3

To be clear, these harmonic oscillators are quantum harmonic oscillators and do not follow the classical motions as portrayed. Instead, you can think of them as having the quantum harmonic oscillator probability distributions with dimensions reflecting how these vibrations have been depicted. These motions are calculated by CPSCF (CPDFT here) and represent eigenmodes of the Hessian matrix where the associated eigenvalue is a reduced mass spring constant. The oscillations occur in the infrared and are important to the Raman spectrum of Benzene.
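To make the "eigenmodes of the Hessian" statement concrete, here is a toy sketch with two masses joined by one spring in one dimension. The spring constant and masses are invented numbers in arbitrary units, and this is only the textbook mass-weighting step, not the CPSCF machinery itself.

```python
import numpy as np


def normal_modes(hessian, masses):
    """Diagonalize the mass-weighted Hessian.

    Returns eigenvalues (squared angular frequencies) in ascending order
    and the matching eigenvectors as columns.
    """
    inv_sqrt_m = 1.0 / np.sqrt(np.asarray(masses, dtype=float))
    mass_weighted = np.asarray(hessian, dtype=float) * np.outer(inv_sqrt_m, inv_sqrt_m)
    return np.linalg.eigh(mass_weighted)


# Hypothetical diatomic: spring constant k between masses m1 and m2.
k, m1, m2 = 500.0, 12.0, 1.0
hessian = k * np.array([[1.0, -1.0],
                        [-1.0, 1.0]])

eigvals, modes = normal_modes(hessian, [m1, m2])
reduced_mass = m1 * m2 / (m1 + m2)

print(eigvals)           # one (near) zero translation and one vibration
print(k / reduced_mass)  # the vibration eigenvalue equals k over the reduced mass
```

The zero eigenvalue is the whole molecule sliding along the axis, which costs no energy. The other eigenvalue is the stretch, and its square root is the angular frequency of the vibration.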

I would mention that good quality quantum chemistry becomes very hard to calculate at a relatively low altitude. If you get above 100 atoms with a big basis set, aug-cc-pVDZ or some such, the overhead gets really expensive. Even when you start scaling your computer resources to meet it, there is not much headroom. There is a large, active push toward learning how to cut corners to make accurate, DFT quality calculations much easier. Some newer ways include semi-empirical methods which tend to be descendants of AM1 and newer ways of cutting corners with DFT, like DFTB and so on. Active pressure exists with people trying to jump the gun on quantum computers with the immediate intent of making quantum chemistry simulations on them. Finally, AI is also being brought to bear; if you have enough examples of how molecular orbitals are calculated, you can turn them into a training set and deep-learn the crap out of it and then skip the calculation altogether by training an AI that guesses the right molecular orbitals without ever really calculating a single integral.

Getting above the line to make systems that are appreciably more reality oriented takes you into other territory. I’ve started working with a member in my group learning how to do atomistic molecular dynamics. This combines quantum chemistry calculations on bond springiness with experimental knowledge of Van der Waals radii to create a relatively low expense mass-on-springs model that can be scaled up hugely.

Fig 14. 81 molecules of Benzene, 2x2x2 nm box, OPLSaa force field (VMD and Gromacs).

This tiny simulation was produced using Gromacs and the OPLS-aa force field with some custom Python coding to set up the starting state and visualized by VMD. The initial Benzene geometry was set by Gaussian 16 and partial charges for the field were determined by the CHELPG method using the DFT calculations mentioned earlier in this post. I didn’t tune the force field at all, but I’ve been doing a bit of that in some more complicated models. The image in figure 14 is a 3D image that can be seen by crossing your eyes to merge the images. How many blog posts ever come at you in 3D?! I decided not to use the full animation because it’s kind of expensive and relatively hard to see; had I planned it better, I would’ve captured more frames during the simulation. Some molecules in this image show clipping: the box is set with periodic boundary conditions that allow molecules to walk out one side and back in on the other, meaning that the clipped molecules are appearing on opposite walls of the box.
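The periodic wrapping that causes the clipping is simple to sketch. This is a generic illustration for a rectangular box, not the actual Gromacs code or my setup script, and the coordinates are made up.

```python
import numpy as np


def wrap_into_box(positions, box_lengths):
    """Map coordinates back into [0, L) along each axis of a rectangular box."""
    positions = np.asarray(positions, dtype=float)
    box = np.asarray(box_lengths, dtype=float)
    return positions - np.floor(positions / box) * box


box = [2.0, 2.0, 2.0]  # nm, matching the 2x2x2 nm box in figure 14
atoms = [
    [1.9, 0.5, 1.0],   # still inside the box
    [2.1, 0.5, 1.0],   # stepped out through the +x wall
    [-0.3, 2.4, 1.0],  # stepped out through the -x and +y walls
]

wrapped = wrap_into_box(atoms, box)
print(wrapped)
```

The first two atoms could be bonded neighbors only 0.2 nm apart, yet after wrapping they sit near opposite walls of the box, which is exactly what a clipped molecule looks like.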

Edit 12-21-20:

I have an excuse to add more pretty and hopefully interesting pictures. One mechanism for converting difficult to interpret symmetry adapted molecular orbitals into a form that makes some sense is a form of localization called Natural Bond Orbitals (NBO) which operate somewhat like NTOs. NBOs are produced from superpositions of the molecular orbitals that maximize the density of electrons between any two centers. Like NTOs, they come in pairs, a bonding orbital and an anti-bonding orbital.

This method of localization can be used to produce the balloon animal σ- and π-orbitals that everyone is used to thinking about when sketching molecules.

π-bond NBO, π-bond NBO electron density

π-antibond NBO, π-antibond NBO electron density

These are the densities associated with the π-bond type orbitals, of which there are 6.

These are σ-bonds on the ring.

These are σ-antibonds on the ring.

This next set is the set of σ-bonds that are associated with the six carbons in the benzene ring.

These are σ-bonds to the hydrogens

These are σ-antibonds to the hydrogens.

As I understand it, these are selected in terms of occupancy and the actual electron density of the molecule can be generated from a superposition of these orbitals based on the degree of occupancy. For benzene, the occupancy is near 2 electrons for each of the bonding orbitals and small for each of the antibonding orbitals. The π-bonding system deviates from this the most with the occupancy near 1.6 for the bonding orbitals and 0.4 for the antibonding orbitals. Depending on structure, there can be additional density showing up in weird bonds, but this appears to work.

This is the closest I’ve seen to a remedy for the Organic Chemistry Lie. It’s important to remember that localization schemes tend to be somewhat ad hoc; this is a superposition of the real molecular orbitals which serves to localize electron density in a way that is logically interpretable. That isn’t the same as claiming the electrons are actually localized.

Magnets, how do they work? (part 5)

foolish physicist — Tue, 12 May 2020 14:30:42 +0000

(The last piece to the puzzle. Previous sections can be found here: Part 1: Magnetic Field, Part 2: Magnetic Dipoles, Part 3: Magnetic Force, Part 4: Quantum Mechanical Spin)

This post has been sitting on the back burner for a very long time now. I established the physical theory for what a magnetic field is in the very first post. I then talked about how magnetic dipole moments arise from basic magnetic fields. Next, I spoke about the physics of why magnetic dipoles induce forces upon one another and how those forces act. I then slipped the surly bonds of reality and spoke about the existence of quantum mechanical spin, relating where exactly the tiny dipole moments that underpin magnetism come from. This gets us to a place where a very subtle collection of “other stuff” still resides that no one pretty much ever actually talks about. Turns out that this other stuff which fills in the remaining gaps is pretty difficult to understand on its own and is required for a subtle reason that is somewhat hard to explain. I’ve percolated over it for a long time trying to decide how best to tackle it. Realistically, I would say that I don’t fully understand the math because it is very complicated, so I will mainly be writing about the parts of this set of ideas that I do understand.

I’ll start by trying to motivate where this weird, freaking hard gap resides in the description I’ve pushed thus far. In part 4, I’ve given you a magnetic dipole moment that just IS. Spin is a purely quantum mechanical effect that can be considered the means by which you give feature and shape to effectively point-like particles that inhabit the puffy underbelly of reality. If you drop two electrons on top of one another so that they are positionally and energetically identical, they break symmetry from one another by inhabiting two separate fundamental ‘orientations.’ These orientations can be distinguished by immersing these electrons in a magnetic field: one gives a dipole moment to the electron that effectively points with the external field, while the other points against it. A spatial wavefunction that is otherwise indistinguishable is made distinguishable at a finer structure level with the electron energies offset by some splitting quantity related to spin and strength of the magnetic field (Otherwise called the Zeeman effect). If you were to trap a single electron all by itself and look really closely at it, you would discover that it has a tiny magnetic field on its own that looks exactly like the magnetic dipole I constructed in part 2; this is The electron dipole moment –(cue eerie music). You might also know it as the Bohr Magneton. If you were to gather a bunch of electrons together and orient these so that they all face in the same direction, you would get a construct like a bar magnet, which is to say a measurable magnetic field that can exert forces on nearby dipolar magnets, like a compass needle.
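To give a sense of scale for that splitting, here is a rough Python sketch of the energy gap between the with-field and against-field orientations of a single free electron. The constants are CODATA values typed in by hand and the 1 tesla field is just an example.

```python
# Physical constants (CODATA 2018 values).
BOHR_MAGNETON_J_PER_T = 9.2740100783e-24
ELECTRON_G_FACTOR = 2.00231930436  # magnitude of the free-electron g-factor
ELEMENTARY_CHARGE_C = 1.602176634e-19


def zeeman_splitting_ev(field_tesla: float) -> float:
    """Energy gap in eV between the two spin orientations of a free electron.

    Uses delta_E = g * mu_B * B, which ignores any orbital contribution.
    """
    energy_joule = ELECTRON_G_FACTOR * BOHR_MAGNETON_J_PER_T * field_tesla
    return energy_joule / ELEMENTARY_CHARGE_C


print(zeeman_splitting_ev(1.0))  # splitting in a 1 tesla field
```

Even in a respectable 1 tesla field the gap is only about a ten-thousandth of an electron volt, which is tiny next to the several eV involved in the UV transitions of a molecule like benzene.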

The problem with trying to do this, gather electrons together, is that electrons all have negative charge. They all have negative charge, without exception. Pushing two electrons together, their charges cause them to repel against one another. So, you take a bunch of electrons, gather them into the same location and let go. Boom, they explode apart. All the negative charge repels and they fly away from each other, no forces present to prevent it. So, for electrons alone, there’s no clear way for a bunch of electron spins to stay proximal enough to each other to reinforce into a powerful dipole field.

If you want to gather two electrons together stably in the same location, one way around the repulsion effect is to exploit electrostatics and neutralize the charges of electrons so that they can be near one another. For example, you get a naked helium nucleus. The nucleus has a charge of +2 and therefore exerts enough force to pull two separate electrons into close proximity with one another. They drop in around the nucleus until they become close enough to each other that their repulsion stops them from moving closer together, as balanced by the attractive force exerted by the nucleus. As helium is spherically symmetric and the electrons are essentially shapeless, both end up seeing exactly the same environment as the other and they tend to occupy physically identical states… up to a point. Because these electrons obey Fermi counting statistics (that is, they are fermions) they are required to exist in antisymmetric state superpositions. The molecular orbitals they inhabit are indistinguishable in shape, except for spin. So, one electron ends up in the alpha spin state, while the other drops into beta… rather, they both end up in superpositions of this where an electron is in “alpha” and another is in “beta,” but you can’t tell which is which. I call them “alpha” and “beta” here because you can’t tell that one is “up” and one is “down” until these electrons are subjected to an external magnetic field.

This presents two problems. First, electrons gathered by these means innately turn their spins opposite to one another, canceling out any magnetic field seen far away. Second, without some external means of orienting the system, the system is spherical and cannot be seen to point in any direction in particular without some external means of setting a direction, say by adding an external magnetic field. Reality is that this isn’t even a bad thing since the phenomenon of diamagnetism is almost exactly this: a piece of non-magnetic matter becomes magnetically poled by induction from an external field. Problem with that is that diamagnetism is very very weak and you can’t witness it as easily as the ferromagnetism of a bar magnet.

You could perhaps get away from this pesky fermion situation by switching to a boson. Two bosons can drop into the same fundamental spatial quantum state. You could get them into the same location somehow, say by the force of gravity, but you lose out on the directional spin in most cases or become unable to manipulate the particle in question. Photons and W and Z particles all have integer spin, meaning they spin in some up or down fashion, but there’s no known way to gather these and hold them in some location. Composites of a proton and electron are effectively a boson, but the system has a huge amount of other structure, meaning that they don’t just occupy up or down spin and the directions of their spin are pretty much not controllable under most conditions. It is thought that metallic hydrogen at the cores of Jupiter and Saturn is responsible for the magnetic fields of those planets, so I can’t say it doesn’t happen with bosons, but that the physics are not the same as what is seen in bar magnets –with regard to metallic hydrogen, I also can’t say that the magnetic field here is strictly a result of spin since it might be electric current too. One paper I’ve found says that metallic hydrogen may well be ferromagnetic.

Neglecting the possibility of bosons, you can also get around the fermion conundrum by going to situations where you have an odd number of electrons. Now, the unpaired electron would have a free spin. Unfortunately, you also end up tripping over the second problem above where the system is not necessarily oriented. So you’ve got two atoms or molecules each with an odd number of electrons, there is nothing to say that they must point their free electron spins in the same direction…

In fact, since magnetic fields contain energy, a thermodynamically governed system at its minimum free energy, in absence of some effect contributing an off-set in entropy, will prefer to be at its minimum energy. And that is to say possessing no magnetic field or canceling its magnetic fields out internally. The Up and Down directions of spin only matter in the sense that you have a field or some means of establishing a polarity for the spin directions. Without some means of “poling” the spins, all you have is two indistinguishable flavors of spin not known to point in any particular direction –except, of course that they are in opposition when in the same spatial orbital, canceling out each other’s magnetic dipole moment.

Now, there’s nothing actually wrong with that. You can have systems with unpaired electrons that have magnetic moments. Molecular oxygen, for instance, has unpaired electron spins. And, when you put oxygen into a magnetic field, it orients with the field and you get a detectable magnetic moment that is driven by the unpaired spins. This phenomenon is called “paramagnetism.” But, again, you can’t just build a bar magnet of oxygen which always has a magnetic field.

And so, we hit the crux of the puzzle. To have a bar magnet with a detectable magnetic field, the electron spins associated with atoms inside the material must be spontaneously oriented by some means. If you stop and actually try to figure out how that happens, it is freaking mind bending! So mind bending, in fact, that I’ve stalled over writing this post for several years.

How does it work?

I’ve debated for a very long time how best to describe this situation in a manner that is both accessible and lucid. The math really does turn out to be a nightmare and without some very specialized skill, it is not going to be that accessible to the casual reader. I therefore have chosen to try to be mostly non-mathematical in this post and to wield contextual examples wherever possible.

The problem here splits into two components that I will try to address separately. The first is why electron spins end up pointing in the same direction in magnets at all. The second is why groups of spins pointing in the same direction can be lined up along some preferred direction relative to the outside of the bar magnet.

The first part depends on the Pauli Exclusion Principle. Named for Wolfgang Pauli, who earned a Nobel Prize in 1945, the Pauli Exclusion Principle is an extension of the fermion nature of electrons. One of the consequences of fermion counting is that such particles enter into a wave function in anti-symmetric superpositions of states. The most famous result is that no two electrons end up able to present the same set of quantum numbers, or that no two electrons can occupy the same quantum state. When you try to represent a collection of fermions in a wave function, the wave function must be constructed so that it is a -1 eigenstate of the exchange operator. This is one way to address anti-symmetry; that if two electrons in the wave function are exchanged with one another, that the wave function inverts on itself, or produces a -1 eigenvalue when hit with the exchange operator. A wave function that obeys exchange anti-symmetry can be constructed by creating a sum of terms where the eigenstates of the electrons are permuted among the electrons, creating what is called a single-determinant wave function, or a Slater Determinant. In this, the operation of forming a determinant, as expressed in linear algebra, is used to fabricate the wave function. This operation tends to create wave functions that are utterly inexpressible, where permutations for a relatively small number of particles in the wave function churn out unfathomably enormous numbers of terms… on the order of 10^51 (42 factorial, one term per permutation of its 42 electrons) for a molecule as simple as benzene.
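Here is a small sketch of what the determinant buys you, using two invented one-dimensional functions as stand-ins for real spin orbitals. Nothing here comes from an actual calculation; it only shows the sign flip under exchange and how fast the number of permutation terms grows.

```python
import math

import numpy as np


def slater_amplitude(orbitals, coordinates):
    """Value of a single-determinant wave function at the given coordinates.

    Row i holds every orbital evaluated at particle i's coordinate, so
    swapping two particles swaps two rows and flips the sign.
    """
    n = len(orbitals)
    matrix = np.array([[phi(x) for phi in orbitals] for x in coordinates])
    return np.linalg.det(matrix) / math.sqrt(math.factorial(n))


# Two made-up orbitals, an even one and an odd one.
orbitals = [
    lambda x: math.exp(-x ** 2),
    lambda x: x * math.exp(-x ** 2),
]

psi_12 = slater_amplitude(orbitals, [0.3, -0.8])
psi_21 = slater_amplitude(orbitals, [-0.8, 0.3])   # particles exchanged
same_place = slater_amplitude(orbitals, [0.5, 0.5])

print(psi_12, psi_21)  # equal magnitude, opposite sign
print(same_place)      # vanishes when both particles share a coordinate

# Number of permutation terms in a fully expanded determinant for the
# 42 electrons of benzene (6 carbons x 6 plus 6 hydrogens x 1).
benzene_terms = math.factorial(42)
print(f"{benzene_terms:.3e}")
```

The vanishing amplitude for two particles at the same coordinate is the exclusion principle in miniature, and the factorial is why nobody ever writes the expanded determinant out.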

When operating the Schrödinger equation onto a Slater Determinant, the structure of the permutation can be worked into the equation such that you can actually skip ever forming the determinant directly. This gives rise to energetic interaction terms between pairs of electrons occupying pairs of states. The first electron-electron interaction is the coulombic repulsion term, which is simply the electrostatic repulsion pressure one electron has on another when they are in proximity to each other. The second interaction term is called the “Exchange Interaction” and it is an energy describing the situation where one indistinguishable electron switches identities with another and they exchange states. Yes, this is the epitome of quantum weirdness! This term applies specifically to electrons that are not distinguishable, which is to say electrons of the same spin that are near to one another.

This is the exchange energy term, just so that you know what it looks like. This term looks very similar to the potential energy between two repulsive electron charges. The difference is in the Phis; electrons ‘1’ and ‘2’ exchange between spatial states ‘a’ and ‘b’ where the strength of the interaction falls off as the reciprocal of the distance between the electrons. The integral emerges because both electrons end up being distributed across space in some non-trivial way, requiring you to weight every combination of positions those electrons occupy based on the wave functions they are in.
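For reference, a standard textbook way to write an exchange integral that fits the description above is given here in LaTeX. The symbol K_ab, the atomic units and the index placement are my assumptions and may not match the notation of the original figure.

```latex
K_{ab} = \iint \phi_a^{*}(\mathbf{r}_1)\,\phi_b^{*}(\mathbf{r}_2)\,
         \frac{1}{\lvert \mathbf{r}_1 - \mathbf{r}_2 \rvert}\,
         \phi_b(\mathbf{r}_1)\,\phi_a(\mathbf{r}_2)\,
         d\mathbf{r}_1\, d\mathbf{r}_2
```

For comparison, the ordinary coulombic repulsion integral has the same shape but with the last two factors written as \phi_a(\mathbf{r}_1)\,\phi_b(\mathbf{r}_2), so each electron stays in its own state instead of trading places.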

Electron exchange ends up being the basis of ferromagnetism!

The crux is simply this. Remember where I said above that minimizing the strength of the magnetic field also minimizes the energy? This is one way to consider why fermions, like electrons, almost always drop into the same spatial orbital in opposite spin states. Most of the time, if they fall in with opposite spins, they minimize their magnetic energy by canceling out their magnetic dipole moments as a pair. In a ferromagnet, for metallic atoms of Iron, Nickel and Cobalt, it turns out that this is actually not the case. If nearby spins drop into orbitals where they have the same spin value, they achieve a lower overall energy than they would if they dropped into the same orbital in opposite spins. The reason is specifically to facilitate exchange energy minimization. They would rather have greater magnetic energy because in doing so they have lower exchange energy and the combination is enough that their overall energy is lower than it would be otherwise!

Some of the papers I’ve looked at talk in terms that are peppered with references to condensed matter physics: strongly bound bands and anti-symmetry in Wannier orbitals and such. I tried to wade into that and decided that it would probably not help my discussion here because it becomes fairly opaque. I’m not a condensed matter physicist and, while I have a basic understanding of “density of states” and “Bloch quantization,” this is not something that I understand exceptionally well. One advantage that I do have is that I’m really not in the same place I was when I first started writing this series on how magnets work several years ago.

I’ve gotten access to some tools that are absolutely killer for trying to model these sorts of situations on a molecular level!

The first thing I did, which failed kind of spectacularly, was to try to model a crystal cell from magnetite using Gaussian 16. Magnetite is among the most famous magnetic crystals and is probably the material present in whatever bar magnet you happen to own. The problem is that the unit cell has 56 atoms and 880 electrons. Including periodic boundaries ended up taxing my computer resources so spectacularly that a little more memory was never quite enough.
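Those two counts can be reproduced with a few lines of Python. The sketch assumes the conventional cubic cell of magnetite holds 8 formula units of Fe3O4, which is what the 56 atoms imply, and counts electrons for neutral atoms.

```python
ATOMIC_NUMBER = {"Fe": 26, "O": 8}


def count_cell(formula_unit, units_per_cell):
    """Return (atoms, electrons) in a cell built from neutral formula units."""
    atoms = sum(formula_unit.values()) * units_per_cell
    electrons = sum(
        ATOMIC_NUMBER[element] * count for element, count in formula_unit.items()
    ) * units_per_cell
    return atoms, electrons


# Assumption: 8 formula units of Fe3O4 per conventional unit cell.
atoms, electrons = count_cell({"Fe": 3, "O": 4}, units_per_cell=8)
print(atoms, electrons)
```

Every one of those electrons needs basis functions, and the iron atoms need d-type functions on top of that, which is where the memory goes.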

Magnetite. X-ray structure taken from this paper. Haavik et al Am Min 85 (3-4): 514–523 (2000)

Here is the unit cell for magnetite. The red atoms are doubly charged (-2) oxygens while the purple atoms are a combination of Fe(III) and Fe(II) iron atoms (octahedral coordination for III, tetrahedral for II). Even going for a small basis set, this is beyond my computational resources in large part because there are many atoms and because the iron needs d-shells to be done even remotely correctly.

As I worked my way into this, I learned some things about metal liganding that made representing the system somewhat easier. I dialed back my demands and decided to focus on the bare minimum system. The next trial involved two iron atoms liganded with hydroxide… this ended up being a bit of a failure too, but for the reason that I didn’t fully understand the chemistry I was dealing with. With the system involving hydroxide, one iron atom preferred an octahedral geometry, while the other involved trigonal bipyramidal.

Fe(V) and Fe(VI) with hydroxide and oxygen ions, Gaussian 16 uB3LYP/3-21G.

This system ended up showing signs of ferromagnetism, but not because I knew what I was dealing with. The charge state was set to neutral charge, which, given the charges of -1 on the hydroxide and -2 on the oxygen ions, requires the Iron to be of +5 and +6 charge states, or Fe(V) and Fe(VI). This is different from the magnetite model I had originally tried to base my work on, which contains only Fe(II) and Fe(III). To get a convergent wave function, I also needed to set the spin for a dectet, which I didn’t fully understand at the time when I did it. Shoot first and look later, right? Truth is that I’ve never really dealt with iron in any formal capacity and I needed to learn a lot about d-shells before I got anything that made real sense. The basic design here, however, seemed sound because it allowed me to place two iron atoms in close enough proximity in a similar charge state to mimic what can happen in a crystal of magnetite. I understand the purpose of the spin state setting now, but not at the time.

Periodic Table, taken from here.

In this version of the periodic table, I’ve highlighted iron so that you can see where it is. Iron is in the transition metals there in column 8. The valence structure of iron is in the d-shell: counting from the left, neutral iron has 6 electrons and iron begins filling its period 4 s-shell before it starts filling its period 3 d-shell.

The way that the periodic table labels the s, p, d periods, each shell orbital is filled in closure with one version of each spin type in each orbital. s-shell has 1 spatial orbital, and 2 spins, giving column 1 and 2 on the left side of the table (except helium, which has only a filled s-shell and goes into the Noble gas column). p-shell has 3 spatial orbitals and 2 spins, giving 6 columns and only starting on the second row after Be with columns 13 to 18. d-shell has 5 spatial orbitals and 2 spins, giving 10 columns, but not starting until the 4th row because the 3rd p-shell and 4th s-shell both fill with electrons before the 3rd d-shell. The d-shell is columns 3 to 12. This idea of “closure” is basically the magnetic cancellation that I mentioned earlier, but is better considered as angular momentum closure, which is to say that configurations with minimal angular momentum tend to be more stable than otherwise… which is why the Noble gases on the right are particularly stable; they have the most completely canceled angular momentum of any atoms since all their occupied shells are closed.
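The column counts follow from one small formula: a shell with angular momentum number l has 2l + 1 spatial orbitals, each holding two spins. A quick Python sketch, with names of my own choosing:

```python
def shell_capacity(l: int) -> int:
    """Electrons needed to close a shell with angular momentum number l."""
    spatial_orbitals = 2 * l + 1
    return 2 * spatial_orbitals  # one alpha and one beta spin per orbital


# l = 0, 1, 2 correspond to the s, p and d shells.
capacities = {name: shell_capacity(l) for l, name in enumerate("spd")}
print(capacities)
```

Those three capacities are the widths of the s, p and d blocks of the table: 2, 6 and 10 columns.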

One thing that makes this a little more complicated is that a system containing degenerate states at some energy level will tend to fill that level according to what’s called Hund’s rule. This follows from the first Hund’s Rule in the wikipedia link. For a level, the electrons tend to singly occupy each orbital before they start doubling up… which is important for how the d-shell fills.

For d-shells, there is also one other crazy feature which seems to add to the fog of war, which is that atoms preferring octahedral liganding, like Iron, tend to create a split in the d-shell where two orbital levels are offset to a slightly higher energy than the first three. If the energy splitting between the two sets of states is sufficiently large, Hund’s rule allows electrons to spread across only the first three states before ever filling the last two states. The system that only fills the first three states is called Low spin, while a system that can fill all five is called High spin.

This diagram shows how the states could be filled in uncharged iron. The low spin configuration has no angular momentum and would produce a spin singlet. The high spin version, on the other hand, has four orbitals occupied singly and would produce a spin quintet. The quintet splitting comes from the fact that the singletons could be either up or down spin and that there are five possible fine structure energies that could emerge ranging from all up to all down with combinations of up and down in between.
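The counting behind the singlet and the quintet can be sketched in Python. This is a simplified bookkeeping model of my own that assumes the octahedral pattern described above, three lower orbitals and two upper ones, and it says nothing about which filling is actually lower in energy.

```python
def unpaired_d_electrons(n_electrons: int, high_spin: bool) -> int:
    """Count unpaired electrons in a d-shell split three-below-two.

    high_spin=True treats all five orbitals as one level for Hund's rule;
    high_spin=False fills the lower three orbitals before the upper two.
    """
    if not 0 <= n_electrons <= 10:
        raise ValueError("a d-shell holds between 0 and 10 electrons")
    if high_spin:
        # Singly occupy all five orbitals, then start pairing.
        return n_electrons if n_electrons <= 5 else 10 - n_electrons
    lower = min(n_electrons, 6)   # the lower three orbitals hold up to 6
    upper = n_electrons - lower   # any overflow goes to the upper two
    unpaired_lower = lower if lower <= 3 else 6 - lower
    unpaired_upper = upper if upper <= 2 else 4 - upper
    return unpaired_lower + unpaired_upper


def multiplicity(unpaired: int) -> int:
    """Spin multiplicity 2S + 1 when every unpaired spin is parallel."""
    return unpaired + 1


# Six d electrons, as in the diagram for uncharged iron.
print(multiplicity(unpaired_d_electrons(6, high_spin=False)))  # low spin
print(multiplicity(unpaired_d_electrons(6, high_spin=True)))   # high spin
```

With six electrons the low spin filling pairs everything in the lower three orbitals, while the high spin filling leaves four singletons, which is where the singlet and quintet labels come from.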

I came to focus on combinations of Fe(II) with Fe(III). The oxidation state sign here reflects the number of electrons that oxygen has stolen from iron and directly correlates in this case with the charge on the iron: Fe(II) = +2 charge, Fe(III) = +3 charge. If you ignore Hund’s rule to start with and simply invoke spin closure, this would be the d-shells of each.

This would be a ground state with a spin doublet. As it turns out, I was unable to converge geometry for any structure in this state. Solving wave functions depends quite strongly on having a good guess at the initial geometry and at the initial wave function… if you’re lacking on either, no solution will be possible. The doublet may well just be too much a violation of the reality to craft any wave functions. I was able to get a singlet for an Fe(III) – Fe(III) complex, but no multiplets ended up possible.

Instead of looking further at the spin doublet, I went to the next multiplicity of states in the Fe(II) – Fe(III) complex, the Low spin combination. I adopted a similar geometry to what was seen above with the hydroxide, but substituted water for hydroxide at a large enough frequency to make an uncharged complex with Fe(II) and Fe(III). In this case, the liganding geometry starts to fall apart and trigonal bipyramidal geometry is no longer visible. Moreover, the waters appear to be undergoing some acid-base chemistry in the process of the simulation by exchanging around protons which actually never come back to a fully bonded length (this is omitted from my images here because I filled in bonding to make it sensical).

First, here is the low spin orbital configuration:

I was able to find a converged geometry, starting from this electron configuration.

Low spin complex Fe(II), Fe(III) with water and hydroxide. Gaussian 16 uB3LYP/3-21G.

This is the low spin complex. It might have been possible to solve a structure for low spin with both atoms in octahedral form, but I really didn’t want to spend the time. This structure has a spin sextet. There’s a quirk to these solutions that I will mention in better detail after I introduce the high spin complex.

And, yes, I did find a high spin complex with the same atoms and electron count. Here is the electron configuration:

As you can see, the spins are now completely dispersed in low and high energy orbitals. This is now a spin octet. The structure itself doesn’t look very different from the low spin version of the same complex.

High spin complex Fe(II), Fe(III) with water and hydroxide. Gaussian 16 uB3LYP/3-21G.

Now, before I go on, there’s something else that you need to understand about the orbital models that I started with.

They are completely wrong!

If you want to know more about the organic chemistry lie, that is here. The orbital ideas introduced above all hinge on the idea of closure. This is the notion that every spatial orbital you might discover is incomplete unless it is filled… or even designed to be filled… by two anti-parallel spins. This turns out to be a lie. Even down to the tool of the Periodic Table, as detailed above, it’s a lie! In reality, there is no reason at all that individual electrons can’t simply occupy individual orbitals!

The idea introduced above for Hund’s rule relies on a notion called “restricted open-shell” orbitals: every orbital is doubly occupied except the few open-shell orbitals that hold a single electron. This turns out to be a simplification of the reality that completely removes a big, important subtlety. The solutions that I found for these complexes were created by Unrestricted DFT (denoted by the “u” in the uB3LYP/3-21G designation). Unrestricted solutions take all alpha electrons and all beta electrons in the complex and find orbitals for them at whatever energy a given orbital happens to fall at. In this form of solution, no orbitals are doubly filled. You may glance at the high spin configuration and shrug your shoulders: well, all of those are singly occupied anyway! Well, yes, but that says nothing about every other electron in the complex, including those for the waters and the hydroxides! In the unrestricted solutions, all orbitals are singly occupied wherever they be, and no two of them need have the same energy.

The reason this is important is because the solution removes the ambiguity in the spins introduced with Hund’s rule. For the energy diagrams, all the singly occupied orbitals are left ambiguous in that they could be “up” or “down.” For the unrestricted solutions, a given orbital is assigned “alpha” or “beta” with no ambiguity! If alpha is taken to be “up,” all spins of alpha flavor are strictly and unambiguously up!

What I gain by looking for unrestricted orbitals is a way to measure ferromagnetism on the level of atoms.

Low spin. Mulliken spin population, green=alpha, red=beta. Energy = -3196.738623 Ht.

High spin. Mulliken spin density, green = alpha, red = beta. Energy = -3196.755640 Ht.

These images are colored by Mulliken population spin density. This is a spin density representation which goes on a color gradient scale: extreme alpha spin is green, neutral is black and beta is red.

First, there is no particular spin polarity for most of the complex, except on orbitals localized at the iron atoms. For the low spin complex, one iron is strongly spin polarized to alpha, while the second is only weakly so. No part of the complex is polarized explicitly to beta. In the high spin complex, both iron atoms are strongly spin polarized to alpha. This would hardly matter but for the energies associated with these states… the high spin state has a lower energy! If you work the numbers from the two energies quoted above, the difference is 0.017 Ht, or about 0.46 eV: roughly half an electron volt. To understand how significant a difference that is, consider that the energy in a covalent bond is of the order of 1 or 2 eV. The low spin state could be reachable thermally, but you will mostly see the high spin state. Moreover, I was unable to find more weakly polarized states than these; I could not solve for a doublet. The trend implies strongly that the most prevalent state is the one with the electrons around these nearby nuclei strongly spin coupled.
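The arithmetic behind that comparison can be sketched in a few lines of Python. This is only a rough check, not part of the original calculation: it takes the two energies from the image captions, converts the gap from Hartree to eV, and then compares it with thermal energy through a bare Boltzmann factor. The room temperature of 298.15 K and the neglect of degeneracy and entropy are assumptions made here for illustration.

```python
import math

HARTREE_TO_EV = 27.211386    # conversion factor, eV per Hartree
K_B_EV_PER_K = 8.617333e-5   # Boltzmann constant in eV/K

# Energies quoted in the image captions, in Hartree.
E_LOW_SPIN = -3196.738623    # sextet
E_HIGH_SPIN = -3196.755640   # octet


def gap_ev(e_upper, e_lower):
    """Energy gap between two states, converted from Hartree to eV."""
    return (e_upper - e_lower) * HARTREE_TO_EV


def boltzmann_ratio(gap, temperature_k):
    """Population of the upper state relative to the lower one.

    Assumption: a plain exp(-gap/kT) factor with no degeneracy or
    entropy terms, so this is an order-of-magnitude estimate only.
    """
    return math.exp(-gap / (K_B_EV_PER_K * temperature_k))


gap = gap_ev(E_LOW_SPIN, E_HIGH_SPIN)
ratio = boltzmann_ratio(gap, 298.15)  # assumed room temperature
print(f"gap = {gap:.3f} eV, kT = {K_B_EV_PER_K * 298.15:.4f} eV")
print(f"low spin / high spin population ~ {ratio:.1e}")
```

The gap comes out at about half an electron volt, many times larger than kT at room temperature, which is the sense in which the high spin state dominates.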

When you start looking at the molecular orbital populations, it turns out that both of these complexes have very lopsided orbital occupancy. The high spin version has 65 beta orbitals and 72 alpha, or seven more alpha spin-orbitals than beta.
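The link between those orbital counts and the name of the spin state is simple bookkeeping: the multiplicity is 2S + 1, and with each unpaired electron contributing spin 1/2 that reduces to (alpha count minus beta count) plus one. A minimal sketch follows; the function names are made up for illustration, and the sextet split is inferred here from the statement that both complexes have the same electron count, not read from the output files.

```python
def multiplicity(n_alpha, n_beta):
    """Spin multiplicity 2S + 1, where S = (n_alpha - n_beta) / 2."""
    return (n_alpha - n_beta) + 1


def alpha_beta_split(n_electrons, mult):
    """Alpha and beta counts for a given electron count and multiplicity."""
    unpaired = mult - 1
    if (n_electrons - unpaired) % 2 != 0:
        raise ValueError("electron count and multiplicity are incompatible")
    n_beta = (n_electrons - unpaired) // 2
    return n_beta + unpaired, n_beta


# High spin complex from the text: 72 alpha and 65 beta spin-orbitals.
print(multiplicity(72, 65))
# Inferred split for the sextet with the same 137 electrons.
print(alpha_beta_split(72 + 65, 6))
```

The `ValueError` branch is also why the scan further on only uses even multiplicities: with an odd number of electrons, the number of unpaired electrons has to be odd.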

In addition to this, because the fundamental model used to design the initial experiment is itself broken, I went on a fishing expedition. Noting that the sextet and octet geometries are very similar, I took the octet geometry and did a spin scan. I scanned spin multiplicities doublet (2), quartet (4), sextet (6), octet (8), dectet (10) and dodectet (12). The 2 and 4 multiplicities failed to converge (I was expecting 2 to fail from my earlier work), but all the rest converged!
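A scan like this is easy to script. The sketch below generates one Gaussian input file body per multiplicity at a fixed geometry. It is not the script actually used for this post: the route line (an unrestricted B3LYP/3-21G single point), the file names, and above all the two-atom placeholder geometry are assumptions for illustration, and the placeholder has to be swapped for the real optimized octet geometry before Gaussian would accept these jobs.

```python
ROUTE = "# UB3LYP/3-21G SP"  # assumed route line: single-point energy

# Placeholder only: NOT the Fe(II)/Fe(III) water-hydroxide complex.
PLACEHOLDER_GEOMETRY = [
    ("Fe", 0.0, 0.0, 0.0),
    ("Fe", 0.0, 0.0, 3.0),
]


def gaussian_input(title, charge, multiplicity, geometry, route=ROUTE):
    """Return the text of a minimal Gaussian input file.

    Layout: route line, blank, title, blank, "charge multiplicity",
    one line per atom, then a terminating blank line.
    """
    lines = [route, "", title, "", f"{charge} {multiplicity}"]
    for symbol, x, y, z in geometry:
        lines.append(f"{symbol:<2} {x:12.6f} {y:12.6f} {z:12.6f}")
    lines.append("")
    return "\n".join(lines) + "\n"


# One input per multiplicity in the scan; the complex is uncharged.
inputs = {}
for mult in (2, 4, 6, 8, 10, 12):
    name = f"spin_{mult:02d}.gjf"  # hypothetical file name
    inputs[name] = gaussian_input(
        f"spin scan, multiplicity {mult}", 0, mult, PLACEHOLDER_GEOMETRY
    )

print(inputs["spin_08.gjf"])
```

Keeping the geometry fixed is what makes this a scan and not six separate optimizations, which is also why the off-optimum states come out higher in energy than they would after relaxing.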

Spin state occupancy (graph).

Rather than trying to wedge a 1920s model into a 2020 simulation, this makes use of the true power of quantum chemistry by creating every possible wave function and checking its energy. It turns out that the octet isn’t quite the lowest energy state… the dectet is, but only by a bit.

Super-high spin, alpha = green, neutral = black, beta = red. uB3LYP/3-21G

This configuration might be called the “super-high spin” geometry. I’ve left the waters in their partly broken state here, so protons are not always explicitly bonded to the oxygens and are sometimes at strained lengths. Again, this is because the simulation allows for bonds to break. There can be no doubt at all that the iron spins are locked as alpha here and that this is the state that will be most frequently observed. In this plot, the sextet energy is inflated (1.8 eV versus the roughly 0.5 eV found earlier) because this is not the optimal geometry for the sextet… it’s the optimal geometry for the octet. That’s noteworthy because it’s also not the optimal geometry for the dectet, meaning that the energy for 10 will probably go down more if the geometry is optimized. On the other hand, I don’t know that I really care since perfecting the energies doesn’t actually change my point at all… it would be a good deal of computer time for very little pay-off (which is why I kept to B3LYP and 3-21G rather than going for a better functional or basis).

(Edit 5-24-20:

Because I had a moment where the computer was lying dormant, I came back and calculated geometries for the 10 and 12 multiplets and got energies for them in the process.

Spin states, accurate (graph).

This shows more accurately the relative energies between the spins because I have optimized geometries where all of these energies are taken. You can see that the 6-tet is now 0.5 eV higher energy than the 8-tet. The 10-tet and the 12-tet both decrease in energy relative to the 8-tet, as expected. However, the overall trend is not different from what I said originally. The dectet is still the most stable state.)

The transition from the restricted open-shell chemistry problem to the unrestricted model causes people’s chemistry intuition to break. The classical notion of a covalent bond involves this idea of angular momentum closure, where a covalent bond is two electrons with canceled anti-parallel spin. In the case of the iron complex above, this idea is completely broken in the valence band.

Alpha HOMO for octet configuration. uB3LYP/3-21G

This is the highest occupied molecular orbital for the alpha spins. That’s one electron literally ranging all over the complex, through the waters and between the irons! The orbital looks kind of like an anti-bonding orbital since it has nodes everywhere, but some of the lobes may well also be bonding. In some ways, it shouldn’t be too surprising because it does look kind of d-orbital-like around the iron on the right, but it goes everywhere! That the orbital connects between the two iron nuclei may well be confirmatory as to why the alpha spins appear to center on the irons. Of course the spins on the iron nuclei are locked; the orbitals that contain them are spread to both such nuclei. You can use rules like Hund’s rule to start trying to “understand” this, but be aware that the reality is actually somewhat messier.

The point overall is that the Fe(II) – Fe(III) combination can appear in a form where the spins of the electrons are coupled to one another and that this is an energetically favorable situation. I assert that Exchange energy is the reason for this, but not without support. Also from what I read, Magnetite contains a significant number of spin-coupled Iron atoms, though it should be noted that not all point the same direction. A majority point one direction, while the minority point the opposite direction, giving the crystal cell a significant net magnetization along this axis.

Now, I’ve tackled the first issue. Ferromagnetic coupling of electrons occurs because it is energetically favorable for them to spin orient in the same direction. Now, why is it that this can occur in a particular structural direction with respect to the material phase which harbors these spins?

It turns out that there are several interconnected features that need to be discussed here.

The first thing to note is how a material substance, like your bar magnet, can be prepared to have a “net” magnetization. I specify net magnetization in the same sense as it is described for the magnetite crystal: a majority of the spins happen to point in the same direction. This may not be a big majority, just more than 50%. Any majority will give an observable external field. In most iron ore rock, the magnetic dipole moments are locally ordered, but globally disordered. The ordered spins become trapped in small sections of crystal called domains, where the spins within a domain are very well ordered but the domains are not ordered with respect to one another. A plain old piece of iron scrap picked up off the side of the road may have no net magnetization and will not produce an observable magnetic field.

A ferromagnet produced in a factory is built using a special annealing process. The metal is heated until it is molten and then cooled into a solid in the presence of a powerful electromagnetic field. While molten, the electromagnetic field poles a majority of the spins within the metal. Given the quantum mechanical effects mentioned above, these spins like to be co-oriented relative to each other and given the physics of how magnetic dipoles respond to an external magnetic field, they will tend to orient relative to the field. When the metal is then cooled, the spins become “locked in.” Take the field away and the solid has a permanent magnetic dipole moment that is proportional to the numerical difference between the spins initially oriented with that external field and oriented against it.

This locking in turns out to be very dependent on what sort of crystal structure the substance has settled into. This should make some sense: as I showed with the molecular orbital in the image above, the magnetic spins are not necessarily localized exactly to the atoms, but are distributed among them. It turns out that the regular structures of crystal cells tend to have preferred directions where magnetization “likes” to be pointed. This is called magnetocrystalline anisotropy.

Magnetocrystalline anisotropy (from wikipedia)

The point of this is that if you magnetize the material while it’s molten, then cool it into a solid, as the solid crystallizes, the magnetization is most stable if the crystal is oriented relative to the external field. You could magnetize spins in any direction relative to the crystal that you want, but if you pick a particular direction, called the ‘easy’ direction, entropy will tend not to disorder the spins in the crystal that quickly. It turns out that hexagonal crystals (middle panel) are highly preferred because they have a single axis along which magnetization turns out to be easy. For example, hexaferrites, which contain ferrite on a hexagonal lattice, are well known for industrial applications. This material has strictly columnar easy magnetization.

Of note, magnetite probably has 8 directions (right panel) that are easy. If you look down any corner of the crystal unit cell, it has a hexagonal footprint:

magnetite crystal cell regarded along a point of the cube (Gaussview).

This would suggest that magnetite is not necessarily the most industrially preferred magnetic material.

Naturally occurring lodestones come to exist in a process much like what is described above. Iron ore liquified in the interior of the Earth is poled by the Earth’s magnetic field and then allowed to cool. Allow it to sit undisturbed for the right amount of time and then pluck out a piece… boom, magnetic metal.

To make a good permanent magnet, you pick out a magnetic material that can be crystallized selectively on a hexagonal lattice. You would then want to learn conditions that allow for crystallization of large domains. To make your magnet, you would melt that material and crystallize it in the presence of a strong magnetic field under conditions that make big domains. If you’re really classy, you would then machine this material down to a monodomain (while cooling it to keep from heating it above its Curie temperature during the machining process). Rare-earth magnets are even classier because lanthanides have even better characteristics for orienting spins than iron does.

There are significant other details present in this discussion. Magnets are very subtle. But, I think I’ve left no part of this muddled topic untouched. Do you suppose Insane Clown Posse would change the lyrics in their song? Yeah, I know… the song specifically resists the notion that people without expertise can learn from people with expertise.

Never thought I’d write the following on this series of posts, but The End!

Extra:

Because I think it’s cool, here’s some additional structure for the spin orbitals in the iron cluster above.

0.025 isodensity level on alpha minus beta spin-orbital difference density

0.004 isodensity level on alpha minus beta spin-orbital difference density

0.0004 isodensity level on alpha minus beta spin-orbital difference density

This sequence of images depicts a density difference map. The density is all alpha orbitals minus all beta orbitals. Purplish zones are where alpha density is greater than beta density. Cyan zones are where beta density is greater than alpha density. At the highest isodensity level, it’s clear that the majority of the excess alpha spin is trapped on both of the iron atoms. However, atoms are less than rigid blocks and fringes of the density are spread across the entire complex. Interestingly, the excess alpha density is clearly clustered on the atoms rather than in between, suggesting that the excess alpha orbitals are predominantly anti-bonding. Kind of cool.

Small Games with Gaussian (3)

foolish physicist — Tue, 28 Apr 2020 23:33:15 +0000

(Irony that my intro background picture is from GAMESS since I don’t have any cool Gaussian pictures to show at the moment.)

Okay, you’ve got an optimized structure for a chemical by an ab initio method. You just spent months learning Hartree-Fock, then Density Functional Theory and even dabbled a bit in the nether regions of computational hell with MP2 and CCSD, only to take a breath of air to swallow some AM1. What in the world do you do with it now? What’s the point? You’ve got B3LYP dripping out of every orifice and are worried there isn’t enough B3LYP. Where do you go now if not the proctologist to make certain you don’t have colon cancer?

The whole point in the end is to predict real physical parameters of some sort. You want the model so that you can turn around and apply it to reality. If what you simulated is only true in silico, then the entire experience isn’t worth more than months spent on Fortnite Battle Royale. And, you might ultimately have more fun on Fortnite.

The point of struggling with GAMESS and Gaussian is the little off chance that what you produced inside the machine matches something in reality. Fortnite struggles to fake real. With Gaussian, the point is to go further and take that last step across the gap to make a numerical prediction that you can then turn around and measure in the world around you. And this is at the extreme fringes of real where the measurement applies.

With my work in Gaussian and GAMESS, there was a real research objective at the end of the tunnel. One stop-over along the route was to try to figure out how good of a measurement it’s possible to make. This is not a subtle point; did all this hard work amount to something?

One clear, measurable set of molecular quantities that you might care about is the electrostatic features of some molecule of interest. The most prominent one that you will hear people talking about is the permanent dipole moment. Within a molecule, it often turns out that the average position of the electrons does not match the average position of the nuclei. The separation of these generates a small local electric field that falls off over distance as 1/r^3. You may remember me talking in another post about the magnetic dipole moment. The electric dipole moment is the first moment of the electrostatic charge distribution (the second term in the multipole expansion, after the net charge), derived much the same way. Dipole moment can be a very important quantity because the generalized electrostatic charge distribution of a molecule has a strong impact on how it can directly interact with another molecule.
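To make the definition concrete, here is a toy calculation of a dipole moment as the charge-weighted sum of positions, mu = sum(q_i * r_i), converted to Debye. This is a classical point-charge cartoon and not an ab initio result: the partial charges, bond length and angle below are illustrative values assumed for this sketch, not numbers from any of the calculations in this post.

```python
import math

E_ANGSTROM_TO_DEBYE = 4.80320  # one elementary charge times one Angstrom, in Debye


def dipole_debye(charges, positions):
    """Dipole vector (e*Angstrom) and magnitude (Debye) of point charges.

    charges are in units of e, positions are (x, y, z) in Angstrom.
    """
    vector = [
        sum(q * r[axis] for q, r in zip(charges, positions))
        for axis in range(3)
    ]
    magnitude = math.sqrt(sum(c * c for c in vector)) * E_ANGSTROM_TO_DEBYE
    return vector, magnitude


# Assumed toy water: O at the origin, two H atoms in the x-z plane.
R_OH = 1.0                         # Angstrom (assumption)
HALF_ANGLE = math.radians(109.47 / 2)  # half the H-O-H angle (assumption)
Q_H = 0.41                         # partial charge on each H, in e (assumption)

charges = [-2 * Q_H, Q_H, Q_H]
positions = [
    (0.0, 0.0, 0.0),
    (R_OH * math.sin(HALF_ANGLE), 0.0, R_OH * math.cos(HALF_ANGLE)),
    (-R_OH * math.sin(HALF_ANGLE), 0.0, R_OH * math.cos(HALF_ANGLE)),
]

vector, magnitude = dipole_debye(charges, positions)
print(f"dipole = {magnitude:.2f} D along z")
```

Because the molecule is neutral overall, the result does not depend on where the origin is placed; for a charged species it would.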

While one might consider the dipole moment to be the most simplistic aspect of molecular charge distribution, it actually turned out to be something of a learning experience to calculate it accurately. I took something of a detour to learn how it was done well on a simple historic system of interest.

The dipole moment of water is an old and important physical chemistry computational target. First, it actually turned out to be a non-trivial measurement and was not actually measured well until 1973. The value you can find cited on Google by typing “dipole moment of water” was measured by Shepherd Clough and company using the Stark Shift. Stark shift is the Hamiltonian perturbation of an electric field on the quantum mechanical levels of an atom or molecule, causing degenerate quantum levels to split in energy relative to their interaction with the external electric field. Water has a permanent dipole moment of 1.85 Debye and this value is known well to four decimal places in the Clough paper, which is pretty good.

This value makes a good target for ab initio quantitation since it is already well enough measured for comparison. The modeling values that tend to turn heads came from Thom Dunning’s group and the group of Kim some twenty years after the Stark shift measurement. I became a fan of Dunning’s cc-pVNZ gaussian basis sets as a result of doing these calculations…

For your (semi-) entertainment, here is a tabulation of water dipole moments using various ab initio methods.

Through this little piece of work, you can see how the various combinations of technique and mathematical basis set interact to produce better and better estimates of the water dipole moment. In the methods, “HF” means Hartree-Fock, “DFT” is of course density functional theory, “CCSD” means coupled-cluster with single and double excitations, which is a configuration interaction theory considered to be a post-Hartree-Fock technique, and “MP2” means second order Møller-Plesset perturbation, which is also post-Hartree-Fock. For Hartree-Fock, the bigger and bigger basis sets hit the Hartree-Fock limit when the strength of the water dipole moment is about 2.0 Debye. The Dunning basis cc-pVQZ is basically sitting on the Hartree-Fock limit for water. The values get better for higher levels of theory, and really turn good when you start applying post-Hartree-Fock to basis sets that include diffuse augmentation to correct the fringes of the gaussian sets to look more like exponentials. The B3LYP functional in DFT produced ridiculously good values, but you can see that it undercuts the water dipole moment as you push to larger basis sets. This means that the faulty asymptotic behavior of the functional is combining with the incompleteness of the basis to produce a spuriously accurate value. It also means that you can’t really trust it to produce correct values in an unrelated system. Switching to the long-range corrected functional CAM-B3LYP causes the value to deviate to slightly higher dipole strengths, but restores the behavior of the functional to mimic that of the post-Hartree-Fock methods, allowing the basis set density to be the main contributor to accuracy.

One thing hidden here is the computational resources necessary to run a given technique. As it turns out, the HF and DFT are both relatively cheap. CCSD and MP2 are not. I managed to crash my computer, literally crash it, trying to apply MP2 to a bigger molecule for purposes of measuring the dipole moment. The computer hard drive literally overflowed with computational scratch files and prevented the computer from booting until I started in safe mode and deleted the scratch files by hand. And, CCSD is reputedly more costly than MP2. For a molecule the size of water, CCSD with cc-pVQZ took nearly ten minutes where the other techniques clocked in at 14 seconds or less.
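A back-of-the-envelope way to see why the bill climbs so fast is the formal scaling of each method with the number of basis functions N: roughly N^4 for Hartree-Fock, N^5 for MP2 and N^6 for CCSD. The sketch below applies those textbook exponents to water as the basis grows. It is an idealization: real programs use screening and other tricks, so actual timings (including the ones quoted above) will not follow these powers exactly, and the basis function counts assume spherical functions.

```python
# Formal scaling exponents with basis set size (textbook values).
FORMAL_EXPONENT = {"HF": 4, "MP2": 5, "CCSD": 6}

# Basis function counts for one water molecule, spherical functions assumed.
WATER_BASIS_FUNCTIONS = {"cc-pVDZ": 24, "cc-pVTZ": 58, "cc-pVQZ": 115}


def relative_cost(method, n_basis, n_reference):
    """Formal cost of a method at n_basis relative to n_reference."""
    return (n_basis / n_reference) ** FORMAL_EXPONENT[method]


small = WATER_BASIS_FUNCTIONS["cc-pVDZ"]
large = WATER_BASIS_FUNCTIONS["cc-pVQZ"]
for method in FORMAL_EXPONENT:
    growth = relative_cost(method, large, small)
    print(f"{method:>4}: cc-pVDZ -> cc-pVQZ costs roughly {growth:,.0f}x more")
```

The same exponents apply when the molecule gets bigger at a fixed basis, which is how a post-Hartree-Fock job on a larger molecule can fill a hard drive with scratch files.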

Some of these methods can be really good for predicting physical properties. Be sure to bring your supercomputer!

Edit 4-30-20:

Yeah, I’ll add one disposable picture from Gaussian;-)

Small Games with Gaussian (2)

foolish physicist — Wed, 26 Feb 2020 21:41:40 +0000

Subtitle: Return of the G-quartet

This post will mainly just be pretty pictures. As I’ve been working with Gaussian, I’ve found that some of my earlier work with GAMESS wasn’t holding up. I took the time to go back and see if a couple of my earlier efforts with the G-quartet were still valid, or if Gaussian disagreed with GAMESS about those simulations too.

Turns out GAMESS did pretty well.

It felt like validation because all that work with GAMESS was very intense and difficult. It may seem a bit of an irony, but Gaussian did not immediately turn out good structures of the G-quartet. I apparently have a knack for picking hard targets. My initial efforts with Gaussian to try to tackle the G-quartet were hampered by my not knowing any finer details about how to control the structure search being made by the program. As such, I had to make a few good passes at it before I hit a control setting that allowed for convergence. Even with the 6-31G** basis set, there ended up being some basis set superposition error hampering clear positioning of the central cation. I wasn’t certain this was the case initially, but the stability of convergence in the geometry search depends strongly on the density of the basis functions around the core of the quartet.

G-quartet with sodium, Hartree-Fock 6-31G**. Coloration of the isosurface reflects electrostatic potential (blue more +, clear more neutral).

This structure took eleven passes to produce, much of it trial and error until I found a setting that converged the search –that’s several hundred steps not going anywhere until suddenly a setting let it converge in only ten steps. My eyes popped at that: with GAMESS, I took no fewer than 500 steps over several weeks. Once I hit the setting, Gaussian converged it in ten steps, taking just a couple hours! The quartet is still potato-chip shaped. Admittedly, I would be faster with a search using GAMESS now, since I was learning a great deal about structure searching when I originally did that work, but I doubt I could ever crank a structure the size of the G-quartet out in only ten steps. The system Gaussian uses with automating redundant coordinates really speeds the search! (If you have the step size set appropriately.)

G-quartet HOMO orbital, 6-31G**.

The HOMO is really quite striking. With Gaussian, the HOMO is immediately distributed across the entire molecule. The green and burgundy are the peaks and troughs of the electron probability wave, again marked as a probability isosurface. With GAMESS, I needed to map out the four top orbitals together to see this pattern emerge. I’m not sure what the difference is between the programs, but Gaussian appears to have some other intriguing quirks that can significantly save time and work.

Small Games with Gaussian (1)

foolish physicist — Tue, 11 Feb 2020 00:56:00 +0000

I don’t have a huge amount of time to talk about it at the moment, but something kind of cool has happened. This is a small extension off my quantum chemistry series.

For much of the last year, I’ve been putting huge amounts of time into ab initio calculation using the General Atomic and Molecular Electronic Structure System (GAMESS). I love GAMESS, as you probably guessed here, here, here, here and here. GAMESS is a godsend for anyone who wants to learn about ab initio mainly because making GAMESS really work requires you to spend a great deal of time in the primary literature learning how the techniques that drive the program operate. It’ll do a lot for casual observers, but getting it to dance on a chunk of a problem requires you to understand the question you’re asking. My strength has steadily grown. I’m not a perfect expert, but I’ve learned a lot about how to do various calculations.

The limits of GAMESS begin to appear when you keep asking bigger, more detailed, more specific questions. Admittedly, this is the case with any quantum chemistry program: none can take you all the way across the river. There are a lot of sub-programs within GAMESS that are either outright broken, not quite complete, just a little too cobbled together, somewhat dated within the build, or not adequate to the task. Not being a programmer of significant skill, I’m stranded at finding problems and not being able to fix them, despite having the chance to dig into GAMESS source code for a look. Then again, most people are stranded there in the face of even small quantum chemistry problems –programs like GAMESS represent literally decades of hyper-specialized work.

I have spent a lot of time on GAMESS and done a fairly large amount of work with it, some out of curiosity and some out of professional need.

Turns out that my enthusiasm has impressed one of my superiors. I got access to Gaussian!

Most people who read that sentence will stop and think, “Why should I be impressed?” Someone who has spent time on Quantum Chemistry will nod their head and give a knowing smile.

(Edit 2-17-20: I feel that there is perhaps a contextual provision that needs to be added here. Gaussian may be the very best single tool that Quantum Chemistry has given the world. To some within the academic community, it may also be an object of some disgust. As with all things made by humans, Gaussian was intended specifically as an agent designed to convert knowledge into financial profit and it was engineered to be the single best tool of its kind. As with many things like it, it has been guarded jealously. Gaussian’s licensing contains a legal clause that the Gaussian software must not be used to develop or compete with the interests of the Gaussian company and particularly must not be used to develop products that would compete with Gaussian. On the surface, this makes sense, but you can also see how it would be a destructive problem in academia. Quantum chemistry software that can produce good, quantitative calculations is the product of sometimes millions of man hours of work… meaning that it is very difficult to start from scratch on something new and rise to the level of major players totally independently. This can be a problem in academia where you often need one tool in order to advance research if that research would be taken to develop a new tool that might compete with the original tool. It would violate the terms of service to use Gaussian to produce benchmark calculations for a completely new quantum chemistry calculation method! As such, there have been licensing issues in Academia where people trying to improve the techniques for doing what Gaussian does can’t use Gaussian to improve their work since their work may ultimately threaten Gaussian –literally, the Gaussian software license has been denied to universities or departments that might develop competing software. GAMESS can thrive in this environment because it is not so restrictive!)

GAMESS and Gaussian emerged from a phenomenal burst of research in the 1970s: GAMESS descended from Michel Dupuis’ (and coworkers’) HONDO and Gaussian descended from John Pople’s Gaussian 70. At the time, Gaussian had the slight advantage of showing up on the field first. Pople went on to win the Nobel prize in 1998 for the contribution that Gaussian (in subsequent years) ultimately came to represent.

These days, GAMESS can be acquired by poor academics like me at no charge. Gaussian, on the other hand, was a bridge too far to hope. Just take a look at the pricing information if you care to see what I mean. And, as well, the questions I could ask were definitely limited by the degree to which I was willing to turn backflips to try to make GAMESS chew on the task at hand. You can go only so far.

It turns out that there is a single benefit to being a poor, lowly academic in the bowels of the science industrial complex. Sometimes the people above you have a bit of money.

Liquid crystal molecule simulated by Gaussian 16, visualized by Gaussview 6.

After having labored to build a self-consistent field program by hand from scratch in the wrong programming language, I once compared GAMESS to an Aston Martin sitting in an alley way with vanity plates inviting me to take it for a spin. I’ve driven it around the block until the paint wore off: I love it, but it’s a 1985 Toyota 4runner with 300,000 miles on the odometer. It’ll get you there, but not always in style. I like it and will continue to drive it to answer the small questions.

Gaussian 16 is the literal Bugatti Veyron of the quantum chemistry world. And at roughly the equivalent expense.

With the hardware I have in hand, GAMESS chugged through a problem in 5 hours and 45 minutes –25 to 30 steps to optimize. The same problem in the hands of Gaussian 16 was 1 hour and 21 minutes –no more than 13 steps to optimize! That is some freaking unbelievable speed when you’re asking questions that take days to answer. And, I say this with my coworker offering me the potential for time on the university cluster supercomputer. With a thousand processors, days become minutes. (Keep in mind that it’s still possible to ask a question that breaks a modern supercomputer, so even Gaussian can only take you so far.)
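To put a number on that difference, here is a small back-of-the-envelope sketch in Python. It uses only the timings and step counts quoted above; the variable names are my own, and since the Gaussian run took "no more than 13 steps", its per-step time is a lower bound rather than a measurement.

```python
from datetime import timedelta

# Wall-clock times quoted in the text for the same optimization problem.
gamess_wall = timedelta(hours=5, minutes=45)
gaussian_wall = timedelta(hours=1, minutes=21)

# Dividing one timedelta by another gives a plain float ratio.
wall_speedup = gamess_wall / gaussian_wall


def minutes_per_step(wall, steps):
    """Average wall-clock minutes spent per optimization step."""
    return wall.total_seconds() / 60 / steps


# GAMESS took 25 to 30 steps, so report the per-step time as a (low, high) range.
gamess_per_step = (minutes_per_step(gamess_wall, 30),
                   minutes_per_step(gamess_wall, 25))

# Gaussian took at most 13 steps, so this is the smallest possible per-step time.
gaussian_min_per_step = minutes_per_step(gaussian_wall, 13)

print(f"overall speedup: {wall_speedup:.2f}x")
print(f"GAMESS minutes per step: {gamess_per_step[0]:.1f} to {gamess_per_step[1]:.1f}")
print(f"Gaussian minutes per step: at least {gaussian_min_per_step:.2f}")
```

Read this way, the overall gain is a bit more than a factor of four, and it seems to come from two places at once: each step is cheaper, and fewer steps are needed.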

I am very impressed by the difference. Hopefully, I will get the opportunity to post more regarding work in Gaussian, but my access to it is understandably somewhat more limited. We’ll see.

Edit 2-13-20:

My initial impression of Gaussian 16 is that it’s a true beast. It’s cranked out 16 optimized structures in two days at only about a 50% duty cycle where I might have been able to do one or two structures with GAMESS at nearly a 100% duty cycle. To maximize it, I’m actually running lists of jobs by script, which I could never really do with GAMESS since GAMESS routinely required me to stop and tweak things in order to maximize even a single job. A month of work is down to four days. I have to be careful, I might get spoiled here!
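For the curious, running a list of jobs by script can look something like the sketch below. This is not my actual script, just a minimal Python illustration. It assumes the Gaussian executable is on the path as `g16`, that it accepts an input file path as its argument, and that the inputs end in `.gjf`; the function names and the `jobs/` directory are hypothetical.

```python
import subprocess
from pathlib import Path


def build_commands(input_dir, executable="g16", suffix=".gjf"):
    """Return one [executable, input_file] command per input file, sorted by name.

    The executable name and file suffix are assumptions; adjust them to
    match the local installation.
    """
    inputs = sorted(Path(input_dir).glob(f"*{suffix}"))
    return [[executable, str(path)] for path in inputs]


def run_jobs(commands, runner=subprocess.run):
    """Run the jobs one after another and return the inputs that failed.

    A failed job is recorded and skipped rather than stopping the queue,
    so one bad input does not waste the rest of the night.
    """
    failed = []
    for command in commands:
        result = runner(command)
        if result.returncode != 0:
            failed.append(command[-1])
    return failed


# Example usage (left commented out so nothing is launched by accident):
# failed = run_jobs(build_commands("jobs"))
# print("failed inputs:", failed)
```

The `runner` argument exists only so the queue logic can be tried out with a stand-in function before any real calculation is launched.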

Edit 2-17-20:

GAMESS and Gaussian are definitely not equal. So far, I’ve bumped into two situations where GAMESS and Gaussian have optimized different structures out of the same molecule. This would be okay to the extent that many molecules possess internal degrees of freedom leading to entire constellations of stationary states –bonds rotate in many ways, after all– except that at least one of the structures I’ve seen produced by GAMESS is nowhere in the comprehensive collection of structures produced by Gaussian for that same molecule (and I have a library of 96 such structures in hand!). Many of the structures agree, but a small minority do not.

One difference I’ve found between GAMESS and Gaussian is in how the structure optimization is run. GAMESS appears to decide that it has achieved a converged geometry based on a single convergence criterion. Gaussian, on the other hand, appears to use four convergence criteria together (maximum force, RMS force, maximum displacement and RMS displacement). Increased stringency should, theoretically, decrease the occurrence of false positives, though you might imagine that it would increase the chances of false negatives… missing real structures. At the very least, one of these spurious GAMESS structures failed to converge when plugged into Gaussian directly. From this angle, it’s really kind of hard to know who’s right: in silico work carries the complication that neither simulation actually exists in reality and both might be wrong!
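The gap between one test and four tests taken together can be sketched in a few lines of Python. This is only an illustration of the idea, not either program's actual algorithm: the function names are hypothetical, and the thresholds are assumed values in atomic units chosen for the example, so check each program's manual for its real defaults.

```python
from math import sqrt

# Assumed thresholds (atomic units), for illustration only.
SINGLE_TOL = 1.0e-4
FOUR_LIMITS = {
    "max_force": 4.5e-4,
    "rms_force": 3.0e-4,
    "max_disp": 1.8e-3,
    "rms_disp": 1.2e-3,
}


def rms(values):
    """Root-mean-square of a list of numbers."""
    return sqrt(sum(v * v for v in values) / len(values))


def single_criterion_converged(forces, tol=SINGLE_TOL):
    """Converged if the largest force component is below one tolerance."""
    return max(abs(f) for f in forces) < tol


def four_criteria_converged(forces, displacements, limits=FOUR_LIMITS):
    """Converged only if all four measures are below their limits."""
    measured = {
        "max_force": max(abs(f) for f in forces),
        "rms_force": rms(forces),
        "max_disp": max(abs(d) for d in displacements),
        "rms_disp": rms(displacements),
    }
    return all(measured[name] < limit for name, limit in limits.items())


# A made-up step on a very flat energy surface: the forces are tiny,
# but the optimizer still wants to move the atoms a fair distance.
forces = [5e-5, -3e-5, 2e-5]
displacements = [4e-3, -1e-3, 2e-3]

print(single_criterion_converged(forces))              # True
print(four_criteria_converged(forces, displacements))  # False
```

In this made-up case the forces alone say "done" while the predicted displacement says "keep going", which is one plausible way the two kinds of test could part ways on a floppy molecule.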

This is where real world experimentation is needed to figure out who is right. From what I can see, given some of the issues I’ve bumped into with GAMESS, Gaussian is closer to reality.