MO Coefficients are different in the output file (out file) and the formatted checkpoint file (fchk)

wangsy · December 16, 2023, 5:44am

When using “scf_final_print 3” in the input file, the printed MO coefficients in the output file are completely different from those in the fchk or h5 file. What could be the reason for this?

The image above is the “out” file, and the image below is the “fchk” file.

input file: Please note that the “****” below the basis set cannot be displayed.
$rem
METHOD b3lyp
SCF_ALGORITHM DIIS
BASIS mixed
SCF_GUESS sap
SYMMETRY false
SYM_IGNORE TRUE
SCF_CONVERGENCE 8
UNRESTRICTED true
MAX_SCF_CYCLES 100
$end
$molecule
1 1
C 0.00000000 0.00000000 0.00000000
F -1.06371800 0.61413800 0.00000000
F 0.00000000 -1.22827500 0.00000000
F 1.06371800 0.61413800 0.00000000
$end
$basis
C 1
cc-pcvdz

F 2
cc-pvdz

F 3
cc-pvdz

F 4
cc-pvdz

$end

@@@

$occupied
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
1 2 3 5 6 7 8 9 10 11 12 13 14 15 16
$end
$rem
METHOD b3lyp
SCF_ALGORITHM DIIS
BASIS mixed
SYMMETRY false
SYM_IGNORE TRUE
SCF_GUESS read
SCF_CONVERGENCE 7
MOM_START 1
MOM_METHOD IMOM
UNRESTRICTED true
scf_final_print 3
IQMOL_FCHK TRUE
MAX_SCF_CYCLES 100
$end
$molecule
2 2
C 0.00000000 0.00000000 0.00000000
F -1.06371800 0.61413800 0.00000000
F 0.00000000 -1.22827500 0.00000000
F 1.06371800 0.61413800 0.00000000
$end
$basis
C 1
cc-pcvdz

F 2
cc-pvdz

F 3
cc-pvdz

F 4
cc-pvdz

$end

jherbert · December 17, 2023, 6:39pm

(a) Is this true in both jobs or just in the 2nd job?
(b) You can use the </> button to give us the input file with all the special characters intact.

wangsy · December 18, 2023, 3:03am

The 2nd job is incorrect. Here is my complete input file.

$rem
METHOD b3lyp
SCF_ALGORITHM  DIIS
BASIS mixed
SCF_GUESS sap
SYMMETRY false
SYM_IGNORE TRUE
SCF_CONVERGENCE 8
UNRESTRICTED true
MAX_SCF_CYCLES 100
$end
$molecule
1 1
C       0.00000000      0.00000000      0.00000000
F      -1.06371800      0.61413800      0.00000000
F       0.00000000     -1.22827500      0.00000000
F       1.06371800      0.61413800      0.00000000
$end
$basis
C 1
cc-pcvdz
****
F 2
cc-pvdz
****
F 3
cc-pvdz
****
F 4
cc-pvdz
****
$end

@@@

$occupied
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
1 2 3 5 6 7 8 9 10 11 12 13 14 15 16
$end
$rem
METHOD b3lyp
SCF_ALGORITHM  DIIS
BASIS mixed
SYMMETRY false
SYM_IGNORE TRUE
SCF_GUESS read
SCF_CONVERGENCE 7
MOM_START 1
MOM_METHOD IMOM
UNRESTRICTED true 
scf_final_print 3
IQMOL_FCHK TRUE
MAX_SCF_CYCLES 100
$end
$molecule
2 2
C       0.00000000      0.00000000      0.00000000
F      -1.06371800      0.61413800      0.00000000
F       0.00000000     -1.22827500      0.00000000
F       1.06371800      0.61413800      0.00000000
$end
$basis
C 1
cc-pcvdz
****
F 2
cc-pvdz
****
F 3
cc-pvdz
****
F 4
cc-pvdz
****
$end

wangsy · December 18, 2023, 3:05am

The 2nd job is incorrect.
Here is my complete input file.

$rem
METHOD b3lyp
SCF_ALGORITHM  DIIS
BASIS mixed
SCF_GUESS sap
SYMMETRY false
SYM_IGNORE TRUE
SCF_CONVERGENCE 8
UNRESTRICTED true
MAX_SCF_CYCLES 100
$end
$molecule
1 1
C       0.00000000      0.00000000      0.00000000
F      -1.06371800      0.61413800      0.00000000
F       0.00000000     -1.22827500      0.00000000
F       1.06371800      0.61413800      0.00000000
$end
$basis
C 1
cc-pcvdz
****
F 2
cc-pvdz
****
F 3
cc-pvdz
****
F 4
cc-pvdz
****
$end

@@@

$occupied
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
1 2 3 5 6 7 8 9 10 11 12 13 14 15 16
$end
$rem
METHOD b3lyp
SCF_ALGORITHM  DIIS
BASIS mixed
SYMMETRY false
SYM_IGNORE TRUE
SCF_GUESS read
SCF_CONVERGENCE 7
MOM_START 1
MOM_METHOD IMOM
UNRESTRICTED true 
scf_final_print 3
IQMOL_FCHK TRUE
MAX_SCF_CYCLES 100
$end
$molecule
2 2
C       0.00000000      0.00000000      0.00000000
F      -1.06371800      0.61413800      0.00000000
F       0.00000000     -1.22827500      0.00000000
F       1.06371800      0.61413800      0.00000000
$end
$basis
C 1
cc-pcvdz
****
F 2
cc-pvdz
****
F 3
cc-pvdz
****
F 4
cc-pvdz
****
$end

jherbert · December 18, 2023, 5:37pm

I am looking into this.

wangsy · December 19, 2023, 4:39am

Thank you very much for your reply. I believe it might be more convenient to find the answer from the source program. I have used other quantum chemistry software for the same calculation in order to verify the correct results, but unfortunately, the results obtained do not match the fchk and out files from QChem. I would like to ask if you can determine which results are correct (out or fchk)? This is important to me because I need to use this data for further calculations. I look forward to your reply, and once again, thank you very much!

jherbert · December 19, 2023, 7:10pm

It seems to be a problem with the compound job input (“@@@”). For the time being, please try running these as two separate jobs. Separate your input into job1.in and job2.in then do something like

qchem -save job1.in job1.out job1.scr
cp -r $QCSCRATCH/job1.scr $QCSCRATCH/job2.scr
qchem -save job2.in job2.out job2.scr

(You can always run on multiple threads with -nt, not shown here.) In this case, everything seems to match; please confirm whether that’s true for you also. Note that you can use PRINT_ORBITALS = TRUE as an alternative to SCF_FINAL_PRINT = 3 to get MOs in a nicer output format. Note also that the MO coefficients in the .fchk file are transposed with respect to those in the Q-Chem output file.

wangsy · December 19, 2023, 10:12pm

Actually, I discovered this issue during a separate task. In order to facilitate communication, I merged the two tasks and reproduced the issue here. When I performed the separate task again just now, the second step still resulted in a mismatch between “out” and “fchk.” My Qchem software version is 6.1.1.

wangsy · December 19, 2023, 10:34pm

When using the keywords SCF_FINAL_PRINT 3 and PRINT_ORBITALS TRUE simultaneously, it seems that the orbitals printed by the two methods do not match. The orbitals printed by the PRINT_ORBITALS TRUE keyword match the fchk file, but the part printed by SCF_FINAL_PRINT 3 still does not match the previous two. Could this be due to the keywords? In addition, when testing with different systems, it was found that in some systems, using the SCF_FINAL_PRINT 3 keyword matches completely with the fchk file, while in others, it does not.

jherbert · December 20, 2023, 7:50pm

I would trust PRINT_ORBITALS, which is the standard way to get orbitals. Does this match the FChk in all cases?

In cases where SCF_FINAL_PRINT=3 gives something different, can you try setting GEN_SCFMAN = FALSE and do the same comparison? SCF_FINAL_PRINT options was added to GEN_SCFMAN (the new SCF code) relatively recently, maybe there’s a bug with that implementation.

jherbert · December 20, 2023, 9:47pm

Okay, I can reproduce this problem. Good news is that PRINT_ORBITALS matches the Fchk, whether I run this as a single compound input or as two sequential inputs. (Do you agree?) It is output from SCF_FINAL_PRINT=3 that fails to match, although strangely it does seem to match for certain orbitals but not the first ones. I suspect the latter is a bug. I will post a ticket on our developer site and see if someone else has some insight.

wangsy · December 21, 2023, 4:29am

I would like to confirm that the orbitals coefficients printed by the keyword PRINT_ORBITALS always match with the fchk. However, the orbitals coefficients obtained with the keyword SCF_FINAL_PRINT=3 sometimes match with the fchk and sometimes do not.
Thank you very much for your attention.

jxzou-MOKIT · December 21, 2023, 2:41pm

When SCF_FINAL_PRINT=3 is used, is it possible that there is a hidden matrix diagonalization/iteration (e.g. 1 cycle of FC=SCE) between printing MO coefficients and creating .fchk file?

If such a matrix diagonalization/iteration exists, the MO coefficients of degenerate orbitals will probably be changed. Especially for the lowest core orbitals, they are usually degenerate. (For energetically degenerate orbitals, any unitary transformation does not change the orbital energy or the total electronic energy)

And if such a matrix diagonalization/iteration exists, there is one more problem: the DFT grid. The default grid may lead to near-degenerate orbitals for exactly degenerate ones. But if we add xc_grid 000099000590, the degenerate orbitals would just be degenerate. This numerical issue would affect the matrix diagonalization or reproduction of the problem.

jherbert · December 21, 2023, 7:26pm

The grid shouldn’t affect the agreement between the two printouts, but in light of your comment about an extra diagonalization, I increased SCF_CONVERGENCE to 8 in the 2nd job and now I get agreement between all three ways of accessing to coefficients. This does suggest that maybe the two printouts are out-of-sync by at least one Roothaan step.

jherbert · January 5, 2024, 12:36pm

Q-Chem staff looked into this further and it seems that it’s just swapping the order of two degenerate core orbitals. Depending on nuances like grid or convergence, the degeneracy might be split just enough for this not to happen. It’s a good question, made me think, but in the end it’s a feature (of quantum mechanics) not a bug.