A systematic study of the valence electronic structure of cyclo(Gly–Phe), cyclo(Trp–Tyr) and cyclo(Trp–Trp) dipeptides in the gas phase

Elena Molteni; Giuseppe Mattioli; Paola Alippi; Lorenzo Avaldi; Paola Bolognesi; Laura Carlini; Federico Vismarra; Yingxuan Wu; Rocio Borrego Varillas; Mauro Nisoli; Manjot Singh; Mohammadhassan Valadan; Carlo Altucci; Robert Richter; Davide Sangalli

doi:10.1039/D1CP04050B

View PDF VersionPrevious ArticleNext Article

Open Access Article

This Open Access Article is licensed under a Creative Commons Attribution-Non Commercial 3.0 Unported Licence

DOI: 10.1039/D1CP04050B (Paper) Phys. Chem. Chem. Phys., 2021, 23, 26793-26805

A systematic study of the valence electronic structure of cyclo(Gly–Phe), cyclo(Trp–Tyr) and cyclo(Trp–Trp) dipeptides in the gas phase†

Elena Molteni *^ab, Giuseppe Mattioli ^a, Paola Alippi ^a, Lorenzo Avaldi ^a, Paola Bolognesi ^a, Laura Carlini ^a, Federico Vismarra ^cd, Yingxuan Wu ^cd, Rocio Borrego Varillas ^d, Mauro Nisoli ^cd, Manjot Singh ^e, Mohammadhassan Valadan ^ef, Carlo Altucci ^ef, Robert Richter ^g and Davide Sangalli ^ab
^aIstituto di Struttura della Materia-CNR (ISM-CNR), Area della Ricerca di Roma 1, Via Salaria km 29.300, CP 10, Monterotondo Scalo, Roma, Italy. E-mail: elena.molteni@mlib.ism.cnr.it
^bDipartimento di Fisica, Universita' degli Studi di Milano, via Celoria 16, I-20133 Milano, Italy
^cDipartimento di Fisica, Politecnico di Milano, Piazza Leonardo da Vinci, 32, Milano, Italy
^dCNR-Istituto di Fotonica e Nanotecnologie, Piazza Leonardo da Vinci, 32, Milano, Italy
^eDipartimento di Scienze Biomediche Avanzate, Universita' degli Studi di Napoli Federico II, via Pansini 5, I-80131, Napoli, Italy
^fIstituto Nazionale Fisica Nucleare (INFN), Sezione di Napoli, Napoli, Italy
^gSincrotrone Trieste, Area Science Park, Basovizza, Trieste, Italy

Received 3rd September 2021 , Accepted 18th November 2021

First published on 19th November 2021

Abstract

The electronic energy levels of cyclo(glycine–phenylalanine), cyclo(tryptophan–tyrosine) and cyclo(tryptophan–tryptophan) dipeptides are investigated with a joint experimental and theoretical approach. Experimentally, valence photoelectron spectra in the gas phase are measured using VUV radiation. Theoretically, we first obtain low-energy conformers through an automated conformer–rotamer ensemble sampling scheme based on tight-binding simulations. Then, different first principles computational schemes are considered to simulate the spectra: Hartree–Fock (HF), density functional theory (DFT) within the B3LYP approximation, the quasi-particle GW correction, and the quantum-chemistry CCSD method. Theory allows assignment of the main features of the spectra. A discussion on the role of electronic correlation is provided, by comparing computationally cheaper DFT scheme (and GW) results with the accurate CCSD method.

1 Introduction

Peptides are short chains of amino acids covalently linked by peptide bonds between the carboxylic terminal group of each amino acid and the amino terminal group of the following one. They are said to be cyclic if they contain a circular sequence of bonds. Cyclo-dipeptides (CDPs) or 2,5-diketopiperazines (DKP), derived from the cyclization of a dipeptide, i.e., of a peptide made by two amino acids, are the smallest cyclic peptides: they are widely present in nature and they display a variety of biological and pharmacological activities (such as antibacterial, antiviral, antitumoral, antioxidant).¹ They have higher structural rigidity and enzymatic stability with respect to linear peptides, and they feature multiple hydrogen bonding sites, which can potentially have a role both in self-assembly² and in forming complexes with different molecules, e.g. proteins.³ CDPs and their derivatives have therefore received attention both in drug discovery and as possible building blocks for nanodevices.^3,4

DKPs can be considered as precursors of longer oligopeptides, they have been detected, e.g., in meteorites,⁵ and, as chiral molecules, they can catalyze enantioselective reactions.⁶ They may therefore have had a role in the synthesis of the first biomolecules on Earth, and in the emerging of the so-called “homochirality of life”, i.e. the fact that naturally occurring proteins are composed of L-amino acids.⁵ The formation of cyclo-dipeptides in simulated prebiotic conditions has been studied in ref. 6. The correlation between chirality and self-organization of cyclo-dipeptides has also been investigated in the literature.⁴

Biomolecules are usually studied in solution, often in water, in order to mimic physiological conditions. The presence of the solvent can significantly affect their chemical and physical properties and indeed the analysis of the effects of type of solvent, pH and temperature on e.g. protein stability and conformation is an interesting field of research. On the other hand, in spite of the challenge to evaporate these molecules intact, gas phase characterization opens the way to determining the intrinsic properties of the molecule without environmental interferences. It also allows to investigate fundamental processes like ultra-fast charge migration via pump and probe experiments.⁷ Photoinduced charge transfer through molecules is involved in a variety of mechanisms and phenomena, including charge separation in photosynthetic centers, charge generation in organic solar cells, photoprotection and photodamage.^8–12

Experimental works on cyclo-dipeptides or linear dipeptides have already been reported in the literature, e.g., the study of the correlation between photoelectron spectra and structures in c-PhePhe, c-TyrPro and c-HisGly¹³ or in c-GlyGly, c-PhePro and c-LeuPro,¹⁴ the laser UV photoionization of the linear TrpTyr and TrpTrp dipeptides¹⁵ and the investigation on aromatic cyclo-dipeptides, such as c-TrpTrp, in methanol solution as building blocks for self-assembled biocompatible supramolecular structures with photoluminescence properties.¹⁶ The electron mobility and dissociation of several different short linear peptides in the gas phase via UV and IR multiphoton absorption,¹⁷ as well as the fragmentation of c-AlaAla, a process relevant to the understanding of the early stages of evolution of life,¹⁸ have attracted some interest.

Here a joint experimental and theoretical study of the electronic properties of c-GlyPhe, c-TrpTyr, and c-TrpTrp has been performed. The valence photoemission spectra (PES) in gas phase have been measured using synchrotron radiation and interpreted with the help of ab initio simulations. In our computational protocol, stable geometries of DKP in gas phase are extracted through a large-scale conformational search carried out using tight-binding (TB) simulations. Density functional theory (DFT) calculations are then used to compute the gas phase electronic properties of the most stable conformers. The electronic energy levels and the resulting densities of states (DOS) are compared with the measured PE spectra.

2 Methods

2.1 Theory

2.1.1 Energy minimization and structural relaxation. The GFN2-xTB tight-binding Hamiltonian¹⁹ has been used as the “engine” for the search of minimum energy configurations through an automated conformer–rotamer ensemble sampling tool (CREST) code.^2,19–21 The code provides the coordinates of a series of conformers of the molecule, ordered according to their increasing (/decreasing) energy (/Boltzmann population).

For each of the three dipeptides, a few lowest energy conformers found with this method were then subjected to a geometry optimization run, and, subsequently, to self-consistent electronic structure calculations: both calculations were performed within DFT^22,23 using either the Quantum ESPRESSO (QE)^24,25 or the ORCA²⁶ suites of programs. There is a generally close agreement on the energy ordering of molecular configurations between tight-binding and ab initio calculations. A very detailed investigation on this point has been performed in the case of c-TrpTrp, and is discussed in the ESI.†

In the case of QE, simulations of the same molecules have been performed in the framework of a plane-waves/pseudopotential method. Initial structures selected by the CREST algorithm have been accommodated in large cubic supercells (40 a.u., i.e. ≈21.167 Å, side length) and fully optimized. Then, in view of the more computationally demanding GW calculations, also a face-centered cubic (FCC) cell has been used, yielding the same distance between molecules in replicated cells with a smaller volume and, therefore, a lower number of G-vectors. Total energies have been calculated using norm-conserving Troullier–Martins atomic pseudopotentials,²⁷ a plane-wave basis set, and the B3LYP hybrid exchange–correlation functional,^28,29 (and also the M062X functional³⁰ in some cases) including the D3 pairwise dispersion correction for van der Waals (vdW) interactions.³¹ Satisfactorily converged results have been achieved by using cutoffs of 90 Ry on the plane waves and of 360 Ry on the electronic density, respectively. H 1s² (2 electrons), C 2s²-2p² (4 electrons), N 2s²-2p³ (5 electrons), and O 2s²-2p⁴ (6 electrons) electrons have been treated as valence electrons. All of the inner shell electrons are embedded in the pseudopotentials. The Makov–Payne correction to the total energy was also computed and an estimate of the vacuum level was calculated, which allowed us to properly align electronic energy levels.³² For geometry relaxation runs the Broyden–Fletcher–Goldfarb–Shanno (BFGS) scheme was used.³³

In the case of ORCA, DFT simulations have been performed in an all-electron localized-basis-set framework.^26,34 Kohn–Sham orbitals have been expanded on large, Gaussian-type def2-QZVPP basis sets.^35,36 The corresponding def2/J basis has been also used as an auxiliary basis set for Coulomb fitting in a resolution-of-identity/chain-of-spheres (RIJCOSX) framework. Molecular geometries have been fully re-optimized and their properties investigated by using the same B3LYP functional used in the case of QE, including the D3 pairwise dispersion correction.

2.1.2 Energy levels. We then analyzed the electronic levels obtained from the B3LYP simulations, with the aim of describing the photoemission measurements. Even if DFT is in principle suitable for reproducing charged excitations using orbital-dependent Koopman's compliant functionals,³⁷ the use of the B3LYP eigenvalues and wave-functions to model photoemission is formally not justified. In particular for empty orbitals, which are not even affected by the DFT constraint of reproducing the exact density. On the other hand, such an approach can be generally justified a posteriori by the close similarity often obtained between B3LYP energies and photoemission measurements, as well as by comparison with accurate yet much more expensive calculations. We proceed therefore to a detailed comparison with other approaches, using GlyPhe in particular as a test case. For GlyPhe only, we compare the energy levels calculated at the B3LYP level against Hartree–Fock (HF) ones, also obtained via a self consistent field (SCF) calculation done with QE. Moreover, for the first ionization energy (IE) and the electron affinity (EA) we compute total energy differences using the B3LYP functional and also the M062X functional. Validation of a DFT-based approach is a prerequisite to use such results as starting points for the subsequent application of many body perturbation theory (MBPT) methods; B3LYP eigenvalues are then used as a starting point for the quasiparticle (QP) corrections calculated within the GW approximation with the Yambo code.³⁸ The HF solution is instead the starting point for an additional calculation with the Equation of Motion (EOM) Coupled Cluster Single and Double excitations method (CCSD) with the ORCA code. Both EOM-CCSD and GW give a formal description of photoemission. CCSD directly gives an approximation to the photoemission [and inverse photoemission] energies (E^N₀ − E^N−1_i) [and (E^N+1_i − E^N₀)] and wave-functions including dynamical correlations on top of HF. Here E^N₀ is the total energy of the ground state in presence of N electrons, while E^N+1_i and E^N−1_i are the total energies of excited states with N ± 1 electrons. GW describes the same energies within the quasi-particle approximation ε_i ≈ (E^N₀ − E^N−1_i) for ε_i < μ [and ε_i ≈ (E^N+1_i − E^N₀) for ε_i > μ], with μ the chemical potential, and assuming that the Kohn–Sham (KS) wave-functions are a good approximation to QP wave-functions.^39,40 Both EOM-CCSD and GW calculations give a renormalization of the peak intensity due to coupling with either independent-particles (IP) transitions, in the CCSD case, or correlated Random Phase Approximation (RPA) transitions or plasmons, in the GW case. EOM-CCSD simulations are performed for the lowest energy conformer of each dipeptide, while GW calculations only for the lowest energy conformer of GlyPhe, for which we also directly compute (E^N₀ − E^N−1₀) and (E^N+1₀ − E^N₀), both using the B3LYP and the M062X functional, performing in each of the two cases three different total energy simulations changing the number of electrons in the system. For the GW scheme we compute the RPA screening within the Plasmon Pole Approximation (PPA), with 400 bands and an energy cutoff of 2 Ha. The GW self-energy is then constructed using 60 bands in the GW summation.

Reference values for molecular ionization energies have been obtained using ORCA, in the same Gaussian-type orbitals (GTO) framework discussed above but through a different wavefunction approach. As introduced above, the simulations are based on the equation of motion – coupled cluster theory (EOM-CC), which extends a primarily ground-state method like the single-reference coupled-cluster one to the calculation of excited/ionized states as well as to the corresponding excitation/ionization energies.⁴¹ Within the ORCA implementation, core and valence ionization energies, as well as electron attachment energies for the mapping of unoccupied electronic states, are calculated by grouping electron pairs in domain-local pairs of natural orbitals (DLPNO), and the chain-of-sphere approximation is used to speed up the calculation of exchange-like integrals with four virtual labels (COSX).^42,43 The present implementation makes the treatment of large systems (2196 basis functions in the case of the largest c-TrpTrp) possible, with a loss of accuracy lower than 0.01 eV with respect to canonical algorithms.

2.2 Experimental

The measurements of the photoelectron spectra have been performed at the CiPo beamline⁴⁴ of the Elettra synchrotron facility in Trieste, Italy. The radiation from the electromagnetic elliptical wiggler insertion device⁴⁵ in Section 4.2 of the storage ring is monochromatized by a variable angle spherical grating monochromator⁴⁶ and focused by a toroidal mirror to the interaction region of the end station equipped by a VG-220i hemispherical electron energy analyser.⁴⁷ The hemispherical electron energy analyser is mounted in the plane perpendicular to the direction of propagation of the photon beam, at the magic angle with respect to the electric vector, to avoid that the intensities of the different features in the photoelectron spectrum are affected by the asymmetry parameters of the ionized states. The hemispherical analyser is equipped with a 2D position sensitive detector⁴⁸ which spans a kinetic energy range of about 10% of the analyser pass energy and is characterized by an energy resolution of about 2% of the pass energy. In the present experiment a photon energy of 60 eV and an analyser pass energy of 30 eV have been used, resulting in an overall energy resolution of about 600 meV. The electrons kinetic energy scale and energy resolution have been calibrated against the He 1s peak and confirmed by the photoemission peak of water molecules present as background gas in the vacuum chamber or desorbed from the samples. A time-of-flight mass spectrometer, mounted opposite to the hemispherical analyser, has been used to monitor the thermal stability of the sample at regular intervals during the measurements. The experimental chamber is maintained at a background pressure of low 10⁻⁸ mbar, reaching a few 10⁻⁷ mbar in operating conditions. The target molecules, which are in the form of a powder at standard ambient temperature and pressure, are all commercially available. The c-TrpTrp and c-TrpTyr species were purchased from BACHEM while the c-GlyPhe molecule from Sigma-Aldrich. All the species have purity ≥98% and have been used without further purification. In all cases, an amount of sample of about 30 mg was inserted in a crucible under vacuum and sublimated at a temperature of 95, 155 and 147 °C for c-GlyPhe, c-TrpTyr and c-TrpTrp, respectively. The photoelectron spectra have been obtained by adding several scans over the binding energy region 7–30 eV with an energy step of 120 meV and acquisition time of 4 s per point.

3 Results

The experimental photoemission spectra of the c-GlyPhe, c-TrpTyr, and c-TrpTrp are shown in Fig. 1. The structure of the three cyclo-dipeptides is made by the central DKP ring and different side chains depending on the constituent amino acids, which contain the phenyl, phenol and indole chromophore groups in Phe, Tyr and Trp respectively. Thus the three spectra share some general characteristics, namely a first feature in the 8–10 eV energy region, followed after a gap of a couple of eV by broader features which display a maximum intensity at around 14 eV. In the next sections the ab initio simulations for each cyclo-dipeptide will be discussed in detail.


	Fig. 1 Experimental photoemission spectra of the c-GlyPhe, c-TrpTyr, and c-TrpTrp dipeptides. The feature at −12.6 eV in the c-TrpTrp spectrum is due to residual water in the sample.

All amino acids except glycine (Gly) have a chiral center: each chiral molecule can exist in two chemically indistinguishable forms, enantiomers, labeled “S” and “R”, which can only be identified through their different interaction with circularly polarized electromagnetic radiation, yielding opposite circular dichroism spectra. Out of the possible enantiomers or diastereoisomers resulting from the combinations of the “S” or “R” forms of the two constituent amino acids, we have performed our calculations – for each of the three investigated dipeptides (c-GlyPhe, c-TrpTyr, c-TrpTrp) – on the form on which experiments were performed, i.e. the “S” enantiomer for c-GlyPhe, and the “S,S” diastereoisomer for both c-TrpTyr and c-TrpTrp.

3.1 Cyclo(GlyPhe)

In Fig. 2 we show the three lowest energy conformers of the cyclo(GlyPhe) peptide as obtained by the tight-binding conformational search using the CREST code. According to this method, the first conformer is much more populated than the following two conformers (second column of Table 1). Subsequent geometry relaxation within DFT B3LYP, both using the ORCA localized basis all-electron code (third column of Table 1) and the QE plane-wave pseudopotential code, confirms the energy ordering of these conformers, and it does not substantially alter their geometry.


	Fig. 2 Optimized geometries of the three lowest energy conformers of the c-GlyPhe dipeptide, i.e. conformer 1 (a), conformer 2 (b) and conformer 3 (c), each shown in two different orientations. Carbon atoms are depicted in black, oxygen atoms in red, nitrogen atoms in light blue and hydrogen atoms in white.

Table 1 Tight-binding energies and populations, and ORCA B3LYP energies of the three lowest energy conformers of GlyPhe

	TB ΔE (mHa)	TB pop. (%)	ORCA ΔE (mHa)
	TB ΔE (mHa)	TB pop. (%)	B3LYP + vdW
conf1	0.00	84.3	0.00
conf2	2.40	6.6	3.47
conf3	2.87	4.0	3.92

We use cyclo(GlyPhe), which is the smallest of the three peptides under study, to compare the electronic properties with different levels of approximation for the exchange–correlation (xc) energy: HF, B3LYP and M062X, with the EOM-CCSD method, and with the GW approximation. This was done in order to investigate the sensitivity of these properties to the chosen method and xc functional, and to find the method and/or functional which yields the best trade off between agreement with experimental data and computational cost. In particular, we focused on the comparison between measured and theoretically estimated – with different approaches - PES.

We inspect the electronic densities of states (DOS) for the GlyPhe conformer 1 (see Fig. 3, panel (a)) resulting from DFT B3LYP (magenta curve in the figure) and HF (yellow curve) occupied electronic levels, and we compare the results with those obtained by calculating ionization energies using the EOM-CCSD method (blue) and the GW scheme (green curve). The latter are used as theoretical reference, and compared with the gas-phase experimental photoemission spectrum (black) of GlyPhe. In the EOM-CCSD DOS reported here each energy root has been weighed with its percentage contribution from single transitions, while in the GW simulation the quasi-particle poles ε_n are weighed by the renormalization factor Z_n.


	Fig. 3 Panel (a): GlyPhe conformer 1. Hartree–Fock (yellow) and B3LYP (magenta) DOS for the occupied electronic levels, compared with the GW (green), and CCSD (blue) PE spectral function and with experimental PE spectrum (black). Inset: QE Hartree–Fock (yellow), ORCA Hartree–Fock (red), QE B3LYP (magenta), ORCA B3LYP (brown) unoccupied electronic energy levels compared with GW (green) and CCSD (blue) inverse PE energies. GW corrections have been computed for the five bound QE B3LYP empty levels only. The vacuum level is set at 0 energy. In the HF, B3LYP and GW cases occupied levels are shifted by +2 eV, −2.5 eV, −1.47 eV respectively, with empty levels unshifted. Panel (b): DFT B3LYP DOS of GlyPhe conformer 1 (magenta) and conformer 2 (dark red), both shifted by −2.5 eV, compared to experimental photoemission spectrum (black); inset: DFT B3LYP occupied electronic energy levels of GlyPhe conformer 1 (magenta) and conformer 2 (dark red): the vacuum level is set at 0 energy. The vertical axis in the two insets (as the horizontal one in the two large panels) reports energies in eV. Curves for calculated DOS and PE spectral functions are obtained from the corresponding electronic energy levels by applying a Lorentzian broadening of 0.5 eV.

We can observe that the B3LYP calculation provides a better description of the energy distribution of occupied electronic states with respect to the HF one, provided that a rigid shift of −2.5 eV is applied‡ to the B3LYP levels in order to align the B3LYP HOMO level with the EOM-CCSD at −9.25 eV and thus to the experimental IE. We also report that the B3LYP IE obtained via total energy difference is E^N−1₀ − E^N₀ = 8.57 eV, while the corresponding value obtained using the meta-GGA global hybrid M062X functional is 9.25 eV, thus matching the CCSD value. HF tends to overbind all main features of the PE spectrum by ≈−2 eV. This is why a rigid shift in the opposite direction is applied to the plot of the occupied HF levels. However, apart from this rigid shift, the agreement between the distribution of HF levels and the PE spectrum is not good, and HF tends to overestimate the energy splitting of the levels close to the HOMO of ≈+2 eV. As a consequence, the unshifted ε^HF_HOMO = 9.29 eV is very close to the CCSD result, but this is the result of two errors with opposite sign. For better clarity the unshifted HOMO/IE are reported in Table 2 and later discussed.

The EOM-CCSD method, taking into account dynamical electronic correlation, is able to yield a PE spectral function (blue curve) in good agreement with the experimental PE spectrum, despite starting from the Hartree–Fock calculation which by itself badly describes the energy distribution of electronic levels. The EOM-CCSD PE energies and the B3LYP DOS are in a surprising and satisfactory agreement with each other in the analyzed energy range, although the CCSD scheme is numerically much more demanding. The GW corrections (green curve) on top of the B3LYP energies lead to a final result which is even closer to the EOM-CCSD one, in particular in the width of the first DOS feature (the one at ≈−10 eV) and in the detailed shape of the various peaks/features in the −11 eV to −19 eV range. GW also partially cancels the underestimation of the HOMO binding energy. A residual underestimation of −1.47 eV remains. The GW levels in Fig. 3 are then shifted by such value. Finally GW and EOM-CCSD also agree for the renormalization of the PE peaks intensity, determined by the percentage contribution from single transitions in EOM-CCSD and by the renormalization factor Z_n in GW. In the EOM-CCSD approach, contributions from double transitions are negligible up to ≈−19 eV (an energy close to the double ionization threshold⁴⁹), and the mixing between single and double transitions is very small in the whole analyzed energy range, (i.e., up to ≈−25 eV). With the exception of two energy roots, with percentage contributions from single transitions of 80%, ≈10% respectively, the contribution from single transitions to a given energy root is either larger than 90%, or below 0.5%. Similarly the renormalization factors Z_n of the GW calculation are all above 0.75 up to ≈−23 eV.

In panel (b) of Fig. 3 we compare DFT B3LYP occupied energy levels and DOS of the two lowest energy conformers of the GlyPhe peptide. Although the geometries of these two conformers are quite different (Fig. 2), the energy positions of their electronic levels (inset of Fig. 3(b)) display minor mutual differences only. Once a Lorentzian broadening is applied to these electronic levels in order to obtain the densities of states of the two conformers, these latter appear hardly distinguishable from each other, being both in good agreement with the features in the PE spectrum.

We now focus on empty electronic levels. The inset of Fig. 3, panel (a), shows the position of unoccupied electronic energy levels of GlyPhe conformer 1 obtained within Hartree–Fock (HF) and B3LYP, compared with the inverse PE energies obtained within the GW method on top of B3LYP and within the EOM-CCSD method. For the GW method we only consider the correction to the QE B3LYP bound empty states. No energy shift is applied to the empty states besides the vacuum level correction computed within HF (0.18 eV on the cubic cell) and B3LYP (0.18 eV on the cubic cell; 0.25 eV on the FCC cell). This is somehow equivalent to assume that the shifts applied to the occupied states correspond to an opening of the band gap, i.e. the energy difference between the IE and minus the EA. EOM-CCSD gives a negative EA, i.e. the first root has positive energy of 1.15 eV (fifth column in Table 2). In contrast B3LYP gives ε^KS_n < 0 for n = LUMO and 5 bound empty states in total. The B3LYP EA obtained via total energy difference is instead negative E^N₀ − E^N+1₀ = −0.81 eV. At the HF level the first unoccupied state localized on the molecule, obtained from the simulation with localized basis set, is at ε^HF_LUMO ≈ 2.3 eV. We refer to such state as a resonance; in the ESI† we show the spatial localization of this state. In the figure inset, HF resonances from ORCA (with localized basis set) are highlighted in red in the dense list of states obtained with QE using the plane-waves basis set, which also captures vacuum states. In a similar way, B3LYP empty levels obtained with ORCA are highlighted in brown in the list of B3LYP empty states obtained with QE. In the B3LYP case we obtain respectively five (two) bound empty states with QE (ORCA), but without a significant variation in the energy of the first unoccupied state (−0.75 eV with QE B3LYP, −0.70 eV with ORCA B3LYP). The GW correction to the bound B3LYP empty levels also gives a negative EA with ε^GW_LUMO = 1.21 eV, quite close to the EOM-CCSD result. GW corrections to the delocalized vacuum states are not considered. Similarly to the occupied levels case, in the EOM-CCSD approach, the contribution from single transitions is larger than 90% for all the inverse PE energy roots, and in GW the renormalization factors Z_n are larger than 0.9 for all the computed empty levels.

Table 2 Ionization energy (IE) and the opposite of the electron affinity (−EA) of GlyPhe conformer 1 with different methods. Values are in eV. The experimental value (column 6) is a vertical IE and it corresponds to the maximum of the first feature in the photoelectron spectrum. Also the IE and EA values calculated via DFT, with either the B3LYP (column 2) or the M062X (column 3) exchange–correlation functional, both with the def2-QZVPP basis set and the D3 vdW correction, are vertical values. In the ESI we report further IE and EA values for the three investigated cyclo-dipeptides, calculated via total energy differences, including a comparison between vertical and adiabatic values

	HF eigen.	B3LYP Etot	M062X Etot	GW eigen.	CCSD roots	Exp.
IE	9.29	8.57	9.25	7.94	9.25	9.54 ± 0.02
-EA	2.30	0.81	0.91	1.21	1.15
gap	11.59	9.38	10.16	9.15	10.4

We underline that in the positive energy range it is not possible to compare the energy levels distribution between the different approaches due to the presence of delocalized states from the continuum. Continuum states are captured when using the plane-waves basis set. On the other hand the approach based on localized basis set only captures resonances, although missing their delocalized contribution (resonances have positive energy, as a consequence they must be in part delocalized) and thus possibly introducing an error in the resulting energy. A good description of resonances using a plane-waves basis set would require very large boxes or possibly some lifetime to accelerate convergence. Still the comparison between results with the two basis sets is interesting to be investigated also with the finite boxes we use (see ESI†).

The overall picture is that GlyPhe has a negative EA of 1.15 eV, a IE of 9.25 eV and a gap, or difference between IE and EA, of 10.4 eV. In Table 2 we report the IE and the EA from the different approaches, assuming that HF and B3LYP eigenvalues for the HOMO and the LUMO are a zero order approximation. B3LYP eigenvalues, not shown in the table, underestimate both the IE (6.91 eV) and EA (0.75 eV), and as a consequence the gap, while HF does the opposite.⁵⁰ B3LYP and M062X total energies and GW eigenvalues still underestimate the gap and the IE and EA, but the error is much smaller. In particular, the meta-GGA global hybrid M062X functional yields vertical IE and EA values in better agreement than B3LYP with those obtained with the CCSD method, and therefore a IE value which is closer to the experimental one (also vertical). The different results are mostly affected by how the electron exchange or Fock term enter in the description. HF has the full exchange term which opens too much the electronic gap, while B3LYP has only a fraction of the exchange which is not enough. Both GW and CCSD add extra terms which lead to a screening of the exchange (in GW via the RPA screening and in CCSD via the inclusion of terms which mimic the IP screening) leading to the correct inclusion of this effect. The dynamical corrections included both within GW and CCSD do not play a significant role since all poles have dominant single character (/are well described within the QP approximation). However CCSD also includes extra terms, neglected in GW, that give a sizable effect. These extra terms are known to be important in small molecules, but their relevance usually decreases while increasing the system size (up to the limit of extended systems where GW gives a very good physical description). Finally, while the use of screened exchange and extra terms is important for a correct electronic gap, the energy levels distribution is much less sensitive as already discussed.

In order to shed light on similarities between the EOM-CCSD on top of HF methods and the GW on top of B3LYP scheme, we also analyzed the spatial localization of some B3LYP occupied and empty electronic states in the energy region of the frontier orbitals, encompassing the gap, as shown in Fig. 4 for the GlyPhe conformer 1. Each of the EOM-CCSD ionization energies can be interpreted as a linear combination of ionizations of the subsiding Hartree–Fock levels. We thus compare the spatial localization of the HOMO B3LYP bound electronic states with that of the weighed superposition of the HF states mostly contributing to the EOM-CCSD ionization energies.


	Fig. 4 DFT B3LYP orbitals of GlyPhe conformer 1.

As shown in Fig. 5, the agreement is very good with the first EOM-CCSD ionization energy mixing HOMO and HOMO−2 HF orbitals to lead to a final wave-function which is very similar to the B3LYP HOMO. We did the same analysis considering also the following 2 EOM-CCSD PE energies which we label here IE-1 and IE-2 (ESI†). IE-1 mixes HOMO, HOMO−1 and HOMO−2 HF orbitals leading to a wave-function very close to the B3LYP HOMO−2 state. IE-2 mixes HOMO−5, HOMO−2, HOMO−1 and HOMO HF orbitals, leading to a wave-function similar (but with some differences) to the B3LYP HOMO−5 state. In conclusion, from the analysis of both electronic densities of states and spatial localization of molecular orbitals, we can thus conclude that DFT B3LYP calculations provide a reasonably good description of the electronic properties, if compared to the EOM-CCSD approach, but at a much lower computational cost. This is why we take the liberty of using the wording “molecular orbitals” also for the B3LYP ones.


	Fig. 5 GlyPhe conformer 1: Hartree–Fock orbitals (left) contributing to the CCSD ionization energy (center), and the B3LYP HOMO (right).

3.2 Cyclo(TrpTyr)

We now turn our attention to the TrpTyr molecule. As opposed to GlyPhe, TrpTyr has two chiral centers, one on each of the two constituent amino acids. Therefore, in addition to chemically indistinguishable enantiomers of the TrpTyr dipeptide, i.e. pairs of molecule isomers each of which is the mirror image of the other (such as SS vs. RR), we can also have pairs of chemically distinguishable diastereoisomers, such as SS vs. SR, where e.g. “SR” indicates a TrpTyr molecule with Trp in its “S” enantiomer and Tyr in its “R” one. For our calculations we chose the “SS” diastereoisomer, the one used in the photoemission measurements.

In Fig. 6 we show the five lowest energy conformers of the “S,S” diastereoisomer of the cyclo(TrpTyr) peptide as obtained by a tight-binding conformational search using the CREST code. Conformers are labeled (1, 2,…) according to their tight-binding energy ordering (column 1 of Table 3). For TrpTyr the energy ordering of the conformers obtained through the tight-binding run changes upon performing a DFT B3LYP geometry relaxation, with either the ORCA or the QE code, although their geometries remain substantially unaltered. In particular, the energy ordering obtained after performing ORCA B3LYP relaxations (column 5 of Table 3) is the following (using conformer labels based on their tight-binding energy ordering): 1, 5, 4, 2, 3. Thus conformer 1 remains the lowest energy geometry, while conformers 5 and 4, in which (as in conformer 1) the side-chain rings of the tyrosine and tryptophan amino acids are closer to each other, become energetically favored with respect to conformers 2 and 3. Based on these results, we decided to focus on conformers 1, 5 and 4 for further calculations on electronic properties.


	Fig. 6 Geometries of the five lowest energy conformers of the cyclo(TrpTyr) peptide. Color codes as in Fig. 2.

Table 3 Tight-binding energy differences and populations, and ORCA B3LYP energies of the five lowest energy conformers of TrpTyr

	TB ΔE (mHa)	TB pop. (%)	ORCA ΔE (mHa)
	TB ΔE (mHa)	TB pop. (%)	B3LYP	vdW	Sum
conf1	0.00	30.4	0.00	0.00	0.00
conf2	0.38	20.3	0.32	2.52	2.84
conf3	0.57	16.6	0.59	2.57	3.16
conf4	1.14	0.09	0.26	1.79	2.06
conf5	1.49	0.06	1.49	−0.49	1.00

Intramolecular weak interactions such as the van der Waals forces can have an important role in molecule stability, and the chosen method for their computational treatment can alter the energy ordering of molecule conformers. π–π and CH–π interactions can contribute to stabilizing specific conformers, in molecules containing aromatic rings. Indeed, out of the five low-energy geometries obtained through the tight-binding conformational search, conformers 1, 5 and 4 appear to be stabilized to some extent by π–π interactions between the phenol ring of tyrosine and the indole ring of tryptophan, while the geometries of conformers 2 and 3 suggest a possible CH–π interaction between the C_β group of tyrosine and the indole ring of tryptophan. Interestingly, the dispersive vdW contribution (column 4) to the computed total energy of c-TrpTyr has an important role in determining the DFT B3LYP energy ordering of its conformers (see column 5 of Table 3). We recall that a similar vdW correction is present in the TB simulation. More in detail: the ORCA B3LYP calculation (third column in the table) tends to energetically favor conformer 4 with respect to the energy ordering obtained from the tight-binding conformational search (first column) as long as we do not consider the vdW contribution to the DFT total energy. The vdW contribution to the DFT energy (fourth column) is negative for conformer 5 only – and zero or positive for the other conformers – thus strongly favoring conformer 5. As a final result, the energy ordering of conformers according to the total B3LYP energy including the vdW contribution is, as already mentioned, 1, 5, 4, 2, 3. The same final energy ordering is also obtained using the D4 vdW dispersion correction^51,52 in the ORCA B3LYP calculation, instead of the D3 discussed so far: the only remarkable difference is that the ORCA calculation with the D4 dispersion correction yields a positive vdW contribution for conformer 5; this is not sufficient to change the energy ordering of the conformers, but it brings the energy of conformer 5 very close to that of conformer 4 (differing by 1 meV only).

In Fig. 7, panel (a), we show the electronic densities of occupied states for TrpTyr conformer 1 (i.e. the most populated one according to both tight-binding conformational search and B3LYP DFT) obtained either from DFT B3LYP (magenta curve), or from a valence photoemission spectrum (blue) obtained through a EOM-CCSD calculation. Also for this cyclo-dipeptide, as already observed for GlyPhe, the DFT B3LYP and EOM-CCSD densities of states are in good agreement with each other and with the experimental photoemission spectrum (black curve), once a −2.5 eV energy shift is applied to the B3LYP DOS. Also here (as in the GlyPhe case) the EOM-CCSD energy roots have been weighed with their percentage contribution from single transitions. For TrpTyr this percentage is higher than 90 in all the analyzed energy range, which is indeed shorter than the one considered for GlyPhe (since considering the same number of energy roots for a larger molecule).


	Fig. 7 Panel (a): DOS of TrpTyr conformer 1 obtained with the CCSD method (blue), and with DFT B3LYP (magenta), compared to experimental photoemission spectrum of TrpTyr (black): the B3LYP DOS is shifted by by −2.5 eV. Panel (b): DFT B3LYP DOS of TrpTyr conformer 1 (magenta), conformer 5 (dark red) and conformer 4 (green), all shifted by −2.5 eV, compared to experimental photoemission spectrum (black). The horizontal axes of both panels report energies in eV. Curves for calculated DOS and PE spectral functions are obtained from the corresponding electronic energy levels by applying a Lorentzian broadening of 0.5 eV.

As for the sensitivity of the DFT B3LYP densities of states to molecule conformation, panel (b) of the same figure shows the DFT B3LYP DOS for the three lowest energy conformers of TrpTyr according to DFT B3LYP, i.e. conformers 1, 5 and 4 according to tight-binding labeling. Also for TrpTyr we can only observe minor differences among the densities of states of different conformers, such as a shoulder at ≈8 eV ionization energy, which is most pronounced in conformer 5, less in conformer 1, and almost absent in conformer 4. As such, also for TrpTyr the densities of states of all the three analyzed conformers are in good agreement with the experimentally measured photoemission spectrum of the molecule (black curve in the figure).

The spatial localization of electronic states of TrpTyr lowest energy conformer (conformer 1) is shown in Fig. 8. Frontier orbitals only are considered, calculated within DFT B3LYP with either the plane-wave pseudopotential code QE or the localized basis all-electron code ORCA. As expected, the two codes give almost identical results, apart from a switch between HOMO−2 and HOMO−1 which are very close in energy (Table 4).


	Fig. 8 QE (upper panel) and ORCA (lower panel) B3LYP orbitals of TrpTyr conformer 1.

Table 4 Energies of the highest occupied and of the lowest unoccupied QE B3LYP and ORCA B3LYP electronic levels for conformer 1 of TrpTyr. Occupied levels are shifted by −2.5 eV

	QE (eV)	ORCA (eV)
LUMO+3(*)	−0.3325	+0.2208
LUMO+2(*)	−0.4689	+0.0528
LUMO+1	−0.6307	−0.3518
LUMO	−0.7573	−0.5886
HOMO	−8.2152	−8.0903
HOMO−1	−8.7344	−8.5768
HOMO−2	−8.7602	−8.6369

From Table 4 we can also observe that the ORCA B3LYP calculation yields two bound empty states only (column 2); on the other hand, in the QE B3LYP calculation we find seven bound empty states, i.e. three further states in addition to the ones appearing in column 1 of the table. As already observed for the c-GlyPhe dipeptide, in the ORCA B3LYP calculation with localized bases empty levels are generally less “dense” in energy with respect to those obtained from the plane-waves QE B3LYP calculation, while the energy distribution of occupied states is very similar with the two approaches. This trend can be rationalized by recalling that DFT calculations with plane-waves basis sets are able to also capture continuum(-like) delocalized empty electronic states, as opposed to localized basis codes, which only capture resonances, possibly introducing an error in their energies.

3.3 Cyclo(TrpTrp)

The TrpTrp dipeptide has two chiral centers (one for each of the two constituent amino acids) and therefore, as already mentioned for TrpTyr, the four combinations of enantiomers of the single amino acids “S,S”, “R,R”, “S,R”, “R,S” are possible. Being TrpTrp composed of two identical molecules (at a difference with the other two analyzed dipeptides), the “S,R” and “R,S” isomers may in principle be achiral due to symmetry reasons, however their symmetry and therefore chirality properties will in general depend on the conformation. In our calculations we considered the “SS” diastereoisomer, i.e the one used in the photoemission measurements.

Also for the “S,S” form of TrpTrp, as already found for TrpTyr, the energy ordering of conformers as obtained through the initial TB conformational search is not maintained in QE DFT B3LYP (see columns 1 and 3 of Table 5). In this case not even the lowest energy conformer is conserved: labeling conformers according to their TB energy order, within DFT B3LYP with the QE code the lowest energy geometry is conformer 3, followed by conformers 2, 1, 4 and 5 (their geometries are shown in Fig. 9). Also for this dipeptide, some of the five TB lowest energy conformers, i.e. conformers 2, 1 and 5, appear likely to be stabilized to some extent by π–π interactions between the aromatic rings of the two constituent amino acids – here two tryptophan indoles – and others (conformers 3 and 4) suggest CH–π interactions between the C_β group of one tryptophan and the indole ring of the other.

Table 5 Tight-binding energies and populations, and QE B3LYP energies of the five lowest energy conformers of TrpTrp

	TB ΔE (mHa)	TB pop. (%)	QE ΔE (mHa)
	TB ΔE (mHa)	TB pop. (%)	B3LYP + vdW
conf1	0.00	35.1	0.00
conf2	0.52	20.3	−0.15
conf3	1.21	19.5	−0.41
conf4	1.51	14.3	1.74
conf5	2.53	4.8	5.00


	Fig. 9 Geometries of the five lowest energy conformers of the cyclo(TrpTrp) peptide. Color codes as in Fig. 1.

A more detailed analysis (see ESI†), exploring the dependence of the DFT total energy ordering of the five lowest energy conformers from the tight binding conformational search on the computational details of the DFT calculation, such as cell size, localized bases (ORCA) vs. plane waves (QE), type of vdW treatment, exchange–correlation functional, pseudopotentials, has shown that the energy ordering among conformers 1, 2 and 3 can change in some cases, but these three conformers are always lower in energy than conformers 4 and 5. Moreover, the energy differences among conformers 1, 2 and 3 are in most cases within the value of k_BT at room temperature, and lower than those between these three conformers and the other two (conformers 4 and 5). This supports our choice to analyze conformers 1, 2 and 3 as lowest energy/most populated conformers of this peptide.

The agreement between the electronic densities of states obtained either from DFT B3LYP energy levels (green curve in panel (a) of Fig. 10) or from a valence photoemission spectrum (blue curve) obtained through a EOM-CCSD calculation is quite good, and both curves (obtained for the lowest energy conformer according to DFT B3LYP) reproduce reasonably well the experimental photoemission spectrum (black) of TrpTrp in the analyzed energy range. The EOM-CCSD energy roots have been weighed with the percentage contribution from single transitions, which is also in this case (as in TrpTyr) higher than 90 for all the obtained ionization energies.


	Fig. 10 Panel (a): DOS of TrpTrp conformer 3 obtained with the EOM-CCSD method (blue), and with DFT B3LYP (green), compared to the experimental photoemission spectrum of TrpTrp (black): the B3LYP DOS is shifted by −2.5 eV. Panel (b): DFT B3LYP DOS of TrpTrp conformer 1 (red), conformer 2 (blue) and conformer 3 (green), all shifted by −2.5 eV, compared to experimental photoemission spectrum (black). The feature at −12.6 eV in the experimental spectrum is due to residual water in the sample. The horizontal axes of both panels report energies in eV. Curves for calculated DOS and PE spectral functions are obtained from the corresponding electronic energy levels by applying a Lorentzian broadening of 0.5 eV.

A comparison between DFT B3LYP densities of states of the three lowest energy conformers of the TrpTrp dipeptide (panel (b) of Fig. 10) shows a negligible conformational sensitivity of the DOS also for this molecule: the three DOS curves are almost indistinguishable, despite the rather different geometries of the analyzed conformers (Fig. 9). Also in this case, the calculated density of states alone would not be sufficient in order to draw any conclusions on the presence of the different possible conformers in the experimental sample.

In Fig. 11 we report the spatial localization of QE DFT B3LYP highest occupied and lowest unoccupied electronic states of the TrpTrp lowest energy conformer (conformer 3). Interestingly, molecular orbitals ranging from HOMO−3 to LUMO+1 are alternately localized on the indole ring of either of the two tryptophan amino acids. This is likely due to structural differences responsible for a different chemical environment of the two otherwise identical indole groups. Molecular orbitals localized on the diketopiperazine ring resulting from peptide cyclization appear at lower energies (HOMO−4 to HOMO−7 levels).


	Fig. 11 DFT B3LYP orbitals of TrpTrp conformer 3.

4 Discussion

The measured photoemission spectra of the three cyclo-dipeptides under study (see Fig. 1 and black curves in Fig. 3, 7 and 10) share some general characteristics, namely a first feature in the 8–10 eV energy region, followed after a gap of about 2 eV by a series of broader bands. In the first feature two structures are clearly visible in the spectra of TrpTrp and TrpTyr, while they merge in the GlyPhe spectrum. The energy of the first feature in the experimental photoelectron spectra increases from 7.97 ± 0.04 eV of TrpTrp to 8.18 ± 0.04 eV in TrpTyr and then 9.54 ± 0.02 eV in GlyPhe. The trend is the same of the ionization potential of the three aromatic aminoacids Trp, Tyr and Phe, confirming the role of the aromatic chromophore, i.e. the side chain in the aromatic amino acids building the cyclo-dipeptide, in determining the chemical physics properties of the peptide. This is consistent with previous observations in the photoelectron spectra of other cyclo-dipeptides containing amino acids with an aromatic side chain.¹³ On the other hand, the ionization potentials in the cyclo-dipeptides are higher than those of the constituent aromatic amino acids. This can be considered as a sign of the stability effect induced by the orbitals of the DKP ring on the ones of the aromatic ring of the side chain.

Indeed, the IE is given by the difference between E^N₀ (the ground state total energy with N electrons) and E^N−1₀ (the ionized state with N − 1 electrons), so that, when comparing the IE values of different molecules, the observed trends will depend non-trivially on the combined effect of variations in E^N₀ and in E^N−1₀. In particular, the trend observed for the IE values of the three investigated cyclo-dipeptides c-GlyPhe, c-TrpTyr, c-TrpTrp, i.e. lower IEs for larger aromatic system, can be rationalized by considering that a larger aromatic group will make the ionized molecule more stable (lowering of E^N−1₀) due to charge delocalization. When comparing the IEs of these cyclo-dipeptides with those of the corresponding single amino acids, instead, the above-mentioned stabilizing effect of the DKP ring on the neutral molecule (lowering of E^N₀) can explain the higher values of IE found for cyclo-dipeptides with respect to single amino acids.

Similar trends are predicted by B3LYP and CCSD calculated densities of electronic states (colored curves in Fig. 3, 7 and 10). From the numerical simulations we can analyze the orbitals which determine the ionization potential and contribute to the first peak in the DOS, in the energy range between 8–10 eV. To this end we plot in Fig. 12, for the case of the TrpTyr, the electronic density obtained from the orbitals in the selected energy range. Similar informations can be obtained for GlyPhe and TrpTrp, by looking at the orbitals in Fig. 4 and 11. From the plots we clearly see that the ionization potential is determined by “pz-like” orbitals in the aromatic rings and that these orbitals provide a large contribution to the whole first feature (consisting of two substructures, which merge for GlyPhe only) in the DOS. The second substructure in this first DOS feature shows contributions also from the orbitals on the DKP ring (highlighted in blue in Fig. 12). In the case of phenylalanine, the energy of the ionization of the phenyl ring approaches the one of the DKP ring,¹⁴ and the DOS shows a set of six states very close to each other. As a consequence the highest occupied orbitals of the phenyl side chain are mixed with the DKP ones, and produce the single broader feature observed in the experiment as well as in the calculated DOS for GlyPhe. After these frontier orbitals, theory predicts a region void of states consistently with the gap observed in the experimental spectra. Then a region with a high density of states (29 in the case of TrpTyr as shown in Fig. 12) follows. These electronic states lie in the planes of the aromatic rings, primarily due to the network of sp² orbitals of the carbon atoms in these rings.


	Fig. 12 Bottom left: [−20, −5] eV part of the plot (see Fig. 7 panel (a)) showing the B3LYP calculated DOS of TrpTyr conformer 1. Other panels: plots showing the density of states, for TrpTyr conformer 1, integrated on selected energy ranges, corresponding to specific regions of the DOS plot.

All the above considerations are based on the analysis of the B3LYP orbitals, which is simple to perform. The good quality of the B3LYP description is confirmed by the agreement with the experimental data, but also by the more refined calculations using the CCSD approach. The latter confirm that the signatures of PES are mostly related to single particle features, with only a very weak satellite appearing in GlyPhe, i.e. the smallest of the molecules. This is in contrast to what often happens in small molecules where mixing with double and higher order excitations can have an important role. The theoretical description is reminiscent of what happens in extended systems which involve mostly s and p electrons. In such cases DFT provides a reasonable description of the electronic density, and only the eigenvalues need to be corrected. Indeed, for GlyPhe only, we also performed calculations within the state of the art DFT + GW scheme, usually employed to describe extended systems, and found good agreement with the CCSD scheme. The key ingredient of the GW scheme is the screening, whose role grows in importance with the system size, while making other correlation effects included in schemes like CCSD less important. GW is not on top of the CCSD results, but it is reasonably close, and it also confirms that the PES can be well described in terms of single-particle features.

This can be further understood in terms of the available phase space. When the system size is increased, so is the number of single particle orbitals, which then provide enough phase space to reach a good description of the electronic properties of the molecules. Observations related to the system size can be done also for the HOMO–LUMO gap and its trend with the molecule size. The HOMO–LUMO gap provides an indication of the stability of the activated molecule towards further chemical reactions. In our case we found the HOMO–LUMO gap tends to decrease as the size of the side chain group becomes larger. This is similar to what happens for example in solid state physics when moving from smaller to bigger nano-structures, to the bulk limit. The bigger the system, the more available phase space to relax, the smaller the electronic gap.

5 Conclusions

We have performed a combined theoretical and experimental study of three cyclo-dipeptides built on amino acids with an aromatic side chain. Theory has allowed assignment of the main features of the photoelectron spectra, which share the characteristic of a feature due to the frontier orbitals located on the aromatic ring. Both theory and experimental data confirm that the ionization energy decreases with increasing size of the aromatic side chains. These two observations indicate the role of the side chain in determining the (photo)chemical properties of the molecules. The detailed analysis of the spatial localization of electronic states performed in this work will pave the way to the interpretation of the fragmentation of these cyclo-dipeptides by VUV radiation and on the related topic of the charge mobility following ionization between the functional groups making up the cyclo-dipeptide.

Moreover, we also used the molecules to analyze different computational schemes, in particular comparing an accurate quantum chemistry scheme, based on CCSD, with the state of the art approach for extended systems, GW on top of DFT. The goal was to validate the computationally cheaper DFT results; but also to draw considerations related to the importance of dynamical corrections (i.e. multiple excitations or satellites), electronic screening, and the system size. We consider CCSD as the reference scheme for these molecules, but we also observe that their size is approaching the point where the DFT + GW scheme can also be considered a good alternative.

Author contributions

DS, and LA designed the project. EM, GM, and PA performed the numerical simulations. LA, PB, LC, FV, YW, RBV, MN, MS, MV, CA, RR performed the experimental measurements. EM, DS, GM, and LA wrote the manuscript. All the authors have carefully read the manuscript.

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

The present work was performed in the framework the PRIN 20173B72NB research project “Predicting and controlling the fate of biomolecules driven by extreme-ultraviolet radiation”, which involves a combined experimental and theoretical study of electron dynamics in biomolecules with attosecond resolution.

Notes and references

A. K. Mishra, J. Choi, S.-J. Choi and K.-H. Baek, Molecules, 2017, 22, 1796 CrossRef PubMed.
G. Mattioli, L. Avaldi, P. Bolognesi, J. D. Bozek, M. C. Castrovilli, J. Chiarinelli, A. Domaracka, S. Indrajith, S. Maclot, A. R. Milosavljevic, C. Nicolafrancesco, C. Nicolas and P. Rousseau, Sci. Rep., 2020, 10, 13081 CrossRef CAS PubMed.
K. Zhao, R. Xing and X. Yan, Pept. Sci., 2021, 113, e24202 CrossRef CAS.
A. Jeziorna, K. Stopczyk, E. Skorupska, K. Luberda-Durnas, M. Oszajca, W. Lasocha, M. Grecki, J. Frelek and M. J. Potrzebowski, Cryst. Growth Des., 2015, 15, 5138–5148 CrossRef CAS.
G. Danger, R. Plasson and R. Pascal, Chem. Soc. Rev., 2012, 41, 5416–5429 RSC.
J. Ying, R. Lin, P. Xu, Y. Wu, Y. Liu and Y. Zhao, Sci. Rep., 2018, 8, 936 CrossRef PubMed.
F. Calegari, D. Ayuso, A. Trabattoni, L. Belshaw, S. D. Camillis, S. Anumula, F. Frassetto, L. Poletto, A. Palacios, P. Decleva, J. B. Greenwood, F. Martin and M. Nisoli, Science, 2014, 346, 336–339 CrossRef CAS PubMed.
M. Nisoli, P. Decleva, F. Calegari, A. Palacios and F. Martin, Chem. Rev., 2017, 117, 10760–10825 CrossRef CAS PubMed.
F. Krausz and M. Ivanov, Rev. Mod. Phys., 2009, 81, 163–234 CrossRef.
F. Gao and O. Inganas, Phys. Chem. Chem. Phys., 2014, 16, 20291–20304 RSC.
R. Weinkauf, P. Schanen, A. Metsala, E. W. Schlag, M. Buergle and H. Kessler, J. Phys. Chem., 1996, 100, 18567–18585 CrossRef CAS.
F. Rozzi, C. A. Troiani and I. Tavernelli, J. Phys.: Condens. Matter, 2018, 30, 013002 CrossRef PubMed.
A. P. W. Arachchilage, F. Wang, V. Feyer, O. Plekan and K. C. Prince, J. Chem. Phys., 2012, 136, 124301 CrossRef PubMed.
A. P. W. Arachchilage, F. Wang, V. Feyer, O. Plekan and K. C. Prince, J. Chem. Phys., 2010, 133, 174319 CrossRef PubMed.
M. P. Pileni, M. Bazin and R. Santus, Chem. Phys. Lett., 1977, 51, 61–64 Search PubMed.
K. Tao, Z. Fan, L. Sun, P. Makam, Z. Tian, M. Ruegsegger, S. Shaham-Niv, D. Hansford, R. Aizen, Z. Pan, S. Galster, J. Ma, F. Yuan, M. Si, S. Qu, M. Zhang, E. Gazit and J. Li, Nat. Commun., 2018, 9, 3217 CrossRef PubMed.
R. Weinkauf, P. Schanen, D. Yang, S. Soukara and E. W. Schlag, J. Phys. Chem., 1995, 99, 11255–11265 CrossRef CAS.
D. Barreiro-Lage, P. Bolognesi, J. Charinelli, R. Richter, H. Zettergren, M. H. Stockett, L. Carlini, S. Diaz-Tendero and L. Avaldi, J. Phys. Chem. Lett., 2021, 12, 7379–7386 CrossRef CAS PubMed.
S. Grimme, J. Chem. Theory Comput., 2019, 15, 2847–2862 CrossRef CAS PubMed.
C. Bannwarth, S. Ehlert and S. Grimme, J. Chem. Theory Comput., 2019, 15, 1652–1671 CrossRef CAS PubMed.
G. Mattioli, L. Avaldi, P. Bolognesi, J. Bozek, M. Castrovilli, J. Chiarinelli, A. Domaracka, S. Indrajith, S. Maclot, A. R. Milosavljevic, C. Nicolafrancesco, C. Nicolas and P. Rousseau, Phys. Chem. Chem. Phys., 2021, 23, 15049–15058 RSC.
P. Hohenberg and W. Kohn, Phys. Rev., 1964, 136, B864–B871 CrossRef.
W. Kohn and L. J. Sham, Phys. Rev., 1965, 140, A1133–A1138 CrossRef.
P. Giannozzi, O. Andreussi, T. Brumme, O. Bunau, M. Buongiorno Nardelli, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, M. Cococcioni, N. Colonna, I. Carnimeo, A. Dal Corso, S. de Gironcoli, P. Delugas, R. A. di Stasio Jr, A. Ferretti, A. Floris, G. Fratesi, G. Fugallo, R. Gebauer, U. Gerstmann, F. Giustino, T. Gorni, J. Jia, M. Kawamura, H.-Y. Ko, A. Kokalj, E. Küçükbenli, M. Lazzeri, M. Marsili, N. Marzari, F. Mauri, N. L. Nguyen, H.-V. Nguyen, A. O. de-la Roza, L. Paulatto, S. Poncé, D. Rocca, R. Sabatini, B. Santra, M. Schlipf, A. P. Seitsonen, A. Smogunov, I. Timrov, T. Thonhauser, P. Umari, N. Vast, X. Wu and S. Baroni, J. Phys. Condens. Mater., 2017, 29, 465901 CrossRef CAS PubMed.
P. Giannozzi, S. Baroni, M. C. N. Bonini, R. Car, C. Cavazzoni, D. Ceresoli, G. L. Chiarotti, M. Cococcioni, I. Dabo, A. Dal Corso, S. de Gironcoli, S. Fabris, G. Fratesi, R. Gebauer, U. Gerstmann, C. Gougoussis, A. Kokalj, M. Lazzeri, L. Martin-Samos, N. Marzari, F. Mauri, R. Mazzarello, S. Paolini, A. Pasquarello, L. Paulatto, C. Sbraccia, S. Scandolo, G. Sclauzero, A. P. Seitsonen, A. Smogunov, P. Umari and R. M. Wentzcovitch, J. Phys. Condens. Mater., 2009, 21, 395502 CrossRef PubMed.
F. Neese, Wiley Interdiscip. Rev.: Comput. Mol. Sci., 2012, 2, 73–78 CAS.
N. Troullier and J. L. Martins, Phys. Rev. B: Condens. Matter Mater. Phys., 1991, 43, 1993 CrossRef CAS PubMed.
A. D. Becke, J. Chem. Phys., 1993, 98, 5648 CrossRef CAS.
C. T. Lee, W. T. Yang and R. G. Parr, Phys. Rev. B: Condens. Matter Mater. Phys., 1988, 37, 785 CrossRef CAS PubMed.
Y. Zhao and D. G. Truhlar, Theor. Chem. Acc., 2008, 120, 215–241 Search PubMed.
S. Grimme, J. Antony, S. Ehrlich and H. Krieg, J. Chem. Phys., 2010, 132, 154104 CrossRef PubMed.
G. Makov and M. C. Payne, Phys. Rev. B: Condens. Matter Mater. Phys., 1995, 51, 4014–4022 CrossRef CAS PubMed.
D. F. Shanno, Math. Comput., 1970, 24, 647 CrossRef.
F. Neese, Wiley Interdiscip. Rev.: Comput. Mol. Sci., 2018, 8, e1327 Search PubMed.
A. Schaefer, H. Horn and R. Ahlrichs, J. Chem. Phys., 1992, 97, 2571–2577 CrossRef CAS.
F. Weigend and R. Ahlrichs, Phys. Chem. Chem. Phys., 2005, 7, 3297–3305 RSC.
G. Borghi, A. Ferretti, N. L. Nguyen, I. Dabo and N. Marzari, Phys. Rev. B: Condens. Matter Mater. Phys., 2014, 90, 075135 CrossRef CAS.
D. Sangalli, A. Ferretti, H. Miranda, C. Attaccalite, I. Marri, E. Cannuccia, P. Melo, M. Marsili, F. Paleari, A. Marrazzo, G. Prandini, P. Bonfà, M. O. Atambo, F. Affinito, M. Palummo, A. Molina-Sánchez, C. Hogan, M. GrÃÂ¼ning, D. Varsano and A. Marini, J. Phys.: Condens. Matter, 2019, 31, 325902 CrossRef CAS PubMed.
G. Onida, L. Reining and A. Rubio, Rev. Mod. Phys., 2002, 74, 601–659 CrossRef CAS.
D. Golze, M. Dvorak and P. Rinke, Front. Chem., 2019, 7, 377 CrossRef CAS PubMed.
R. J. Bartlett, Wiley Interdiscip. Rev.: Comput. Mol. Sci., 2012, 2, 126–138 CAS.
A. K. Dutta, F. Neese and R. Izsak, J. Chem. Phys., 2016, 144, 034102 CrossRef PubMed.
A. K. Dutta, F. Neese and R. Izsak, J. Chem. Phys., 2016, 145, 034102 CrossRef PubMed.
A. Derossi, F. Lama, M. Piacentini, T. Prosperi and N. Zema, Rev. Sci. Instrum., 1995, 66, 1718–1720 CrossRef CAS.
R. P. Walker and B. Diviacco, Rev. Sci. Instrum., 1992, 63, 332–335 CrossRef.
H. A. Padmore, Rev. Sci. Instrum., 1989, 60, 1608–1615 CrossRef CAS.
O. Plekan, H. Sa’adeh, A. Ciavardini, C. Callegari, G. Cautero, C. Dri, M. Di Fraia, K. C. Prince, R. Richter and R. Sergo, et al. , J. Phys. Chem. A, 2020, 124, 4115–4127 CrossRef CAS PubMed.
G. Cautero, R. Sergo, L. Stebel, P. Lacovig, P. Pittana, M. Predonzani and S. Carrato, Nucl. Instrum. Methods Phys. Res., Sect. A, 2008, 595, 447–459 CrossRef CAS.
J. Eland and R. Feifel, Double Photoionisation Spectra of Molecules, Oxford University Press, 2017, p. 193 Search PubMed.
P. Mori-Sánchez, A. J. Cohen and W. Yang, Phys. Rev. Lett., 2008, 100, 146401 CrossRef PubMed.
E. Caldeweyher, C. Bannwarth and S. Grimme, J. Chem. Phys., 2017, 147, 034112 CrossRef PubMed.
E. Caldeweyher, S. Ehlert, A. Hansen, H. Neugebauer, S. Spicher, C. Bannwarth and S. Grimme, J. Chem. Phys., 2019, 150, 154122 CrossRef PubMed.

Footnotes

† Electronic supplementary information (ESI) available: Text and figures about unbound electronic states in the continuum: Fig. S1: calculated DOS for both occupied and empty electronic states of c-GlyPhe, obtained with different computational methods; Fig. S2: spatial localization of “resonance” states and of completely delocalized states. Text and tables (Tables S1–S4) on the energy ordering of c-TrpTrp conformers and its dependence on several computational details. Fig. S3 ans S4: spatial localization of EOM-CCSD and of B3LYP orbitals of c-GlyPhe. Tables S5 and S6: calculated vertical and adiabatic ionization energies and electron affinities for the cyclo-dipeptides under study. See DOI: 10.1039/d1cp04050b

‡ This is an extra shift on top of the vacuum level correction which we compute within the Makov–Payne and Martyna–Tuckerman (MT) schemes. For the box size used both schemes give ≈0.25 eV.

Click here to see how this site uses Cookies. View our privacy policy here.