Ivan D.
Avdeev
and
Dmitry S.
Smirnov
*
Ioffe Institute, 194021 St. Petersburg, Russia. E-mail: smirnov@mail.ioffe.ru
First published on 13th May 2019
The spin dynamics of localized charge carriers is mainly driven by hyperfine interaction with nuclear spins. Here we develop a theory of hyperfine interaction in transition metal dichalcogenide monolayers. Using group representation theory and the tight binding model we derive effective Hamiltonians of the intervalley hyperfine interaction in the conduction and valence bands. The spin–valley locking and pronounced spin–orbit splitting lead to a specific form of hyperfine interaction, which we call “helical”. We also demonstrate that the hyperfine interaction is noncollinear for chalcogen atoms in the general case. At the same time in the upper valence band the hyperfine interaction is purely of the Ising type, which suggests that the spin–valley polarization of localized holes in transition metal dichalcogenide monolayers can be conserved for a particularly long time.
Particularly promising for optoelectronic devices are zero-dimensional systems, like quantum dots based on TMD MLs.11,12 In principle any disorder in 2D structures leads to the localization of charge carriers.13 In practice TMD ML quantum dots can be made by chemical exfoliation14–16 and lithographic nanopatterning17 or charge carriers can be trapped by wrinkles,18 homojunctions,19 or defects.20–22
The spin–valley polarization lifetime for excitons is unavoidably limited by the exciton lifetime being in a few picosecond range.23 But this limitation is released for resident charge carriers. Their polarization can be preserved for a few nanoseconds in MoS2,24–26 and even longer in WSe2.27–29 For free charge carriers the polarization relaxation is related to the spin–orbit interaction, which can be suppressed by localization. In this case the dominant role in the spin and valley dynamics is played by the hyperfine interaction with the host lattice nuclear spins.30
In TMD MLs, in contrast to many other materials, the electron nuclear spin flips within a valley are suppressed by spin–orbit splitting, which is about 104 times larger than the hyperfine interaction. At the same time the intervalley spin flips should be accompanied by the transfer of the large momentum, equal to K+ − K−. This is, however, easily possible because of the very short range nature of the hyperfine interaction. For this reason in our work we will focus only on the intervalley hyperfine interaction and neglect spin flips in one valley.
Remarkably, the hyperfine interaction has the same relativistic origin as the spin–orbit interaction, so one can expect that it is also strong in TMD MLs. At the same time, the related effects in TMD MLs stay essentially unexplored.31,32 In this work we derive the Hamiltonian of the hyperfine interaction from a rigorous symmetry analysis and corroborate our results using the tight binding model.
We show that the low symmetry of TMD MLs allows for the noncollinear hyperfine interaction,33,34 so the nuclear spins can be flipped without the need to flip the valley pseudospin. This effect was previously observed in GaAs based quantum dots, where it manifested itself as a dragging of the quantum dot resonance frequency.35 Our calculations show that this effect is about two orders of magnitude stronger in TMD ML quantum dots.
The locking of spin and valley degrees of freedom also brings specifics to the hyperfine interaction. In this work we demonstrate that it leads to a “helical” structure of the interaction of the valley pseudospin with nuclei, which means that the components of the hyperfine interaction Hamiltonian are periodically modulated in space. This effect manifests itself in the dynamic nuclear spin polarization and formation of the nuclear spin polaron, which also have a helical structure.
Previous studies of spin relaxation in TMD ML quantum dots predicted the divergence of the polarization relaxation time in zero magnetic field.36 Physically the lifetime of spin–valley polarization in zero magnetic field is limited by the hyperfine interaction. We show that for electrons the interaction with nuclei leads to spin dephasing on the timescale of tens of nanoseconds. For holes, in contrast to previous misleading studies,31 we show that the hyperfine interaction in the upper subband is of the Ising type. So the polarization relaxation of holes is parametrically longer than that of electrons.
The paper is organized as follows. In Sec. 2 we perform the symmetry analysis of the hyperfine interaction. Then, in Sec. 3, using the tight binding model we calculate the components of the hyperfine interaction tensors. The main physical results are derived and discussed in Sec. 4. Finally, the conclusions are given in Sec. 5.
We chose the origin of the coordinate frame at the center of the hexagon, formed by the metal and chalcogen atoms,1,2 as shown in Fig. 1. We also choose the y axis to be oriented towards the nearest pair of chalcogen atoms. We note that caution should be taken regarding the choice of the coordinate frame origin and orientation of the axes, when comparing with the results of some other authors.3,37
The point symmetry of the TMD ML is D3h. This group consists of a horizontal (lateral) reflection plane σh‖(xy), three fold rotation axis C3‖z, three vertical reflection planes 3σv, three in-plane two fold rotation axes (including the y axis), and the combinations S3 = σhC3. In total, there are 12 symmetry operations including identity.
The valence and conduction band extrema are located at the two inequivalent K± points of the Brillouin zone, as described in the Introduction; also see the ESI† for more details. The wave vector point symmetry in these valleys is C3h, which is a subgroup of D3h lacking all the elements interchanging the K+ and K− valleys.
All the irreducible representations of the C3h group are one dimensional, so all electronic states in K± valleys are nondegenerate, as shown in the band diagram in Fig. 2. However, the two valleys are related by the time reversal symmetry and their energies coincide, in agreement with the Kramers theorem.
We focus our attention on the four (sub)bands in the vicinity of the band gap, which we label by the index m = cb + 1,cb,vb,vb − 1, as shown in Fig. 2. The electron wave function in the K± valley in the mth band has the general form
Ψ(m)±(r) = eiK±ru(m)±(r), | (1) |
The nuclear spins weakly break the translation symmetry of the structure and lead to the splitting and mixing of the states in K+ and K− valleys. The strength of the hyperfine interaction with the nuclear spins is usually of the order of 1 μeV (ref. 30 and 31). This is much smaller than the spin–orbit splittings of the conduction and valence bands in TMD MLs, which are of the order of a few tens of meV and a few hundreds of meV, respectively.2,38 Therefore, the hyperfine interaction can mix only the states with the same energy, i.e. in the same band, but in different valleys, as shown by the wavy arrows in Fig. 2.
The two states in K+ and K− valleys can be interpreted as a valley qubit.39–41 We introduce the valley pseudospin matrices = (x,y,z), see the ESI†, so that K± states correspond to τz = ±1/2, respectively. We note that due to the difference of the Bloch wave vectors K+ and K−, spin is not a good quantum number, so the hyperfine interaction Hamiltonian should be written in terms of the valley pseudospin τ. Taking into account the form of the electron wave functions, eqn (1), the hyperfine interaction Hamiltonian in the mth band is
(2) |
Further symmetry analysis allows us to find the restrictions on the form of the hyperfine interaction tensors. It is most convenient to start the analysis from the C3h group of the wave vector, and then consider the raise of the symmetry up to D3h, the point symmetry of the structure.
The representations corresponding to the standard choice of the origin of the coordinate frame at the center of the hexagon are well established.1,42 But to analyze the hyperfine interaction tensor with the nth nucleus, it is convenient to move the origin of the coordinate frame towards the corresponding nucleus by the two-dimensional vector Rn. Upon this nontrivial translation, the irreducible representations of the wavefunctions Ψ(m)± change. In the ESI† we show that for each symmetry operation g of the wave vector group C3h the matrix of the representation should be multiplied by
e−iK±(Rn−gRn). | (3) |
(4) |
In Table 1 we present the irreducible representations of the electronic states in the C3h group in the bands under study. The representations corresponding to the shifted origin of the coordinate frame can be calculated using the multiplication rules for the C3h group.43 The two representations, corresponding to the two valleys in the same band, are always conjugate, in agreement with the time reversal symmetry.
Band | O | M | X | |||
---|---|---|---|---|---|---|
K ± | K − | K + | K − | K + | K − | |
a The order of bands corresponds to the molybdenum based structures. | ||||||
cb + 1 | Γ9 | Γ10 | Γ8 | Γ7 | Γ12 | Γ11 |
cb | Γ11 | Γ12 | Γ7 | Γ8 | Γ10 | Γ9 |
vb | Γ7 | Γ8 | Γ10 | Γ9 | Γ11 | Γ12 |
vb − 1 | Γ8 | Γ7 | Γ12 | Γ11 | Γ9 | Γ10 |
Now we consider the point symmetry group D3h of the TMD ML. The conjugate representations of the C3h group join in the D3h group in pairs as follows (see the ESI†):
{Γ8(C3h), Γ7(C3h)} → Γ7(D3h), | (5a) |
{Γ10(C3h), Γ9(C3h)} → Γ8(D3h), | (5b) |
{Γ11(C3h), Γ12(C3h)} → Γ9(D3h). | (5c) |
Band | irrep (M) | A M xx /AM | A M yy /AM | A M zz /AM |
---|---|---|---|---|
a Additionally the second column shows the irreducible representations in the D3h group with the center of transformations at a metal atom. | ||||
cb + 1 | Γ7 | 2/7 | −2/7 | −4/7 |
cb | Γ7 | 2/7 | 2/7 | 4/7 |
vb | Γ8 | 0 | 0 | 24/7 |
Vb − 1 | Γ9 | 0 | 0 | 32/7 |
The irreducible representations corresponding to the components of the valley pseudospin can be found from the decomposition of the squares of the self-conjugate representations found above. The multiplication rules are43
Γ7 ⊗ Γ7 = Γ8 ⊗ Γ8 = Γ1 ⊕ Γ2 ⊕ Γ5, | (6a) |
Γ9 ⊗ Γ9 = Γ1 ⊕ Γ2 ⊕ Γ3 ⊕ Γ4. | (6b) |
Now let us turn to the classification of the nuclear spin components. We recall that we perform the symmetry analysis in the point symmetry group D3h with the center of transformations at Rn. The nuclear spin of a metal atom, IM, is a pseudovector, and its components transform according to Γ2 (IM,z) and Γ5 (∓IM,x − iIM,y) representations. The two chalcogen atoms at the two-dimensional coordinate Rn (in the same unit cell) exchange their places under the reflection in the horizontal plane σh. So we introduce the linear combinations of their spins IX = (Iup + Idown)/2 and ΔI = Iup − Idown, where the superscripts “up” and “down” refer to the spins of the atoms above and below the (xy) plane. Under the reflection σh the components of IX transform in the same way as those of IM, while the components of ΔI additionally change the sign. Under the reflection in the vertical plane σv the components of both IX and ΔI transform in the same way as those of IM. As a result the components IX,z and ∓IX,x − iIX,y belong to Γ2 and Γ5 representations, respectively; the component ΔIz belongs to Γ3, while ΔIx and ΔIy, form the basis of the representation Γ6.
Now the symmetry analysis of the hyperfine interaction becomes straightforward. According to the method of invariants the coupling is allowed only between the components of and I, which transform according to the same irreducible representation. For the representations Γ7 and Γ8 corresponding to the coordinate Rn the hyperfine interaction Hamiltonian has the form:
(7) |
In the case of the representation Γ9 the Hamiltonian has the form
(8) |
One can see that the component Azz is symmetry allowed for all atoms in all bands. This means that the nuclear spin polarization along z creates a longitudinal Overhauser field and lifts the Kramers degeneracy of the bands, similarly to an external longitudinal magnetic field. Similarly the valley polarization along the z direction creates a Knight field perpendicular to the monolayer plane, which acts on the nuclei. The difference between Azz in the pairs of bands cb and cb + 1, and vb and vb + 1 is analogous to the longitudinal spin g factor of the charge carriers.44,45
The in-plane hyperfine coupling is allowed only for certain bands and it depends on the coordinates of the nuclei. The phase ϕn in eqn (7) and (8) effectively describes the rotations of nuclear spins, so we call this hyperfine interaction helical. We stress that the helical hyperfine interaction is a direct consequence of the spin–valley locking inherent to TMD MLs. Note also that the second term in eqn (8) describes the noncollinear hyperfine interaction. We discuss the corresponding physical effects in Sec. 4.
The microscopic Hamiltonian of the hyperfine interaction with the nuclei has the form
(9) |
For the sake of simplicity, we limit ourselves to d orbitals at the metal atoms (thus neglecting s orbitals47) and p orbitals at the chalcogen atoms.2,37 Bearing in mind the irreducible representations summarized in Table 1, one finds the form of the Bloch amplitudes in the tight-binding model.2,37 We neglect the mixing between spin-up and spin-down states and obtain
(10a) |
(10b) |
(10c) |
(10d) |
Each pair of functions (10a)–(10d) forms the basis of an irreducible representation of the group D3h, and the two corresponding functions belong to the conjugated irreducible representations of the group C3h, in agreement with eqn (5). Note that in ref. 31, where the hyperfine interaction was studied using first principles, the expression for uvb± was different, which led to some wrong results.
In the vicinity of the nth nucleus the orbital wavefunction has the form
(11) |
(12) |
Taking into account the explicit form of the wave functions, eqn (10), additional restrictions can be obtained for the hyperfine interaction tensors. Indeed, the Hamiltonian (9) cannot change the total electron angular momentum fz = mz + sz by more than 1.‡ The representation Γ9 corresponds to fz = ±3/2, so these states cannot be mixed by the hyperfine interaction. In this case the components Axx and Ayy vanish, in agreement with the general symmetry arguments. Additionally, the wavefunctions at the metal atoms in the upper valence band, vb, have the total angular momenta fz = ±5/2 and hence Axx = Ayy = 0 in this case (see Table 2). As a result the hyperfine interaction in the upper valence band is purely of the Ising type.
The calculation of the matrix elements of the Hamiltonian (9) with the wavefunctions described by eqn (10) and (11) yields the relationship between the in-plane and out-of-plane components of the hyperfine interaction tensors. These results are summarized in the last three columns in Tables 2 and 3. One can see that |Azz| = 2|Axx|, whenever Axx is nonzero. The absolute values of the hyperfine interaction constants in Tables 2 and 3 are determined by
(13) |
The noncollinear term ∝τyΔIz in eqn (8) vanishes in the model described above. In the ESI† we consider the 22 band tight binding model,37 which takes into account all p states at both chalcogen atoms and all d states at metal atoms in the unit cell. This model describes weak mixing between spin-up and spin-down states, so the two Bloch wavefunctions in K± valleys in the bands cb + 1 and vb at the chalcogen atoms above/below the (xy) plane have the form:
(14a) |
(14b) |
(15) |
Interestingly the optical activity of the “dark” excitons in tungsten based TMDs in z polarization42 is also related to the mixing of spin-up and spin-down states. Therefore the dipole moment in z polarization is proportional to the parameter α, but it also requires the overlap between orbitals at the two chalcogen atoms.
First of all we note that the relationships obtained in Sec. 2 are strict and follow only from the symmetry analysis. Therefore the same results can also be applied to any substitutional impurity in TMD MLs.
In TMDs not all the metal and chalcogen isotopes have nonzero nuclear spins. The ones with nonzero nuclear spins are listed in Table 4 together with their abundances (ν) and spins (I). One can see that less than a half of atoms of each type have nonzero spins.
M | ν (%) | I | A (μeV) | |
---|---|---|---|---|
Mo | 95 | 15.92 | 5/2 | −0.57 |
97 | 9.55 | 5/2 | ||
W | 183 | 14.31 | 1/2 | 0.64 |
S | 33 | 0.76 | 3/2 | 0.75 |
Se | 77 | 7.63 | 1/2 | 3.9 |
Te | 123 | 0.89 | 1/2 | −8.3 |
125 | 7.07 | 1/2 |
The values of the hyperfine interaction constants (A) can be calculated using atomistic approaches, for example DFT.31,48,49 The orbitals at molybdenum and tungsten are related mainly to 4d and 5d atomic orbitals, respectively, while the orbitals are related to 3p, 4p and 5p atomic orbitals at sulfur, selenium and tellurium, respectively. The estimations for the hyperfine coupling constants can be obtained from the corresponding values known for the free atoms.50 They are given in the last column in Table 4. We note that the straightforward calculation of the integrals in eqn (13) for the Slater orbitals51 with the effective screening constants52 yields the values an order of magnitude smaller.
From Table 4 one can see that the hyperfine interaction is stronger for chalcogen atoms than for the metal atoms. It is related to the fact that p orbitals correspond to the smaller angular momentum and are more localized at the nuclei than d orbitals. Moreover, as one could expect, separately for metal and chalcogen atoms, the heavier is the atom the stronger is the hyperfine interaction. We note that the spin–orbit splitting of the conduction and valence bands in TMD MLs qualitatively obeys the same rules, which is related to the common relativistic origin of the two effects.
It is instructive to compare the hyperfine interaction parameters with those in the well studied semiconductor GaAs. In GaAs the spin–orbit splitting of the valence band is about 330 meV,53 which approximately equals to the splitting of the two uppermost valence bands in TMD MLs. The hyperfine interaction constants in the valence band of GaAs are of the order of 10 μeV,54,55 which is comparable to those in TMDs. By contrast, the hyperfine interaction in the conduction band of GaAs is an order of magnitude stronger due to the s type of the Bloch amplitudes.56,57 Hence the hyperfine interaction of electrons in TMD MLs is much weaker than in GaAs.
The most important effect related to the hyperfine interaction is the valley polarization relaxation. In large magnetic fields the loss of the polarization is dominated by the electron phonon interaction, but this mechanism predicts infinite spin relaxation time in zero magnetic field.36 In fact the polarization relaxation in small magnetic fields is related to the hyperfine interaction. The timescale related to the hyperfine interaction can be estimated as58
(16) |
In order to describe the valley pseudospin dynamics quantitatively we use the model of Merkulov, Efros and Rosen.58 The polarization relaxation in zero magnetic field consists of two stages. In the first stage, the charge carrier spin precesses in the static fluctuation of the Overhauser field with the frequency Ω, while the nuclear spin dynamics can be neglected. The distribution function of the Larmor precession frequency has the form
(17) |
(18) |
Let us consider the first stage in more detail. Provided that the valley pseudospin is initially oriented along the z direction, the dynamics of this component is described by
τz(t) = τz(0)[cos2(θ) + sin2(θ)cos(Ωt)], | (19) |
Fig. 3 shows this dynamics. The black curve corresponds to the spin relaxation of localized electrons in TMD MLs, where λ = 2 (see Tables 2 and 3). The spin polarization decreases due to the spin precession in the random nuclear field. However, the component of the spin parallel to the nuclear field does not precess, so the fraction f of the initial spin polarization is conserved. For comparison the blue curve corresponds to the electrons in typical GaAs quantum dots, where the hyperfine interaction is isotropic (λ = 1). Compared to this case the spin polarization of electrons in TMD ML quantum dots decays slower, and the larger part of spin polarization is preserved during the first stage. Finally, the red curve corresponds to the localized holes in TMD MLs. It follows from Tables 2 and 3 that in this case Ωx = Ωy = 0, so the spin polarization is constant, and it does not decay due to the hyperfine interaction on the time scale under study.
Fig. 3 Polarization relaxation due to the hyperfine interaction. The spin relaxation of electrons in TMD MLs (black solid curve), holes in TMD MLs (red dashed curve) and electrons in typical GaAs quantum dots is described by the anisotropy parameter λ = 2, ∞ and 1 (see eqn (17) and (19)), respectively. |
The fraction of the initial spin polarization conserved during the first stage can be calculated analytically:30
(20) |
(21) |
The most interesting situation in TMD MLs takes place in the upper valence band. Here, as follows from Tables 2 and 3, λ = ∞ and f = 1. This describes the situation, when the relaxation of the spin z component is absent because of the Ising type of the hyperfine interaction. For chalcogen atoms this is the symmetry requirement, while for metal atoms this results from atomic orbitals, which cannot be mixed by the hyperfine interaction Hamiltonian (9). Therefore, one should expect longest valley coherence times for localized holes in TMD MLs.
On long timescales the valley relaxation of localized holes can be related to (i) two phonon processes,59–61 (ii) single phonon processes in combination with the hyperfine interaction,62 or (iii) mixing of the energy degenerate states by the localization potential.36,63,64 In the latter case, the localization potential should be atomically sharp, e.g. an impurity. Otherwise, the degree of mixing of the two valleys is proportional to the ratio κ of the lattice constant and the localization length. This situation is similar to the one described above with an effective degree of the hyperfine interaction anisotropy λeff ∼ 1/κ.
Apart from the spin relaxation the hyperfine interaction gives rise also to the effect of dynamic nuclear spin polarization.65 This effect consists in the transfer of spin polarization from the charge carriers to nuclei. The dynamic nuclear spin polarization is inefficient in the upper valence band of TMD MLs, but in all the other bands it can be significant.
Dynamic nuclear spin polarization is most pronounced under excitation by circularly polarized light. It orients valley pseudospin along the z axis, and this polarization is transferred to nuclear spins. Uniform nuclear spin polarization creates an Overhauser field, which splits the optical resonance in TMD ML quantum dots, and this slitting can be observed experimentally.
The observation of the helical nuclear spin polarization requires in-plane valley pseudospin polarization. We note that the transverse magnetic field does not mix the free states in the different valleys because of the translational invariance. For charge carriers and charged complexes, however, this restriction is removed. For example, in symmetric quantum dots with the symmetry D3h and with the center at the metal atom the two ground electron states belong to the Γ7 representation, see Table 2. The direct product Γ7 ⊗ Γ7 contains the representation Γ5, which corresponds to the in-plane magnetic field. As a result the magnetic field can lead either to thermal in-plane pseudospin polarization or to the rotation of out-of-plane polarization to the (xy) plane. The in-plane valley pseudospin leads to electron-nuclear spin flip flops and to chiral dynamic nuclear spin polarization. The chiral nuclear spin polarization manifests itself, for example, similar to the transverse nuclear spin polarization in the Hanle effect.65
An example of nuclear spin polarization distribution for τ‖x in the lowest conduction band of molybdenum based TMD MLs is shown in Fig. 1. Notably, the distribution of nuclear spin polarization is nonuniform. It is described by the phases ϕn in eqn (7) and looks like rotating nuclear spins. So we call it the “helical” structure of nuclear spin polarization. Despite the nuclear spin ordering a macroscopic nuclear spin polarization is absent, which is analogous to an antiferromagnetic spin state.
The helical hyperfine interaction is essentially based on valley degeneracy in contrast to many other effects. In fact, it requires two ingredients: (i) valley degeneracy and (ii) strong spin–orbit interaction, which lifts the spin degeneracy in each valley. For example, this effect cannot take place in usual GaAs-based structures, where the valley degeneracy is absent. In SiGe quantum wells the spin–orbit interaction also does not lead to the spin slitting of the valleys,66 and in this case the intervalley hyperfine interaction is hidden by the intravalley interaction. Thus TMD MLs are the most suitable structures for the observation of the helical hyperfine interaction. This observation is also facilitated by the direct band gap of these materials.
Interestingly the D3h symmetry of TMD MLs allows for the noncollinear hyperfine interaction in the bands vb and cb + 1 with chalcogen atoms. Estimations made in the previous section show that Ayz in the band cb + 1 is only 6 times smaller than Azz. For comparison in GaAs quantum dots, the noncollinear term is two orders of magnitude smaller.35 The pronounced noncollinear hyperfine interaction in TMD MLs can lead to effective nuclear spin relaxation in a strong external perpendicular magnetic field (see the discussion above). Indeed, it allows flipping nuclear spins without changing the electron energy by the Zeeman energy. The same processes are also responsible for the “dragging” of the quantum dot resonance frequency by laser light.67 Based on Tables 2 and 3 we predict that this effect is particularly strong for quantum dots based on tungsten dichalcogenides, where the lowest conduction band corresponds to cb + 1 in our notations.
Finally, we note that in the case of charge carrier localization in a two dimensional structure, the many-body nuclear spin effects such as nuclear spin self-polarization68,69 and formation of nuclear spin polaron70,71 can be pronounced. The nuclear spin polaron can also reveal the helical structure of the hyperfine interaction. These effects can be used to additionally increase the valley pseudospin relaxation time and to realize robust control of its orientation, similarly to magnetic skyrmions.72,73
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/c8na00360b |
‡ The last term in eqn (9) being a part of the dipole–dipole interaction can change the electron orbital momentum mz by ±2, but simultaneously the electron spin sz changes by ∓1, so the total angular momentum fz cannot be changed by more than 1, in agreement with the spherical symmetry of the Hamiltonian (9). |
This journal is © The Royal Society of Chemistry 2019 |