[图片]Towards the targeted activation of silent biosynthetic gene clusters by chemical elicitors(诱导剂)AbstractSystematic genome mining has revealed that microbes(微生物) encode numerous uncharacterized secondary metabolite biosynthetic gene clusters (BGCs). The efficient and selective activation of these silent or cryptic(隐性的) BGCs is crucial for the high-throughput(高通量) discovery of novel natural products. Recent influential studies have demonstrated that using small chemical elicitors is a practical and cost-effective(经济高效) method to unlock the secondary metabolic potential of microbes. However, the current approach mainly relies on high-throughput, non-targeted screening methods to discover chemical elicitors capable of activating these silent BGCs. Therefore, this study comprehensively reviews reported cases of small molecules that activate silent BGCs, covering the chemical structures of elicitors, resulting natural products, and target BGCs, thereby constructing an integrated knowledge graph. We also summarize the underlying activation mechanisms. Leveraging(利用) relationships captured(捕获) in this graph, we outline directions for targeted activation of silent pathways using small molecules, thereby facilitating more efficient natural product discovery.1. IntroductionThe discovery of novel natural products holds significant importance across various scientific fields due to their remarkable structural diversity, which leads to a wide range of biological activities and applications, including pharmaceuticals, food, cosmetics(化妆品), and nutraceuticals(营养保健品). With the advancement of next-generation sequencing technologies and the development of computational genome mining, the discovery of natural products has entered a genome-driven era (Navarro-Muñoz et al., 2020; Ziemert et al., 2016). In this era, genomic data is leveraged to uncover the secondary metabolic potential of microorganisms, guiding the discovery of novel natural products (Harvey et al., 2015). Numerous genome mining studies have revealed that the majority of the biosynthetic gene clusters (BGCs) encoded by microorganisms remain untapped(未开发的). For example, Gavriilidou et al. estimated that only about 3% of the natural products encoded in bacterial genomes were experimentally characterized (Gavriilidou et al., 2022). Our recent study indicated that 96.8% of the secondary metabolic potential in marine prokaryotes(原核生物) remains unexplored (Wei et al., 2023).原核生物prokaryotes;真核生物eukaryotes The primary reason for this gap between the natural products that have been characterized and those identified through genome mining is that many BGCs are silent or expressed at low levels under laboratory conditions, thus limiting the production and characterization of these potential natural products (Xu et al., 2019).Many strategies have been used to activate silent BGCs, leading to the discovery of novel secondary metabolites from diverse microorganisms. These strategies are broadly divided into two categories: stimulation(刺激) and genetic activation(遗传激活) (Baral et al., 2018; El-Hawary et al., 2023; Mao et al., 2018; Zong et al., 2022). Stimulation involves using external factors to activate silent BGCs without altering the organism's genetic composition. This category of strategies can be classified into three types: chemical elicitation(诱导), biological elicitation, and environmental perturbation(扰动). When microorganisms are grown in pure culture in vitro, many activating signals are absent, leading to the downregulation of specialized metabolite BGCs. To address this, co-culturing with competing species (biological elicitation) or adding chemical elicitors (chemical elicitation) can induce changes in BGC expression (Rutledge and Challis, 2015). Additionally, altering environmental conditions (environmental perturbation), such as nutrient levels, pH, or oxygen availability, can stimulate BGC activity (Romano et al., 2018). Genetic activation, on the other hand, directly manipulates genetic elements to induce BGC expression. Examples include manipulating global or pathway-specific regulators (Lu et al., 2017; Mao et al., 2017; Otur and Kurt-Kızıldoğan, 2024), reporter-guided mutant selection (Mao et al., 2018), refactoring(重构) (He et al., 2024), and heterologous expression (Kang and Kim, 2021). However, these approaches frequently mandate(要求) intricate manual design and often have low success rates, particularly in heterologous expression because compounds may be biosynthesized through the collective contribution of multiple gene clusters (van Bergeijk et al., 2020). By contrast, chemical elicitation offers simplicity, ease of operation, and good reproducibility. In recent years, high-throughput(高通量) elicitor screening (HiTES) has been successfully employed to activate silent BGCs. HiTES can use various readouts, including transcriptional reporter assays (Xu et al., 2017), as well as genetics-free techniques such as bioactivity assays (bioactivity-HiTES) (Moon et al., 2019b), mass spectrometry detection (MS-coupled HiTES) (Covington and Seyedsayamdost, 2021), and fluorescence resonance energy transfer-based assays (FRET-HiTES) (Han et al., 2023), enabling efficient discovery of elicitors in a multi-well plate format. Over the past decade, chemical elicitation has gained widespread attention and application.In this review, we define a focused scope on studies in which chemical elicitors were explicitly(明确) shown to activate silent or lowly expressed BGCs, leading to the production of cryptic natural products. We note that, in some cases, the addition of small molecules results in the discovery of new structural analogues rather than genuine activation of a silent BGC. In such scenarios, the supplemented compounds often function as metabolic precursors, thereby altering substrate availability within an already active pathway. For example, feeding Streptomyces chartreusis NRRL 3882 with 20 different proteinogenic amino acids revealed that calcimycin production and the distribution of its analogs are strongly dependent on amino acid supply, with glutamine yielding the highest calcimycin levels (Arend and Bandow, 2021). Additional examples of precursor-driven modulation of product biosynthesis have been reviewed elsewhere (Yan et al., 2024). These effects, while valuable for expanding chemical diversity, do not constitute activation of a transcriptionally silent BGC and are therefore not discussed further in this review.While chemical elicitor activation is conceptually simple, the identification of effective elicitors remains a major challenge. Current strategies, such as HiTES, allow for efficient screening of large compound libraries but are largely empirical and lack a strong theoretical foundation. To address this limitation and enable more targeted elicitor selection, we first construct a comprehensive knowledge graph of known elicitors based on data curated(整理) from the literature, providing a structured knowledge base. In parallel, we review and generalize experimental paradigms for elucidating the regulatory mechanisms governing BGC expression, with an emphasis(强调) on omics(组学)-based regulatory networks. Building on these two foundations, we propose a set of guidelines to support the rational selection of small-molecule elicitors. Through this review, we aim to promote the targeted activation of silent BGCs and, ultimately, facilitate the discovery of novel natural products.2. Overview of chemical elicitors in activating silent BGCsDozens of small molecules have been reported to activate silent or poorly expressed gene clusters in microorganisms, leading to the production of new natural products. Here, we construct a comprehensive knowledge graph comprising 32 BGCs, which can be activated or upregulated by 54 small-molecule elicitors, leading to the synthesis or increased production of 87 representative secondary metabolites (Fig. 1 & Table S1). This knowledge graph was manually curated from published literature and designed as a conceptual integration framework. Specifically, elicitor–BGC–metabolite relationships were systematically extracted from peer-reviewed studies reporting experimentally validated activation or significant upregulation of silent or lowly expressed BGCs. Small-molecule elicitors were classified primarily based on their reported mechanisms of BGC activation. Accordingly, elicitors were grouped into three broad classes, including stress-associated elicitors, signaling-associated elicitors, and others that no clear mechanisms are discussed. Among these, stress-associated elicitors, representing the most extensively reported class, were further subdivided according to chemical nature and ecological context, including antibiotics, non-antibiotic clinical drugs, host-related molecules, and chemically modified molecules. This mechanism-oriented classification framework is intended to provide a conceptual guide for sourcing effective elicitors.2.1. Stress-associated elicitorsStress-associated elicitors refer to small molecules that activate silent or weakly expressed BGCs primarily by inducing physiological stress responses. In this context, elicitation arises from global or semi-global perturbations of cellular(细胞的) homeostasis(稳态), including disturbances to redox(氧化还原反应) balance, DNA replication, transcription, translation or metabolic flux.[图片]Fig. 1. A knowledge graph integrating reported chemical elicitors (triangles, n = 54, labeled as E1–E54), their induced BGCs (circles, n = 32), and metabolic products (diamonds, n = 87, labeled as C1–C87), corresponding to entries in Table S1. Elicitors are color-coded by category (stress-associated, signaling-associated, and others). Dashed arrows(虚线) indicate induction relationships, whereas solid arrows denote biosynthetic relationships.2.1.1. AntibioticsAmong stress-associated elicitors, antibiotics applied at sub-inhibitory concentrations represent one of the most extensively studied and experimentally validated classes. A series of HiTES-based studies by Seyedsayamdost group found that the majority of effective elicitors identified were antibiotics, highlighting their important role in activating silent biosynthetic pathways. For instance, the β-lactam(β-内酰胺类) piperacillin (E1) and dihydrofolate reductase inhibitor trimethoprim (E2) were found to activate the mal gene cluster in the model pathogen Burkholderia thailandensis E264. These compounds induced a 10–145-fold overproduction of malleilactone A (C1) and facilitated the discovery of a new analog, malleilactone B (C2). In addition to the mal gene cluster, trimethoprim, as a global activator, also activated at least four biosynthetic pathways. For example, it activated the tha gene cluster of B. thailandensis E264, leading to the production of thailandamide A (C3). Additionally, it induced the hmq gene cluster, leading to a sevenfold increase in the yield of the quinolone antibiotic and quorum-sensing (QS) modulator 4-hydroxy-3-methyl-2-(2-nonenyl)-quinoline (HMNQ, C4). Trimethoprim, as well as the Streptomycete-derived antitumor antibiotic drug mitomycin (E3) and the antimicrobial molecules flumequine (E4), were found to activate the bhc gene cluster in B. thailandensis E264, leading to the production of burkholdac A (C5), a histone deacetylase inhibitor (Fig. 2) (Seyedsayamdost, 2014).[图片]Fig. 2. A representative knowledge graph of stress-associated antibiotic elicitors with their induced BGCs and products. Piperacillin (E1), trimethoprim (E2), mitomycin (E3) and flumequine (E4) partially and collectively activate five BGCs in B. thailandensis E264 and lead to the production or overproduction of 14 secondary metabolites (C1–C14).A follow-up study found that trimethoprim triggered the production of over 100 compounds in B. thailandensis E264 that are not observed under standard growth conditions. Among these, 40 compounds were successfully identified using MS/MS and NMR, including a group of new molecules named acybolins (C6-C14), which originate from the bta gene cluster (Fig. 2) (Okada et al., 2016). Acybolins are bivalent metabolites composed of a bactobolin antibiotic scaffold covalently linked to an acyl chain derived from QS acyl-homoserine lactones (AHLs), a connection that requires a ligase not encoded within the bta cluster. Bactobolin itself is produced by the bta cluster and has been shown to be strictly regulated by AHL-mediated QS (Seyedsayamdost et al., 2010). By contrast, the combined presence of AHLs and trimethoprim elicits the formation of acybolins. This represents a distinct case in which antibiotic-induced stress reshapes the metabolic output of a known BGC, enabling the emergence of cryptic metabolites.Beyond B. thailandensis E264, subinhibitory exposure to antibiotics can also activate silent or lowly expressed BGCs in diverse microbial strains. Notably, several effective elicitors identified to date are antibiotics derived from Streptomyces. For instance, subinhibitory streptomycin (E5), an aminoglycoside antibiotic, induced the tam gene cluster in Microbispora sp. BCCAGE54, promoting a 33-fold enhancement in tetarimycin B (C15) production alongside several putative new tetarimycin analogs (Covington et al., 2018). Although tetarimycin B was detectable at low levels in unstimulated cultures, the corresponding gene cluster is therefore considered to be lowly expressed here. Similarly, actinomycin D (E6), a chemotherapeutic(化疗的) agent, was found through HiTES combined with cytotoxicity assays to activate the tun gene cluster in Streptomyces clavuligerus ATCC 27064, thereby enhancing the production of streptovirudin A2 (C16) and tunicamycins IIA (C17), IIB (C18), G (C19), and V (C20). Actinomycin D also activated the cla gene cluster in the same strain, leading to the production of clavorubin A (C21) (Han et al., 2022). In addition, subinhibitory lincomycin (E7) potentiated the act gene cluster in Streptomyces coelicolor A3(2) and Streptomyces lividans 1326, resulting in pronounced overproduction of the blue-pigmented antibiotic actinorhodin (C22), which is produced at low levels under standard conditions. Additionally, S. lividans 1326 produced several potential novel congeners of calcium-dependent antibiotics (CDAs) only under lincomycin exposure (Imai et al., 2015).2.1.2. Non-antibiotic clinical drugsIn addition to antibiotics, a broad range of non-antibiotic small molecules can also impose physiological stress on microbial cells. Notably, bioactive compounds that are already used clinically are particularly enriched in effective stress-associated elicitors, likely owing to their well-defined molecular targets and pronounced cellular effects. For example, pyronaridine (E8), an antimalarial drug, primarily targets nucleic acids by intercalating into DNA, thereby inducing DNA damage and triggering the SOS and/or oxidative stress. Such stress responses were shown to activate the tun gene cluster in S. clavuligerus ATCC 27064, as described previously. Similarly, etoposide (E9), a chemotherapeutic agent with broad-spectrum antiparasitic activity, has been shown to inhibit bacterial, and particularly actinobacterial, DNA gyrase, thereby inducing the SOS response. Through this stress pathway, etoposide acts as a potent elicitor in Streptomyces albus J1074, strongly activating the sur gene cluster and resulting in the discovery of seven surugamides and seven albuquinones (C23–C36) (Fig. 3 A) (Xu et al., 2017)[图片]Fig. 3. A representative knowledge graph of stress-associated non-antibiotic clinical drugs (A) and host-related molecules (B) with their induced BGCs and products. Etoposide (E9) and p-coumaric acid (E11) activate the sur and rbt gene clusters, leading to the production of surugamides (C23–C36) and roseobactin (C38), respectively.Spermidine (E10) was found to activate the gld gene cluster in Burkholderia gladioli ATCC 10248, resulting in enhanced production of the siderophore gladiobactin (C37). Spermidine is a key component of the intracellular polyamine pool, and its concentration balance with other polyamines, such as putrescine, is vital for microbial physiological state and metabolic regulation. At elevated concentrations, spermidine inhibited the growth of B. gladioli, indicative of a stress response that may underpin(支撑) enhanced gladiobactin biosynthesis. Notably, siderophore(铁载体) production to sequester(螯合) toxic metals is often a hallmark(标志) of oxidative stress response, and high spermidine level has been reported to trigger oxidative stress in E. coli (Yoshimura et al., 2020).2.1.3. Host-related moleculesFrom an ecological perspective, microorganisms are constantly exposed to a wide range of host-derived metabolites that shape their physiological states and metabolic behaviors. These compounds, produced by plants or animals in shared ecological niches, can act as chemical cues or stress signals that influence microbial secondary metabolism. Among such host-related molecules, p-coumaric acid (E11), a ubiquitous(普遍存在的) plant-derived phenolic compound, has emerged as a representative elicitor capable of activating cryptic metabolites. Algal(藻类的) p-coumaric acid was shown to activate the rbt gene cluster in its symbiotic bacterium Phaeobacter inhibens 2.10, leading to the discovery of an unreported siderophore, roseobactin (C38) (Fig. 3B). p-coumaric acid is a senescence(衰老)-associated molecule produced by the microalgal Emiliania huxleyi, and its accumulation marks a shift in microalgal-bacterial symbioses(共生细菌) from mutualistic(互利共生的) to parasitic phases. Meanwhile, p-coumaric acid is toxic to the bacteria at high concentrations but induces oxidative stress response by formation of reactive oxygen species (ROS) and secondary metabolism at sub-lethal doses(亚致死剂量). These findings indicate that algal-derived phenylpropanoids, including p-coumaric acid as well as other compounds such as sinapic acid and ferulic acid, can induce cryptic secondary metabolism in symbiotic bacteria, underscoring(凸显) the potential of ecological interactions for elicitor discovery (Wang et al., 2022a).2.1.4. Chemically modified moleculesChemically modified elicitors represent a class of molecules that are generated through chemical modification of known bioactive compounds, often in the form of structural analogues. Rather than originating from natural or clinical contexts, these molecules are typically derived from pre-existing scaffolds and tailored to modulate microbial physiology. Importantly, this class of elicitors can be explored through rational design strategies, such as systematic structural optimization, to identify analogues with enhanced elicitor potency or selectivity.A representative example is the synthetic elicitor ARC2 (E12), an analogue of triclosan with a related mode of action. ARC2 activates the production of multiple specialized metabolites in Streptomyces. For instance, it activates the gcs gene cluster in Streptomyces coelicolor M145, resulting in a 3-fold increase in germicidins A–C (C39–C41) while decreasing the yields of prodiginines and the daptomycin-like CDA (Fig. 4). ARC2 also activates the des and dps gene clusters in Streptomyces pristinaespiralis ATCC 25486 and Streptomyces peucetius 27,952, enhancing the production of desferrioxamine B (C42) and E (C43) as well as doxorubicin (C44) and baumycin (C45) (Craney et al., 2012). Although ARC2 exhibits dose-dependent antibacterial toxicity at high concentrations, it primarily inhibits the enoyl reductase FabI, thereby blocking fatty acid biosynthesis. Inhibition of FabI induces a cellular stress response that depends on the global regulators AfsR and AfsS, linking ARC2-mediated stress to the activation of secondary metabolism (Calvelo et al., 2021a).[图片]Fig. 4. A representative knowledge graph of stress-associated chemically modified molecules with their induced BGCs and products. ARC2 (E12) activates three BGCs and lead to the production of seven secondary metabolites (C39–C45).Building on this scaffold, Cl-ARC (E13), a chemically modified derivative of ARC2, was shown to selectively enhance yields of secondary metabolites but has relatively little effect on the rest of the metabolomic profile. Because its molecular target, FabI, is highly conserved across Actinobacteria, Cl-ARC functions as a broadly effective elicitor in most Streptomyces species. Specifically, Cl-ARC activates the oxo, smd, and non gene clusters, increasing the yields of oxohygrolidin (C46) from S. ghanaensis ATCC 14672, 9-methylstreptimidone (C47) from S. hygroscopicus ATCC 53653, and nonactin (C48), monactin (C49), dinactin (C50), and trinactin (C51) from strain WAC0256 (Pimentel-Elardo et al., 2015).2.2. Signaling-associated elicitorsSignaling-associated elicitors represent a distinct class of elicitors that induce silent or weakly expressed BGCs by mimicking(模拟) or transmitting ecologically relevant signals rather than imposing physiological stress. These elicitors are often derived from host organisms or neighboring species and are sensed through dedicated regulatory systems, enabling microorganisms to coordinate secondary metabolism with specific ecological contexts.Zhang et al. utilized HiTES combined with matrix-assisted laser desorption/ionization mass spectrometry (MALDI-HiTES) and revealed that plant-derived amygdalin (E14), sarsasapogenin (E15), and homobutein (E16) in flask cultures of Streptomyces ghanaensis ATCC 14672 activated the cip gene cluster, resulting in a 4–10-fold increase in production of a novel non-ribosomal peptide, cinnapeptin (C52) (Zhang and Seyedsayamdost, 2020). In a related study, Li et al. demonstrated that two adjacent LuxR-type regulators in the gla cluster from Streptomyces globisporus C-1027, which is homologous to the cip cluster and also directs the biosynthesis of cinnamoyl moiety-containing compounds, function as cluster-situated positive regulators (Li et al., 2022). Notably, homologous LuxR-type regulators are also present in cip cluster. Based on these observations, we speculated that, as a plant-associated bacterium, S. ghanaensis may sense these plant-derived metabolites via these two LuxR-type regulators, thereby relieving repression of the silent BGC and initiating cinnapeptin biosynthesis as an ecologically grounded signaling response. However, this hypothesis requires further experimental validation.Vitamin B3 niacin (E17) was identified as a highly specific elicitor of the tqq gene cluster in Streptococcus suis ATCC 43765, whose activation led to the discovery of threoglucin A (C53), characterized by an aliphatic ether bond forming a substituted 1,3-oxazinane heterocycle within the peptide backbone (Covington and Seyedsayamdost, 2022). Previous evidence has indicated that Streptococcus species are capable of responding to extracellular niacin. In particular, the presence of NiaR has been identified in S. suis. NiaR is a niacin-responsive transcriptional repressor that regulates genes involved in nicotinamide adenine dinucleotide (NAD) metabolism (Rodionov et al., 2008). NiaR-mediated regulation links extracellular niacin availability to the coordinated expression of multiple genes involved in cellular metabolism (Afzal et al., 2017). From an ecological perspective, S. suis is a taxing agricultural pathogen and zoonotic agent, whereas niacin is an endogenous human metabolite. Therefore, niacin is likely an environmental signal that S. suis readily encounters and may use to modulate metabolic processes. Additionally, in the genomic context, the tqq gene cluster is located adjacent to an shp/rgg QS operon, suggesting that QS–related regulatory mechanisms may also be involved (Covington and Seyedsayamdost, 2022).Using Agar-HiTES, Lee et al. identified several steroids as the most effective elicitors in the plant fungal pathogen Sclerotinia sclerotiorum Ss-1. Among these, the glucocorticoid prednisone (E18) specifically activated the scl gene cluster, leading to the production of a previously uncharacterized metabolite, sclerocyclane (C54). Further studies revealed that the plant-derived steroids β-sitosterol and brassinolide, which are structurally similar to prednisone, also function as effective elicitors. Concurrently, the induced metabolite sclerocyclane exhibits antibiotic activity against Burkholderia plantarii, an ecological competitor of S. sclerotiorum. These findings suggest that secondary metabolites may be induced in regions of active plant growth where steroids, acting as host- or environment-associated signaling molecules, are released, thereby shaping ecological interactions (Lee and Seyedsayamdost, 2022).Similarly, Han et al. applied FRET-HiTES to Streptomyces clavuligerus ATCC 27064 and discovered that several steroids, including testosterone (E19), estrone (E20), cortisone (E21), danazol (E22), and 11α-hydroxyprogesterone (E23), activated the clv gene cluster, leading to the production of cryptic enediyne natural products, termed clavulynes A − J (C55–C64) (Fig. 5). This study suggests that steroids can play important roles in bacteria, functioning as signaling agents or carbon sources. Elicitation in S. clavuligerus may involve regulation of signal transduction pathways and/or an additional nutrient supply (Han et al., 2023).[图片]Fig. 5. A representative knowledge graph of signaling-associated elicitors with their induced BGCs and products. Five steroids (E19–E23) activate the clv gene cluster and lead to the production of clavulynes (C55–C64).2.3. OthersBeyond the elicitors discussed above, many additional bioactive compounds have been reported to activate cryptic BGCs, although their mechanisms of action remain largely unexplored. A growing number of studies have demonstrated that diverse bioactive small molecules, many of which are clinically used drugs, can function as effective elicitors for cryptic BGCs when interrogated using HiTES-based strategies.Using HiTES, Seyedsayamdost first showed that in Burkholderia thailandensis E264, the bhc gene cluster could be activated by the DNA topoisomerase I inhibitor β-lapachone (E24), and the nuclear factor-kappa B (NF-κB) inhibitor BAY 11–7082 (E25), leading to efficient synthesis of burkholdac A (C5) (Seyedsayamdost, 2014).Subsequently, Xu et al. employed a genetics-dependent reporter-based HiTES to demonstrate that the broad-spectrum antiparasitic ivermectin (E26) strongly activated the sur gene cluster in S. albus J1074, leading to the production of surugamides and albuquinones (C23–C36) (Xu et al., 2017). They later developed a genetics-free strategy, HiTES in conjunction with imaging mass spectrometry (IMS-HiTES), to identify additional elicitors. Using this approach, kenpaullone (E27), a GSK3β inhibitor used for the treatment of chronic immune (idiopathic) thrombocytopenia, was identified as the most effective elicitor of the can gene cluster in Streptomyces canus NRRL B3980, inducing two novel cryptic lassopeptides, canucins A (C65) and B (C66). In contrast, dihydroergocristine (E28), a γ-secretase inhibitor clinically used to delay progressive mental decline in conditions such as Alzheimer's disease, activated the ker gene cluster in Amycolatopsis keratiniphila NRRL B24117, resulting in the production of seven glycopeptides, including four keratinimicins (C67–C70) and three keratinicyclins (C71–C73) (Xu et al., 2019).Using bioactivity-coupled HiTES, Moon et al. identified the clinical diuretic furosemide (E29) and the cholesterol-lowering agent fenofibrate (E30) as the most effective elicitors for Saccharopolyspora cebuensis SPE 10–1. Both compounds activated the ceb gene cluster, leading to the production of a novel lanthipeptide, cebulantin (C74), which exhibited selective antibacterial activity against Gram-negative bacteria (Moon et al., 2019a). Using the same approach, Moon et al. identified the β-blocker atenolol (E31) as an effective global elicitor for Streptomyces hiroshimensis A18. Atenolol activated the hrs and tfl/tox gene clusters, thereby inducing the production of a novel naphthoquinone epoxide hiroshidine (C75), a new toxoflavin-type analog taylorflavin A, (C76), and several known metabolites, including taylorflavin B (C77). Notably, both taylorflavins exhibited selective growth-inhibitory activity against Gram-negative bacteria, particularly Escherichia coli and Acinetobacter baumannii (Moon et al., 2019b).To streamline(简化) the analysis of HiTES-derived metabolomics data, Covington et al. developed MetEx, a metabolomics explorer for LC–MS-based HiTES workflows. Application of MetEx to Burkholderia gladioli ATCC 10248 enabled identification of multiple bioactive elicitors. Specifically, the β-lactam antibiotic oxacillin (E32), the calcium-channel blocker verapamil (E33), the antihypertension(抗高血压) agent methyldopa (E34), the imidazoline decongestant naphazoline (E35), and the anti-inflammatory drug nabumetone (E36) were found to activate the gld gene cluster, leading to the production of gladiobactin (C37). The oxacillin (E32) and the chemotherapeutic agent azacitidine (E37) activated the gbn gene cluster, inducing gladiolin (C78) biosynthesis. Meanwhile, anticancer agents bleomycin (E38) and carboplatin (E39) activated the ico gene cluster, strongly enhancing icosalide (C79) production. Finally, the synthetic corticosteroid desoximetasone (E40) and the HIV protease inhibitor ritonavir (E41) activated the bgd gene cluster, serving as potent elicitors for burriogladin A (C80) (Covington and Seyedsayamdost, 2021).Lee et al. extended the HiTES to solid media by developing Agar-HiTES, enabling systematic elicitor screening in fungi and bacteria. Using this approach, they identified the antidepressant isocarboxazid (E42), and the antibiotics paromomycin (E43) and sulfanilamide (E44) as the most effective elicitors of the plant fungal pathogen(病原体) Sclerotinia sclerotiorum Ss-1, where these compounds activated the scl gene cluster and induced the production of a novel metabolite sclerocyclane (C54). In Rhizoctonia solani AG-3, the calcium-channel blockers nisoldipine (E45) and nifedipine (E46), together with the antifungal clotrimazole (E47), potently activated the sol gene cluster, with nisoldipine yielding 12-fold overproduction of two new siderophores, solanibactins A and B (C81–C82) (Lee and Seyedsayamdost, 2022). Applying Agar-HiTES to Burkholderia species, Lee et al. further showed that in B. plantarii ATCC 43733, plant-derived tropane alkaloids, including ipratropium bromide (E48) and atropine (E50), as well as the antimigraine drug zolmitriptan (E49), robustly induced the bet cluster, leading to 12–15-fold enhanced production of burkethyl A (C83) and B (C84) at 90 μM ipratropium bromide. In B. gladioli ATCC 10248, chemically diverse elicitors, including the synthetic steroid fluocinolone (E51), the antibiotics imipenem (E52) and rifapentine (E53), and the HIV protease inhibitor saquinavir (E54), markedly altered secondary metabolism and triggered the production of modified gladiobactins [hydroxylated B/C (C85-C86) and glycosylated D (C87)] (Lee et al., 2025).3. Mechanisms of elicitor-mediated BGC activationA comprehensive understanding of the regulatory mechanisms underlying BGCs is essential for the targeted activation of silent clusters using chemical elicitors. Because BGC expression is typically orchestrated(协同调控) by complex, multilayered regulatory networks, network-based models provide a systematic framework for deciphering(解析) these processes. Such models can illuminate the interactions among diverse regulators and thereby offer theoretical guidance for designing effective elicitor-driven strategies to activate silent BGCs. In this section, we outline the principles of regulatory networks and discuss their utility in elucidating elicitor-mediated BGC activation.3.1. Regulatory network models of BGC expressionBGC expression is governed by intricate regulatory networks, which encompass interactions among DNA, RNA, proteins, and small molecules. The collective set of these interactions is often referred to as the interactome (Fig. 6A) (Luck et al., 2016; Sanchez et al., 1999). Among them, the typical gene regulatory network (GRN) models, transcription factor–target gene (TF–TG) interactions, which are often visualized as directed graphs where nodes represent genes or proteins and edges represent regulatory relationships (Fig. 6B) (Babu et al., 2004; Seshasayee et al., 2006). However, TF-based models alone cannot fully capture the complexity of BGC regulation, since gene expression is also influenced by epigenetic modifications (EMs)(表观遗传修饰), post-transcriptional regulation (PTR) [such as RNA-RNA interactions (RRIs)] (Silverman and Melamed, 2025), post-translational modifications (PTMs) [such as protein-protein interactions (PPIs)] (Wang et al., 2022b), and other regulatory pathways. Extended GRNs therefore integrate additional regulatory components, including independently acting molecules such as cofactors, sigma factors, small RNAs (sRNAs) (Sesto et al., 2013; Waters and Storz, 2009), as well as the cis-regulatory elements (CREs) and riboswitches embedded within TGs or mRNA. In addition, extended GRNs encompass regulatory components associated with modification-based mechanisms, which commonly involve EMs, PTR, and PTMs (Fig. 6B). Together, these regulatory components provide a more complete view of BGC control (Augustijn et al., 2024; Badia-i-Mompel et al., 2023).[图片]Fig. 6. Conceptual frameworks for regulatory and induction mechanisms governing BGC expression. (A) Key cellular components and their interactions. (B) A typical TF-centered GRN and an extended GRN incorporating multi-level regulatory mechanisms. (C) Major classes of elicitor-induced regulatory mechanisms underlying BGC activation.GRNs typically exhibit a modular structure, consisting of recurring regulatory motifs (e.g., feed-forward loops [FFLs], single-input modules [SIMs], multi-input modules [MIMs]) as well as higher-order functional modules (e.g., stress response, metabolism). Global regulators (GRs) govern broad cellular functions, whereas cluster-situated regulators (CSRs) fine-tune(精细调控) the expression of individual BGCs. Additional layers of regulation, including various post-transcriptional and post-translational mechanisms, further contribute to the dynamic and multilayered nature of GRNs (Fig. 6B). Recent advances have facilitated the integration of GRN concepts into BGC studies. For instance, the antiSMASH7 now includes a transcription factor binding site (TFBS) prediction module, enabling the identification of key regulatory elements (Blin et al., 2023). Likewise, a recent review by Wezel and Medema emphasized the potential of GRN-based approaches to accelerate natural product discovery in Actinobacteria (Augustijn et al., 2024). Collectively, applying GRN models is expected to provide a rational framework for systematically analyzing BGC regulation and for guiding the selection of effective elicitors.3.2. Deciphering elicitor-mediated BGC activation through regulatory networksAs outlined in Section 3.1, activation of BGC expression can be systematically interpreted(阐释) within the framework of regulatory networks. Based on existing studies, we summarize four major classes of elicitor-associated regulatory mechanisms, including oxidative stress–driven regulation, SOS response–mediated regulation, ribosomal perturbation–driven regulation, and metabolic stress-driven regulation (Fig. 6C). Although the molecular targets of these elicitors vary widely, their effects are ultimately perceived and processed through specific regulatory pathways, which converge on gene regulatory networks to activate the corresponding BGCs. It should be noted that the depth of mechanistic understanding differs substantially among these categories, and additional regulatory mechanisms may exist beyond those currently recognized. Further investigation will be required to achieve a more comprehensive understanding of elicitor-mediated BGC activation.3.2.1. Oxidative stress-driven regulationIn the induction cases discussed in the previous section, a significant activation is associated with bacterial stress states, such as oxidative stress. Here, we first clarify the mechanisms related to stress, and then illustrate the mechanism by which BGC is induced using specific examples. Stress states induced by exogenous small molecules are mainly manifested as oxidative stress or DNA damage–triggered SOS responses, and these two processes are partially interconnected. Under normal conditions, bacterial cells maintain redox(氧化还原) homeostasis(稳态), in which metabolically generated ROS are balanced by antioxidant systems to sustain a stable intracellular redox state. However, interference by exogenous molecules can disrupt this balance, driving cells into a stress state that may result in outcomes such as apoptosis-like cell death(类细胞凋亡), autophagy(细胞自噬), lipid peroxidation(脂质过氧化), and DNA damage (Hong et al., 2024). Meanwhile, multiple peroxide-sensor TFs, including OxyR, PerR, and SoxRS, are directly activated by ROS, thereby initiating complex downstream transcriptional programs that rewire cellular metabolic networks (Imlay, 2015). Accumulating evidence indicates that these stress conditions activate stress-responsive transcription factors and lead to extensive changes in gene expression, highlighting the systems-level impact of stress-responsive regulatory networks (Guo et al., 2024; Seo et al., 2015).It has been proposed that many bactericidal antibiotics promote bacterial cell death not only through their specific drug–target interactions but also by inducing intracellular ROS, a hypothesis originally put forward by Kohanski et al. and widely discussed thereafter (Kohanski et al., 2007). Although this antibiotic–ROS–cell death model has remained controversial, accumulating evidence suggests that ROS can contribute to antibiotic-mediated cytotoxicity in a strain- and context-dependent manner. Based on this logic, low or sublethal doses of antibiotics may induce detectable ROS levels and engage stress-responsive pathways, such as redox-sensing TFs or the SOS response, thereby reprogramming global transcriptional and metabolic networks and potentially activating silent or weakly expressed BGCs. This notion is consistent with the pattern reported by the Seyedsayamdost group (Davies et al., 2006; Okada and Seyedsayamdost, 2017), and aligns with the concept of hormesis (Calabrese and Baldwin, 2003). For instance, Seyedsayamdost et al. demonstrated that the antifolate antibiotic trimethoprim and the β-lactam piperacillin upregulated five target genes in the malleicyprol (mal) gene cluster of Burkholderia thailandensis E264 (Seyedsayamdost, 2014). Li et al. further showed that piperacillin activates the mal gene cluster by inducing oxidative stress involving redox-sensing GRs. At low doses, piperacillin generates ROS, which trigger redox-sensitive TFs OxyR and SoxR. These TFs subsequently activate MalR, the CSR of the mal cluster (Fig. 7A) (Li et al., 2021). Meanwhile, as a LuxR-type transcriptional regulator, MalR activates the mal cluster in an AHL-independent manner by binding to a lux box-like element in the mal promoter, differently from the canonical AHL-based QS mechanism (Truong et al., 2015). Similarly, trimethoprim, a global regulatory elicitor, activates multiple metabolic pathways in B. thailandensis E264 through global regulatory mechanisms. It upregulates malR and thereby enhances mal expression (Okada et al., 2016). In parallel, it significantly boosts thailandamide production (8- to 36-fold), accompanied by strong induction of biosynthetic genes thaH (polyketide synthase, +26-fold) and thaR (oxidoreductase, +17-fold) (Seyedsayamdost, 2014). While the precise regulatory intermediates remain unknown, these findings illustrate how elicitors exploit global stress-response modules within GRNs to activate BGCs.[图片]Fig. 7. Representative regulatory mechanisms of chemical elicitors in activating silent BGCs, including (A) oxidative stress–driven regulation, (B) SOS response–driven regulation, (C) ribosomal perturbation–driven regulation, and (D) metabolic stress–driven regulation.Natural metabolites can function as elicitors by activating stress-related GRN modules. For example, the algae-derived compound p-coumaric acid induces the production of the siderophore roseobactin in Phaeobacter inhibens, increasing the expression of roseobactin biosynthetic genes by 3.3- to 6.7-fold. Concurrently, it upregulates multiple antioxidant and stress-response enzymes by 4.3- to 13.7-fold and enhances the expression of genes associated with cellular repair systems by 3.3- to 14.9-fold. These effects support a model in which p-coumaric acid triggers ROS formation, activating oxidative stress-response modules within the GRN. The induced stress response facilitates repair of damaged biopolymers and promotes secondary metabolite biosynthesis, while cellular energy may be conserved by downregulating non-essential processes, such as motility and flagellar synthesis, thereby redirecting resources toward roseobactin production (Fig. 7A) (Wang et al., 2022a).3.2.2. SOS response-driven regulationIn addition to ROS, some exogenous small molecules act as direct DNA-damaging agents, a phenomenon commonly encountered in natural environments, and can induce DNA lesions, thereby triggering the SOS response (Maslowska et al., 2019). The SOS response is regarded as one of the most fundamental and evolutionarily conserved global transcriptional regulatory systems in prokaryotes, specifically dedicated to coping with severe DNA damage. At the core of this system lies the coordinated action of two key regulators, LexA and RecA, which function as a transcriptional repressor and a damage sensor, respectively (Lima-Noronha et al., 2022). Under normal growth conditions, LexA binds as a dimer to specific DNA sequences known as SOS boxes located in the promoter regions of genes within the SOS regulon. This binding effectively prevents the recruitment of RNA polymerase, thereby maintaining SOS genes at low basal expression levels. However, when cells are exposed to stressors that lead to the accumulation of single-stranded DNA (ssDNA), RecA is rapidly recruited to ssDNA and forms an activated ATP-dependent nucleoprotein filament, referred to as RecA*. Beyond its role in recombinational repair, RecA* acts as a co-protease for LexA, inducing LexA autoproteolysis. The cleavage of LexA results in loss of its promoter-binding affinity, thereby derepressing the transcription of numerous genes involved in DNA damage tolerance and repair, as well as global transcription reprogramming (Podlesek and Žgur Bertok, 2020).In such contexts, elicitors that induce DNA damage and activate the SOS response have the potential to trigger the expression of silent BGCs. A representative example is the activation of the sur gene cluster by the clinical DNA-damaging agent etoposide. Etoposide has been shown to inhibit actinobacterial DNA gyrase. At low concentrations, etoposide strongly induces the expression of recA and lexA in the SOS response pathway (Fig. 7B). Subsequent RT-PCR and gene knockout analyses revealed that etoposide downregulates SurR, a pathway-specific transcriptional repressor, thereby derepressing the cluster and enhancing the production of cryptic metabolites. Collectively, this case illustrates that elicitor-mediated activation of the SOS response can establish a global regulatory context in which relief of pathway-specific transcriptional repression enables BGC activation, linking DNA damage responses with secondary metabolism (Xu et al., 2017). In addition, mitomycin and bleomycin, as mentioned in the preceding examples, have also been reported to induce the SOS response in E. coli (Wei et al., 2001; Xu et al., 2012). However, their mechanisms of action as elicitors have not yet been experimentally examined.3.2.3. Ribosomal perturbations–driven regulationSome elicitors, particularly translation-inhibiting antibiotics, can rewire cellular metabolic networks by perturbing ribosomal homeostasis. Ochi and co-workers discovered that mutations in ribosomal proteins can profoundly reshape cellular metabolic phenotypes in Streptomyces lividans (Shima et al., 1996), an observation that later gave rise to the concept of ribosome engineering. In this strategy, mutations in ribosomal components or RNA polymerase (RNAP) are introduced to reprogram transcriptional and translational states (Zhu et al., 2019). These findings established the ribosome and RNAP as regulatory targets for inducing cryptic metabolites. Perturbation of ribosome or RNAP often disrupts translational dynamics, leading to ribosomal pausing or stalling. Such disturbances are rapidly sensed by stress-responsive regulatory networks, such as WhiB7/WlbC TFs, which coordinate ribosome rescue, translational reprogramming, and broader metabolic adaptation, ultimately leading to the activation of silent BGCs (Hurst-Hess et al., 2023; Lee et al., 2020).A representative example of this ribosome-centered elicitation mechanism is provided by lincomycin. The Hosaka group identified lincomycin at subinhibitory concentrations as an effective elicitor for actinorhodin biosynthesis in S. coelicolor A3(2) and further elucidated its regulatory mechanisms (Imai et al., 2015). Lincomycin, a ribosome-targeting antibiotic, temporarily inhibits ribosome function by binding to the peptidyl transferase loop of 23S rRNA in 70S ribosomes. This perturbation elicits a rapid response mediated by the WblC regulatory network (Mukai et al., 2023). As a master regulator, the TF WblC coordinates lincomycin resistance and ribosome reconstruction by upregulating genes that encode ABCF proteins (facilitating lincomycin efflux), HflX proteins (ribosome-splitting factors that resolve stalled ribosomes), and 23S rRNA methyltransferase (structurally protecting ribosomes against inhibition) (Ishizuka et al., 2018). In the context of WblC regulation, reduced 20S proteasome activity stabilizes ribosome-associated proteins and enhances translational efficiency. This multilayered regulation promotes the downstream expression of the CSR actII-ORF4, ultimately increasing actinorhodin production (Fig. 7C). Moreover, streptomycin, described in the above cases as an elicitor, can be classified as a ribosome-targeting antibiotic, and its elicitor mechanism remains unreported (Lin et al., 2018).3.2.4. Metabolic stress-driven regulationPrimary metabolism comprises the core biochemical pathways that sustain cellular growth, energy production, and structural integrity. Because these pathways operate at high flux and are deeply interconnected, their activities are highly sensitive to perturbation. Certain small molecules can directly target and inhibit key enzymes within primary metabolic pathways, disrupting metabolic flux balance and homeostasis, giving rise to the metabolic stress. Cells respond to this stress by engaging global TFs, leading to the reprogram of gene expression and metabolic flux.The ARC2 compound series provides an example of a metabolic stress-driven regulation, in which elicitors redirect metabolic flux by inhibiting core primary metabolic pathways. Due to its structural similarity to triclosan, a known FabI inhibitor, validation experiments and gene complementation studies confirmed that ARC2 functions as an effective inhibitor of the FabI enoyl reductase in fatty acid biosynthesis (Craney et al., 2012). Subsequent studies demonstrated that the ARC2-induced metabolic response depends on the global TFs AfsR and AfsS (Calvelo et al., 2021b). Notably, there is currently no evidence that AfsR or AfsS act as direct sensors of metabolic stress; instead, they are generally regarded as global regulators of antibiotic production in Streptomyces (Yan and Xia, 2024). As a regulatory outcome, inhibition of fatty acid biosynthesis diverts shared precursors such as acetyl-CoA and malonyl-CoA toward polyketide and siderophore production, including actinorhodin, germicidin, doxorubicin, baumycin, and the acetyl-CoA-dependent desferrioxamine, while reducing the yields of fatty acid-containing metabolites such as prodiginines and CDAs (Fig. 7D). Additionally, as mentioned above, elicitor trimethoprim inhibits folate metabolism, thereby blocking one‑carbon metabolism and nucleotide biosynthesis, and can therefore be regarded as a form of metabolic stress (Quinlivan et al., 2000).4 生物合成基因簇的靶向激活:诱导剂筛选指南结合上文整合的知识图谱与作用机制研究,本文提出一套筛选策略,用于靶向激活沉默或低表达的生物合成基因簇(BGCs)。该系列策略并非绝对化的既定规则,而是依托生物信息学与化学信息学理论、以知识为导向的经验性方法。由于不同微生物的调控通路架构差异极大,并非任意菌株都能实现生物合成基因簇的精准靶向激活。因此,下文将依据策略的可靠性由高至低依次阐述:从基于保守共有调控机制的经典手段,到依托同源序列推演的探索性方案。本研究构建的策略框架旨在为诱导剂筛选提供实践参考,助力发掘更多可通过诱导剂激活的生物合成基因簇调控案例。4.1 基于调控机制的诱导剂筛选依托作用机制或表型效应,可高效发掘功能性诱导剂。例如,李等人研究发现,泰国伯克霍尔德菌 E264 菌株中,活性氧(ROS)水平与mal 生物合成基因簇的表达量存在显著相关性。基于该结论,团队采用多种经典活性氧诱导剂(白花丹醌、百草枯、甲萘醌),成功激活该基因簇(Li 等,2021)。结合前文梳理的分子机制,诱导剂与微生物的互作规律可为诱导剂筛选提供重要理论依据:不同类型诱导剂通过特异性通路重塑微生物调控网络,引发代谢重编程,进而调控次级代谢。具体而言,能够诱导胞内活性氧显著累积的小分子化合物,是筛选诱导剂的合理候选。以 β- 内酰胺类抗生素为例,氨苄西林、美罗培南、头孢曲松等均可优先作为备选诱导剂。传统研究表明,该类药物通过结合转肽酶、阻碍细菌细胞壁肽聚糖交联发挥抑菌作用;后续研究进一步证实,细胞壁结构紊乱会加剧细胞呼吸作用,最终导致胞内活性氧水平异常升高(Dwyer 等,2014;Van Acker 等,2016)。除小分子物质外,非小分子处理方式同样可诱导活性氧生成(Memar 等,2018),本文不作赘述。同理,喹诺酮类等DNA 损伤型抗生素(萘啶酸、环丙沙星、诺氟沙星、莫西沙星),通过抑制 DNA 旋转酶与拓扑异构酶 Ⅳ 阻断 DNA 复制,造成基因组损伤,进而激活细菌 SOS 应激响应通路(Bush 等,2020;Dörr 等,2010;Newmark 等,2005)。因此,喹诺酮类药物可作为靶向 SOS 通路的候选诱导剂。以核糖体干扰为作用靶点、靶向翻译过程的抗生素,是一类极具应用价值的诱导剂。该类物质直接破坏核糖体功能,已被广泛证实可有效激活放线菌(链霉菌、分枝杆菌等)沉默基因簇,代表药物包括氨基糖苷类(链霉素、卡那霉素、庆大霉素)、四环素、红霉素、氯霉素等(Krause 等,2016;Lin 等,2018;Morris 等,2005)。除翻译抑制类抗生素外,其他可触发核糖体修复通路的小分子也具备诱导潜力:如烷基化试剂可损伤信使 RNA(Thomas 等,2020);嘌呤霉素通过阻断多肽链延伸,诱发转录提前终止(Aviner,2020),二者均有望成为新型诱导剂(Keiler,2015)。从代谢应激角度出发,扰乱核心初级代谢通路的小分子化合物,同样可纳入诱导剂筛选范围。亚油酸会破坏微生物中心代谢与氧化还原稳态,引发强烈代谢胁迫(Senizza 等,2020);浅蓝菌素与硫乳霉素可抑制脂肪酸合成,扰乱脂质代谢流向,限制微生物生长(Price 等,2001);碘乙酸与碘乙酰胺通过抑制 3 - 磷酸甘油醛脱氢酶,阻断糖酵解通路、损伤能量代谢,诱发糖代谢应激(Schmidt,2009);膦胺霉素靶向 MEP(甲基赤藓醇磷酸)途径,抑制 1 - 脱氧 - D - 木酮糖 - 5 - 磷酸还原异构酶,阻碍类异戊二烯前体合成并诱导代谢压力(Zhang 等,2011)。以上案例表明,依托分子机制系统性挖掘现有文献,能够高效筛选出尚未被开发的新型诱导剂。4.2 基于结构与活性关联的诱导剂筛选化合物结构相似度越高,作用机制通常越保守,这一规律已得到广泛验证。例如,结构高度相似的第三代、第四代头孢菌素,均能有效激活 mal 生物合成基因簇(Li 等,2021)。受此启发,可基于已报道诱导剂的化学结构,通过塔尼莫托系数量化分子结构相似度,从化合物库中快速筛选潜在同源诱导剂,这类结构类似物往往具备激活同源生物合成基因簇的潜力。本研究以知识图谱中收录的 54 种已验证诱导剂为基础,设定塔尼莫托系数阈值为 0.6,在包含 11000 余种临床药物的 DrugBank 数据库中,筛选得到 38 种已知诱导剂的 118 个结构类似物(图 8A、附表 S2)。这些类似物涵盖多类分子家族:脂质及类脂质分子 37 种、苯丙烷与聚酮类化合物 14 种、苯环类物质 12 种、有机酸及其衍生物 10 种,其余类别化合物零散分布。深入解析诱导剂构效关系后,可进一步优化化合物聚类标准,大幅提升新型诱导剂的筛选精准度。例如,甲氧苄啶(E2)可激活 bhc、mal、bta、hmq、tha 等多个基因簇,其结构类似物(溴莫普林、四氧普林、2,4 - 二氨基 - 5-(3,4,5 - 三甲氧基苄基) 嘧啶鎓)大概率具备相同激活效果,但其对次级代谢产物合成的调控作用仍需实验验证。同理,哌拉西林(E1)的类似物(美洛西林、阿洛西林)可作为激活 mal 基因簇的备选诱导剂;丝裂霉素(E3)的 N - 甲基衍生物 —— 普卡霉素,是靶向 bhc 基因簇的高效潜在化学诱导剂(图 8B)。[图片]图 8 图例说明(A) 38 种已知诱导剂与从 DrugBank 数据库中筛选得到的 118 个结构类似物的化学相似性网络。该网络中,化合物之间的连线代表二者塔尼莫托系数>0.6。可被诱导剂激活的 27 个生物合成基因簇(以三角形表示)也通过虚线箭头纳入网络分析;矩形色块的颜色代表药物的结构分类。(B) 从数据库中筛选获得、对应诱导剂 E1–E3 的 6 种结构类似物化学结构式,核心骨架颜色与其所属结构分类保持一致。4.3 基于生物合成基因簇同源性的诱导剂筛选微生物基因组中仍存在大量未被解析的生物合成基因簇。本研究中已报道的 32 个可激活生物合成基因簇,序列差异度极高;在 BIG-SCAPE 软件默认阈值 0.3 的分析条件下,这 32 个基因簇均被判定为独立单系基因簇,无近缘聚类关系。与此同时,同源生物合成基因簇通常可被同一种化学诱导剂激活。利用已验证的激活型基因簇作为探针,可在微生物基因簇数据库中批量挖掘大量同源基因簇,且这类同源基因簇大概率也能被同种诱导剂靶向激活。本研究利用 cblaster 软件(v1.3.19),在包含 130 余万个微生物生物合成基因簇的自建数据库中,比对筛选得到1274 个与上述激活型基因簇高度同源的基因簇(图 9A、附表 S3)。在 32 个激活型基因簇中,23 个存在同源基因簇,剩余 9 个未检索到同源序列。这些同源基因簇来源于 8 个菌属,主要集中于伯克霍尔德菌属、链霉菌属与链球菌属。借助已知激活基因簇对应的诱导剂,可靶向挖掘同源基因簇编码的新型次级代谢产物类似物,大幅提升天然产物的挖掘效率。此外,本研究以sur基因簇为模型开展初步验证:证实伊维菌素可显著诱导白浅灰链霉菌 LHW61002中同源sur基因簇的高表达(图 9B)。采用代谢组学软件 MZmine3 对二级液质联用(LC-MS²)数据进行特征峰提取与预处理后,将数据上传至全球天然产物分子网络平台(GNPS),开展基于特征峰的分子网络分析(FBMN)。结果显示:同一分子家族下的 2 个特征峰被注释为已知化合物骏河酰胺 A、骏河酰胺 D,另有 1 个特征峰暂无匹配注释,推测为未被报道的新型结构类似物(图 9C)。结合已有文献与质谱镜像比对结果,该未知化合物初步推断为酰基化骏河酰胺 I。该案例充分证实:同源生物合成基因簇能够响应同种诱导剂并被特异性激活,为靶向发掘新型微生物天然产物提供了高效可行的研究策略。[图片]图 9 图例说明(A) 序列相似性网络由自建细菌基因簇数据库中、与已激活生物合成基因簇(BGCs)高度同源的基因簇构建而成。每个节点代表一个生物合成基因簇;节点填充颜色对应基因簇的来源菌属;节点形状表示 BIG‑SCAPE 软件划分的基因簇类型。(B) 从实验室保藏菌株白浅灰链霉菌 Streptomyces albidoflavus LHW61002中鉴定得到的sur同源基因簇对比分析。(C) 伊维菌素可诱导白浅灰链霉菌 LHW61002 合成一系列骏河酰胺类化合物,包含一种潜在的新型结构类似物。4.4 基于调控基因的诱导剂筛选来源迥异、无序列同源性的细菌生物合成基因簇,也可能共享高度相似的调控机制。例如,一类受 LuxR 型群体感应(QS) 调控的生物合成基因簇,可合成多种活性代谢产物,包括临床常用抗生素(碳青霉烯、莫匹罗星)及巴托波林、伊那西菌素等抗菌物质。LuxR 型群体感应的调控机制核心为 LuxR 蛋白激活:该蛋白作为信号分子受体(配体多为酰基高丝氨酸内酯 AHL 或其他未知信号分子),激活后进而调控下游关联生物合成基因簇的表达。已有研究证实,外源添加费氏弧菌 AHL 信号分子(3 - 氧代己酰 - L - 高丝氨酸内酯),可使群体感应调控的生物发光表型提前启动、且发光强度显著提升(Septer & Stabb, 2012)。该结果表明:筛选靶向 LuxR 型调控蛋白的特异性诱导剂,有望批量激活所有受 LuxR 型群体感应调控的同类生物合成基因簇。格林伯格团队系统调研了变形菌门中luxR 同源基因关联的生物合成基因簇,发现 luxR 同源基因与次级代谢基因簇的调控关联,在演化历程中发生过多次独立起源(Brotherton 等,2018)。这类调控关联广泛存在于功能各异的非同源基因簇中,说明不同来源、序列无同源性的 BGC,其调控模式发生了趋同进化。该团队借助生物信息学手段,批量鉴定出大量受 LuxR 型群体感应管控的生物合成基因簇,可为后续利用诱导剂实现靶向激活提供丰富的研究靶点。[图片]图 10自建细菌基因簇数据库中、含有已知调控基因同源序列的生物合成基因簇(BGC)序列相似性网络。每个节点代表一个生物合成基因簇;节点填充颜色表示基因簇的来源菌属;节点边框颜色代表该基因簇所含调控基因的类型;节点形状为 BIG‑SCAPE 软件划分的基因簇类别。机制研究表明:化学诱导剂可通过改变生物合成通路中调控基因的表达水平或蛋白活性,大幅促进微生物次级代谢产物合成。这提示:此类诱导剂同样有潜力激活携带同源调控基因的其他生物合成基因簇。据此,依托保守调控基因,有望实现大范围沉默基因簇的靶向激活。本研究选取已证实可被特定化学诱导剂调控的关键基因(malR、thaR、actII‑ORF4、surR、recA、lexA),在自建细菌基因簇数据库中开展本地 BLAST 比对。结果共筛选得到608 个含有上述基因高度同源编码序列的生物合成基因簇(附表 S4)。利用 BiG‑SCAPE 1.1.9 版本、默认阈值 0.3 进行聚类分析:其中 334 个基因簇归为62 个基因簇家族(GCFs)(图 10),剩余 274 个为独立单特异基因簇。值得注意的是,其中 5 个基因簇家族共包含 144 个生物合成基因簇,均分别对应前文已证实可被小分子激活的核心基因簇(des、tha、sur、act、mal)。说明这些基因簇不仅与上述 5 类经典基因簇序列高度相似,还共享同源调控基因,被对应诱导剂激活的概率显著更高。该类基因簇来源广泛,覆盖 182 个不同菌属,其中链霉菌属占比最高。recA与lexA基因广泛分布于各类基因簇与微生物类群中,在 DNA 修复及 SOS 应激通路调控中发挥核心作用。后续可探究对应化学诱导剂对上述同源基因菌株次级代谢谱的影响,为基于保守调控机制、靶向挖掘新型天然产物提供理论依据。5 结论与展望综上,本文系统综述了化学诱导剂激活沉默生物合成基因簇的研究现状,及其在新型天然产物靶向挖掘中的应用。基因组挖掘研究已证实微生物体内存在大量未解析的沉默基因簇,合理利用化学诱导剂,是充分释放微生物次级代谢潜能的关键手段。本文整合构建了诱导剂 — 生物合成基因簇 — 代谢产物关联知识图谱,并依据作用模式将诱导剂分为三大类:应激相关诱导剂、信号通路相关诱导剂及其他化合物。目前,大量诱导剂激活基因簇的效应及其深层调控机制仍尚不明确,也是未来重点研究方向。本文系统总结了诱导剂激活沉默基因簇的分子调控机制。从调控网络层面来看:尽管诱导剂化学结构差异巨大,但最终会汇聚至少数保守的上游调控通路与应激响应途径,主要分为四大类:氧化应激调控、DNA 损伤介导的 SOS 应答、核糖体扰动调控、代谢应激调控。不同诱导剂的初始作用靶点各不相同,但各类细胞扰动信号会经由基因调控网络整合,最终解除基因簇抑制、实现沉默 BGC 激活。现阶段,诱导剂介导基因簇激活的分子机制解析仍不完善。随着多组学技术快速发展,结合时间序列转录组、蛋白质组、代谢组与表观基因组的联合分析,有望构建完整的基因调控网络(GRN)及拓展调控通路。同时,基于机器学习、深度学习的网络推演算法,为鉴定基因簇激活的通用上游调控因子与功能模块提供了强大工具。此外,单细胞转录组可弥补常规混合测序的不足,解析菌群中不同个体对诱导剂的响应异质性。上述技术进步将助力深入解析诱导剂 — 调控网络互作机制,为基因簇激活与天然产物挖掘提供理性设计策略。同时,依托知识图谱与机制解析结果,本文提出四类沉默基因簇靶向激活核心策略:(1)基于调控机制筛选诱导剂:依据保守应激通路与信号调控通路选择适配诱导剂;(2)基于结构‑活性关系筛选诱导剂:挖掘已知诱导剂的结构类似物,用于激活同类基因簇;(3)基于 BGC 同源性筛选诱导剂:利用成熟诱导剂,激活序列与功能特征相似的同源基因簇;(4)基于调控基因筛选诱导剂:靶向筛选与已激活基因簇共享同源调控基因的基因簇。实际研究中,对于已有基因组数据的菌株,可优先参考近缘物种研究成果指导诱导剂筛选;若同属或近缘菌株已有小分子扰动报道,可优先选用该类化合物及其结构类似物。结合基因组注释与 BGC 预测结果,评估菌株基因组中是否含有知识图谱收录的同源基因簇与调控基因,进一步缩小诱导剂筛选范围。需客观说明的是,该套策略存在固有局限性:仅适用于与现有知识图谱存在关联的菌株与诱导剂,无法为任意微生物菌株提供普适性筛选方案。但相较于盲目随机筛选,该知识驱动型筛选框架可大幅提升阳性诱导剂的筛选成功率,尽管无法保证百分百激活效果。整体而言,该体系有助于丰富诱导剂激活基因簇的研究案例,同时为机制研究与应用研究提供支撑。利用诱导剂实现特定基因簇的靶向激活仍面临诸多挑战,需综合考虑多项关键影响因素:诱导剂浓度是核心因素,尤其抗生素类诱导剂,通常需在亚抑菌浓度下发挥代谢激活作用;培养体系同样关键,培养基组分可显著调控沉默基因簇表达,碳源、氮源、磷源的含量变化会直接改变次级代谢产物的种类与产量。例如:高浓度葡萄糖、磷酸盐、铵盐通常抑制次级代谢,而低水平营养胁迫可显著促进次级代谢合成。大量研究证实,同一诱导剂在不同培养基中会产生完全不同的代谢表型,充分说明次级代谢高度依赖培养条件;固体 / 液体培养方式差异,也会导致相同诱导剂处理下代谢谱截然不同。温度、pH 等环境条件同样会影响基因簇表达,需纳入实验设计。因此,在筛选诱导剂之外,还应优化培养基组合与培养环境参数。可结合液质联用(LC‑MS)、生物活性筛选、报告基因检测等技术,表征不同条件下的新型代谢产物与激活基因簇。对于微生物而言,诱导剂可视为一类特殊的 “调控药物”:通过唤醒休眠沉默基因簇,解除次级代谢沉默状态,挖掘微生物隐秘的代谢潜力。如同药物理性设计推动新药研发,基于已知诱导剂的结构优化与机制改造,也有望开发出活性更强、靶向性更高的新型诱导剂。随着诱导剂激活案例不断积累、分子机制逐步阐明,人们对诱导剂与微生物调控网络的互作认知将更加完善,进而实现诱导剂的理性设计与机制深度解析,推动特异性代谢产物高效合成,赋能医药研发、生物技术及相关领域发展。本文配套补充数据见文末。