Professional Documents
Culture Documents
Correspondence
ppandolf@bidmc.harvard.edu
In Brief
A CRISPR activation screen identifies
Identification of Drug both coding and noncoding pathways
Resistance Coding and involved in resistance to chemotherapy.
lncRNA Networks
Highlights
d Analysis of CCLE and CTD2 identifies coding and lncRNA
genes for Ara-C resistance
NY 10032, USA
6Department of Molecular Biotechnology and Health Sciences, and GenoBiToUS, Genomics and Bioinformatics Service, University of Turin,
Turin, Italy
7Center for Translational Genomics and Bioinformatics, San Raffaele Scientific Institute IRCCS, Milan, Italy
8Department of Stem Cell and Regenerative Biology, Harvard University, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
9Preclinical Murine Pharmacogenetics Facility and Mouse Hospital, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston,
MA, USA
10Functional Genomics and Noncoding RNAs Research Unit, Institut de Recherches Cliniques de Montréal (IRCM), Montréal, QC, Canada
11Department of Biochemistry and Molecular Medicine, Université de Montréal, Montréal, QC, Canada
12These authors contributed equally
13Lead Contact
*Correspondence: ppandolf@bidmc.harvard.edu
https://doi.org/10.1016/j.cell.2018.03.052
SUMMARY INTRODUCTION
Resistance to chemotherapy plays a significant Although precision medicine and targeted therapies offer new
role in cancer mortality. To identify genetic units hope for treating cancer, chemotherapy still remains the first,
affecting sensitivity to cytarabine, the mainstay of and last, line of defense for most patients. Cytarabine (1-b-d-ara-
treatment for acute myeloid leukemia (AML), binofuranosylcytosine, Ara-C) is a deoxycytidine analog that is
we developed a comprehensive and integrated used as part of a standard chemotherapeutic regimen for the
treatment of acute myeloid leukemia (AML) (Ramos et al.,
genome-wide platform based on a dual protein-cod-
2015). However, 30% to 50% of patients relapse with chemo-
ing and non-coding integrated CRISPRa screening
therapy-resistant disease. Thus, there is an ever-present need to
(DICaS). Putative resistance genes were initially better understand the genetic and molecular mechanisms that
identified using pharmacogenetic data from 760 hu- contribute to chemotherapy resistance.
man pan-cancer cell lines. Subsequently, genome To date, studies on mechanisms leading to therapy resistance
scale functional characterization of both coding have focused on protein-coding genes, yet cancer development
and long non-coding RNA (lncRNA) genes by and progression cannot be fully explained by the coding genome
CRISPR activation was performed. For lncRNA func- (Huarte, 2015; Imielinski et al., 2012). The recent explosion in
tional assessment, we developed a CRISPR activa- research and understanding related to the non-coding RNA
tion of lncRNA (CaLR) strategy, targeting 14,701 (ncRNA) transcriptome has highlighted the importance of
lncRNA genes. Computational and functional anal- ncRNAs in biology (Hon et al., 2017; Iyer et al., 2015). Functional
validation of various ncRNA species highlights the fact that these
ysis identified novel cell-cycle, survival/apoptosis,
RNAs may play important roles in the pathogenesis of diseases
and cancer signaling genes. Furthermore, transcrip-
including cancer (Schmitt and Chang, 2016).
tional activation of the GAS6-AS2 lncRNA, identified One large group of ncRNAs is represented by long non-coding
in our analysis, leads to hyperactivation of the GAS6/ RNAs (lncRNA). LncRNAs can be either nuclear or cytoplasmic in
TAM pathway, a resistance mechanism in multiple localization and play roles in a diverse array of biological pro-
cancers including AML. Thus, DICaS represents a cesses. As many nuclear lncRNAs behave in a cis-acting manner
novel and powerful approach to identify integrated (Quinn and Chang, 2016), their study requires their expression
coding and non-coding pathways of therapeutic from endogenous loci, and CRISPR technologies now facilitate
relevance. the modulation of gene expression directly from the endogenous
Cell 173, 649–664, April 19, 2018 ª 2018 Elsevier Inc. 649
A B
Sensitive Cells Resistant Cells
0.2
Enrichment Score
Enrichment Score
NES = 1.385 NES = 1.232
0.2 p-value = 0.013 p-value = 0.025
0.1
0.1
0.0
0.0
-0.1
0 5000 10000 15000 20000 0 5000 10000 15000 20000
Rank Rank
0.0 0.0
Enrichment Score
Enrichment Score
200
300
-0.2 -0.2
p < 2.2e-16
E F
1.00 4 4
Antisense Gene Drug-Sensitivity
0.75 2 2
Relative Density
Random Pairs
Correlation
Correlation
Sense-Antisense
0.50 Pairs 0 0
-2 -2
0.25
r = 0.552 r = 0.021
-4 p < 2.2e-16 -4 p = 0.1338
0.00
-4 -2 0 2 4 -4 -2 0 2 4
-1.0 -0.5 0.0 0.5 1.0 Sense Gene Drug-Sensitivity Sense Gene Drug-Sensitivity
Pearson Correlation Correlation Correlation
Figure 1. Identification of Protein-Coding and Non-coding Gene Biomarkers Correlated with Differential Ara-C Response
(A) Distribution of Ara-C drug sensitivities across 760 pan-cancer cell lines profiled by both CCLE and CTD2 studies, quantified by their Z scaled area under the
dose-response curve values after regressing out lineage-specific effects. See also Table S1.
(B) Distribution of Z scaled drug resistance-gene expression Pearson correlation values of all analyzed genes. Representative protein-coding and non-coding
gene symbols enriched beyond a Z score threshold of ±1.16 are demarcated. See also Table S1.
(C) Summary of gene set enrichment analysis (GSEA) of protein-coding genes ranked by drug resistance-gene expression correlation values using annotated
KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways. See also Table S3.
(D) Representative KEGG pathways from GSEA of protein-coding genes ranked by drug sensitivity-gene expression correlation values as shown in (B) and (C).
See also Table S3.
(E) Pearson correlation distributions of gene pair expression levels in the cancer cell line panel across 997 sense-antisense cognate gene pairs and 5,000 random
protein coding-lncRNA gene pairs. Wilcoxon rank-sum test: p < 2.2e16.
(F) Relationship of drug sensitivity-gene expression correlation values between protein coding-lncRNA gene pairs across 997 sense-antisense cognate gene
pairs (left: Pearson’s R = 0.552, p < 2.2e16) and 5,000 random gene pairs (right: Pearson’s R = 0.021, p = 0.1338).
See also Figures S1 and S2 and Table S2.
Normalized Viability
Control
15 0.75
Le viral Prepar on
10 0.5
5 0.25 dCas9-VP64
0 0 MS2-P65-HSF1
CTV-1
PL-21
CESS
ME-1
THP-1
NKM-1
CMK
KMOE-2
MONO-MAC-6
ML-2
OCI-M1
KY821
NOMO-1
OCI-AML5
NB4
KASUMI-1
OCI-AML2
QIMR-WIL
OCI-AML3
P31-FUJ
GDM-1
MOLM-14
HL-60
MOLM-16
MOLM-13
16.00
0.00
0.03
0.06
0.13
0.25
0.50
1.00
2.00
4.00
8.00
Stable CRISPRa cells Library infe on Treatment 14d
AraC uM
D E
F G H I
ZBP1 Control * ZBP1 Control
+++++
1.00 + * ** 8.00
* **
++
Survival probability
++ Log-rank
Normalized Viability
++ 1
0.75 p = 0.0074
ZBP1
+
% Apoptosis
Cell Counts
+++++++++
++ ++ +++ ++ + 4.00
+
0.50
++ + 0.5
+++++++ + ++ ++ ++ 2.00
0.25
0.00
0 1.00
0 25 50 75 100
0 0.06 0.12 0.25 0.5 D1 D2 D3
Time (Months)
uM Ara-C Days
++ 1
Survival probability
+ Log-rank
0.75
++ p = 0.0033 **
Cell Counts
MUL1
4.00
% Apoptosis
++
0.50 +
+
++++++++
++++++++ ++ ++
++++++++ ++ 0.5
2.00
0.25
+
0.00
0 1.00
0 25 50 75 100
0 0.06 0.12 0.25 0.5 D1 D2 D3
Time (Months)
uM Ara-C Days
Log-rank 1 * *
% Apoptosis
Normalized Viability
PI4K2A
0.75 + p = 0.038
++ ** 4.00
Cell Counts
+ ++++
0.50 +++++ + +++ + +
++++ 0.5
++++++++ + + 2.00
0.25
++ +++ ++
0.00 1.00
0
0 25 50 75 100 D1 D2 D3
Time (Months) 0 0.06 0.12 0.25 0.5
Days
uM Ara-C
Non-Targeting Control 99
Processed
Total 14,701 22,253 88,543
lncRNA Coding Gene
B C D
Non-Coding Protein Coding
Protein-Coding
Noncoding Protein-Coding Noncoding
Genes Genes
Genes
Genes Genes Genes
15
Detected in AML
10
E
Phagosome Gap junction Intestinal immune
network for IgA Cell adhesion molecules
production (CAMs)
Glycerophospholipid Proteoglycans in cancer
metabolism ECM-receptor interaction
Axon guidance AC106897.1
Adherens junction
Mitophagy - animal
FC Expression
4 *
FC Viability
* *
150
*
3
*
100
2
* * 50
1
0 0
C D
0 0 0 1 1 1
0 0.25 0.5 0 0.25 0.5 0 0.25 0.5 D1 D2 D3 D4 D1 D2 D3 D4 D1 D2 D3 D4
0 0 0 1 1 1
0 0.25 0.5 0 0.25 0.5 0 0.25 0.5 D1 D2 D3 D4 D1 D2 D3 D4 D1 D2 D3 D4
1 3
** ** *
2 2
0.5
0.5
0.5 ** 2
**
Cell Counts
0 0 0 1 1 1
0 0.25 0.5 0 0.25 0.5 0 0.25 0.5 D1 D2 D3 D4 D1 D2 D3 D4 D1 D2 D3 D4
PI
ANNEXIN V
F Disease Free Survival, AML
G
Merge γ-H2AX DAPI 1.00 +
+++
* 1.00 +
++
60 + ++
Survival probability
+++ Log-rank
Survival probability
++ Log-rank
0.75
+
+ p = 0.035 +++
++
0.75 p = 0.0026
Control
++ +
40 0.50 ++++++++ ++++++
Foci/Cell
0.50 +++++++++ +
+++++++ ++ ++++
++ +++ ++ ++++
++++++ ++
0.25 ++
++ + + 0.25
20 + ++ +
AL353148.1
0.00 0.00
0 25 50 75 100 0 25 50 75 100
0 Time (Months) Time (Months)
Control AL353148.1
GAS6-AS2 Expr + High + Low AC008073.2 Expr + High + Low
FC Viability
Log2Change
6 #5
FC Viability
0.5 5 #8
FC
5
4 #1
4
Log 2Fold
3
0.0 3 #2
CRISPRa
#6
2 2
#7
1 1
CRISPRa
-0.5 0 0
1 2 3 4 5 6 7 8 0 20 40 60
GAS6-AS2 sgRNA # Expression levels
D E **
**
GAS6-AS2 GAS6-AS2 Control GAS6-AS2 GAS6-AS2
sgRNA #1 sgRNA #3 sgRNA sgRNA #1 sgRNA #3
1 1 *
Normalized Viability
% of Apoptosis
** **
0.5 0.5
** **
0 0
PI
F
Ara-C Day 0 Ara-C Day 2 Ara-C Day 3 Ara-C Day 4 Control
GAS6-AS2
100 ***
***
***
80
% Cells
60
40
Pacific Blue
20
(Control)
0
0 2 3 4
Days of Treatment
DsRed (GAS6-AS2)
H I
Day 17 Control Day 17 Ara-C
100%
Fluorescent Positive Cells
90%
80%
70%
60%
50%
40%
30%
Pacific Blue
20%
(Control)
10%
0%
Mouse #1 #2 #3 #5 #6 #7 #8 #9 #10 #11 #12 #14
Mice BM Day 17
Control Ara-C
100 P = 0.025606 #1 1
4
expression levels following GAS6-AS2 activation. Data
GAS-AS2
0
#8
3.5 are represented as mean of triplicate measurements.
#7 -1
10 3 (B) Pearson correlation between GAS6-AS2 and GAS6
-2 P = 4.33E-05
#2
#6
2.5 R = 0.530 expression levels across the 760 cancer cell lines
-3
1
0 20 40 60 -2 0 2
2
4.5 6.5 8.5 10.5
analyzed (Figures 1A and 1B).
GAS6, Z-Scaled VST Counts
GAS6-AS2 FC Expression GAS6 (C) Pearson correlation between GAS6-AS2 and GAS6
expression levels in AML patient samples.
D E F (D) Western blot analysis of differential GAS6/TAM
GAS6-AS2 #4
GAS6-AS2 #3
GAS6-AS2 #1
HSP90 0
3.5
1 1 1.2 1.1 1.1 1.1 (F) Pearson correlation between GAS6-AS2 and AXL
3 -1
pTYRO3 expression levels across the 760 cancer cell lines
2.5 P = 4.23E-04
pERK
50
3000
100
Ara-C. Data are represented as mean ± SD, n = 3.
FC Expression
FC Expression
FC Expression
40 2500
80 See also Figure S7.
30 2000
1 1 1 2.3 2.3 2.3 60
1500
20
1000 40
ERK 10 500 20
0 0 0
1 1 1 1.1 1 1 MOLM K562 MOLM K562 MOLM K562
4 *
3
* * *** uM Ara-C levels (Figures 7A, S7I, and S7J) as well as
2 an increased sensitivity of K562 cells to the
1
activity of Ara-C (Figure 6B).
0
Previous studies found that AXL transcrip-
tion is regulated by methylation of CpGs up-
stream of its TSS (Mudduluru and Allgayer,
2008). Direct methylation analysis using a
and Burbridge, 2017), with both MEK-ERK and S6K-RPS6 bisulfite assay identified 6 highly methylated sites in the AXL pro-
signaling axes being known downstream targets of TAM moter. Correspondingly, GAS6-AS2 overexpressing cells show
signaling (Xu et al., 2017). Western blot analysis of lysates from significant decreases in methylation of these CpG sites (Fig-
cells expressing three distinct GAS6-AS2 targeting sgRNAs ure 7C), suggesting that GAS6-AS has the potential to act in
confirmed activation of the GAS6/TAM pathway (Figure 6D). both a cis- and trans-acting manner.
Importantly, both pERK and pRPS6 were strongly phosphory- To characterize the global function of GAS6-AS2 in cancer, we
lated in response to GAS6-AS2 activation (Figure 6D). performed an unbiased k-means clustering based on coding and
Surprisingly in a variety of cancer subtypes, including AML, non-coding gene expression across 53 AML patients (Garzon
GAS6-AS2 expression levels share strong correlations not only et al., 2014) (Figure S7D). A large number of genes known to
with GAS6 but also to its target receptor AXL (Figures 6E and be regulated by promoter methylation were clustered together
S6B). AXL correlation was also mirrored in our 760 CCLE cell with GAS6-AS2 (Figure 7D), supporting our hypothesis that
line panel (Pearson’s R = 0.6064, p < 2.2e16) (Figure 6F) and GAS-AS2 mediates CpG modification.
by overexpression of GAS6-AS2 in both MOLM14 and HEK293 Based on these results, we hypothesized that GAS6-AS2
cells in vitro (Figure S7C). These data suggested that GAS6- trans-activity may function through a DNA methyltransferase.
AS2 may be able to regulate the TAM receptor signaling axis at An unbiased screening of our CCLE panel for candidate DNA
a number of levels. methyltransferases that would correlate with Ara-C sensitivity
To further investigate the role of GAS6-AS2 in regulating GAS6 identified decreased expression of DNMT1 and DNMT3A
and AXL, we took advantage of the K562 leukemia cell line, (Figure 7E). Importantly, we observed GAS6-AS2 to be signifi-
which we found to express high levels of GAS6-AS2, GAS6, cantly enriched in RNAs bound to DNMT1 using a DNMT1
0.8
FC Expression
FC Expression
0.8 0.8 0.8
0.6 *
0.6 0.6 0.6
0.4
0.4 0.4 0.4
0.2 0.2
0.2 0.2
0 0 0 0
K562 Control K562 ASO K562 Control K562 ASO K562 Control K562 ASO 0 0.5 1 2 4 8
uM Ara-C
90 * Methyla on
80 * Level
% of Methylation
* *
70 *
60 *
50
40
30
20
10
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Methylated
Methylated CpG
CpG Control
Control CpG Gas6-As2
Methylated CpG GAS6-AS2
D E
Name Source P-Value FDR
F G
(B) Modulation of Ara-C response upon GAS6-AS2 knockdown via ASO in K562 cells. Data are represented as mean ± SD, n = 3, Welch two-sample t test:
*p < 0.05. **p < 0.01, ***p < 0.001.
(C) Methylation of CpG islands in the HEK293T AXL promoter following modulation of GAS6-AS2 expression. n = 12, Chi-square test: *p < 0.05. **p < 0.01,
***p < 0.001.
(D) Gene ontology analysis of coding genes clustered with GAS6-AS2 as determined by k-means clustering (cluster #6).
(E) Drug sensitivity-gene expression Pearson correlation values of DNA methyltransferases. Genes enriched beyond a Z score threshold of ± 1.16 are colored in
red. See also Figure 1B.
(F) Distribution of FPKM-normalized transcript abundances associated with DNMT1 versus IgG.
(G) Model summarizing the mechanism by which GAS6-AS2 regulates GAS6/TAM signaling.
See also Figure S7.
Imielinski, M., Berger, A.H., Hammerman, P.S., Hernandez, B., Pugh, T.J., Ho- Perron, U., Provero, P., and Molineris, I. (2017). In silico prediction of lncRNA
dis, E., Cho, J., Suh, J., Capelletti, M., Sivachenko, A., et al. (2012). Mapping function using tissue specific and evolutionary conserved expression. BMC
the hallmarks of lung adenocarcinoma with massively parallel sequencing. Bioinformatics 18 (Suppl 5 ), 144.
Cell 150, 1107–1120. Quinn, J.J., and Chang, H.Y. (2016). Unique features of long non-coding RNA
Iyer, M.K., Niknafs, Y.S., Malik, R., Singhal, U., Sahu, A., Hosono, Y., Barrette, biogenesis and function. Nat. Rev. Genet. 17, 47–62.
T.R., Prensner, J.R., Evans, J.R., Zhao, S., et al. (2015). The landscape of long Ramos, N.R., Mo, C.C., Karp, J.E., and Hourigan, C.S. (2015). Current ap-
noncoding RNAs in the human transcriptome. Nat. Genet. 47, 199–208. proaches in the treatment of relapsed and refractory acute myeloid leukemia.
Joung, J., Engreitz, J.M., Konermann, S., Abudayyeh, O.O., Verdine, V.K., J. Clin. Med. 4, 665–695.
Aguet, F., Gootenberg, J.S., Sanjana, N.E., Wright, J.B., Fulco, C.P., et al. Rees, M.G., Seashore-Ludlow, B., Cheah, J.H., Adams, D.J., Price, E.V., Gill,
(2017a). Genome-scale activation screen identifies a lncRNA locus regulating S., Javaid, S., Coletti, M.E., Jones, V.L., Bodycombe, N.E., et al. (2016). Corre-
a gene neighbourhood. Nature 548, 343–346. lating chemical sensitivity and basal gene expression reveals mechanism of
Joung, J., Konermann, S., Gootenberg, J.S., Abudayyeh, O.O., Platt, R.J., action. Nat. Chem. Biol. 12, 109–116.
Brigham, M.D., Sanjana, N.E., and Zhang, F. (2017b). Genome-scale Reimand, J., Arak, T., Adler, P., Kolberg, L., Reisberg, S., Peterson, H., and
CRISPR-Cas9 knockout and transcriptional activation screening. Nat. Protoc. Vilo, J. (2016). g:Profiler-a web server for functional interpretation of gene lists
12, 828–863. (2016 update). Nucleic Acids Res. 44 (W1), W83-9.
Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact Dr. Pier
Paolo Pandolfi (ppandolf@bidmc.harvard.edu).
MOLM14 (male) and K562 (female) cell lines were maintained in RPMI supplemented with 10% FBS, 2 mmol/L glutamine, and
100 U/mL penicillin/streptomycin (Invitrogen). HEK293T (female) cells were maintained in DMEM supplemented with 10% FBS,
2 mmol/L glutamine, and 100 U/mL penicillin/streptomycin (Invitrogen). HL60 (female) cells were maintained in IMDM supplemented
with 10% FBS, 2 mmol/L glutamine, and 100 U/mL penicillin/streptomycin (Invitrogen).
Adult female NSG mice (6–8 weeks old) mice were obtained from Jackson Laboratory (Bar Harbor, ME). Animals were used
in accordance with a protocol reviewed and approved by the Institutional Animal Care and Use Committee at BIDMC
(Protocol #082-2014).
Ara-C (Sigma-Aldrich) was diluted in sterile water to a working concentration of 1 mM.
Determination of IC50
To determine the half maximal inhibitory concentration (IC50), we grew cells for 48 h at a range of concentrations of Ara-C (32
to 0.03 mM). Cell numbers were measured by CellTiter 96 Aqueous Non-Radioactive Cell Proliferation Assay (Promega).
Signal was measured using a Glomax Multi+ (Promega) plate reader. The normalized measurements were used to obtain survival
curves and IC50 values. For DCK knockdown we used MISSION shRNA (Sigma-Aldrich TRCN0000009934, TRCN0000196649,
TRCN0000196362), which were transduced according to standard protocols. For BCL2 overexpression we used pCDH-puro-
Bcl2 (Addgene #46971), which was transduced according to standard protocols. Two different LNA-based GapmeR (Exiqon) anti-
sense oligonucleotides (ASOs) were designed to target GAS6-AS2 specifically. The ASOs were transfected at a concentration of
50 mM using Lipofectamine 2000 according to standard protocols.
Apoptosis
Cells were centrifuged, media was removed, and cells were resuspended in 100 mL of Annexin V Binding Buffer (BD PharMingen)
containing 1 mg/ml propidium iodide (Sigma-Aldrich) and APC-Annexin V (BioLegend) for 15 minutes. 400 mL of Annexin V Binding
Buffer (BD PharMingen) was then added to the samples. Fluorescence was measured using a BD LSR II flow cytometer or Gallios
flow cytometer (Beckman Coulter) and then analyzed using the FlowLogic software (Miltenyi Biotec) or FlowJo V10.
Nuclear/cytoplasmic fractionation
Fractionation of the cells to separate nucleus and cytoplasm was performed using PARIS-kit (Ambion) following the manufacturer’s
protocol. The efficiency of the separation of the two fractions was assessed by qPCR amplification with primers targeting the nu-
cleus-specific lncRNA MALAT1, followed by 2% agarose gel electrophoresis in 1x Tris-Acetate-EDTA buffer.
Western blotting
Cells were lysed in RIPA buffer (Boston BioProducts) supplemented with protease (Roche) and phosphatase (Roche) inhibitors.
Laemmli buffer with a 5% final concentration of b-mercaptoethanol was added to the samples, which were then boiled, separated
on NuPAGE 4%–12% Bis-Tris gradient gels (Invitrogen), and transferred to polyvinylidine difluoride membranes (Immobilon P, Milli-
pore). Membranes were then probed with the indicated antibodies. Antibodies for western blotting: Anti-TYRO3 (5585), anti-ERK
(9102), anti-phospho-ERK (9101), anti-phospho-S6 (2211), anti-S6 (2217), anti-STAT3 (4904), and anti-phospho-STAT3 (9136) anti-
bodies were purchased from Cell Signaling Technology. Anti-Actin (A3853) was purchased from Sigma Aldrich. Anti-phospho-
TYRO3/MERTK (PA5-37808) was purchased from Invitrogen.
Lentivirus production
HEK293T cells were seeded 1 day before transfection. Cells were transfected the next day at 80%–90% confluency. For each dish,
15 mg of plasmid containing the vector of interest, 3.5 mg of pMD2.G, and 13 mg of psPAX2 (Addgene) were transfected using 90 mL
Lipofectamine 2000 and 20 mL Plus Reagent (Life Technologies). Supernatant was collected 48 h and 72 h after transfection and
filtered with a 0.45-mm PVDF filter (Millipore).
RNA-IP analysis
Raw .fastq files corresponding to DNMT1 IP- and IgG IP-RNaseq datasets were obtained from the SRA archive (SRR358674,
SRR358675) (Di Ruscio et al., 2013). Resultant .fastq sequencing files were then aligned to the hg38 reference genome with default
STAR single-pass alignment (v2.5.2b) (Dobin et al., 2013). Realigned .bam files were sorted by name using SAMtools v1.3 (Li et al.,
2009). Coordinates were mapped to Ensembl gene IDs with featureCounts (Subread v1.5.2) as denoted by Ensembl v89 annotation
using features ‘‘-p -B -s 0’’ (Aken et al., 2016; Liao et al., 2014). Genes were then normalized by fragments per kilobase of transcript
per million mapped reads (FPKM), with gene length determined by featureCounts, and then FPKM values were log2-transformed after
the addition of a 1e-5 pseudocount. Log-transformed counts from DNMT1-IP were then subtracted from the log-transformed counts
from IgG-IP, which segregated into three clusters representing transcripts associated predominantly with DNMT1, transcripts asso-
ciated predominantly with IgG, and transcripts associated with both DNMT1 and IgG.
Statistical analysis was performed using R Statistical Software (https://www.r-project.org/) and Excel (Microsoft). All reported
P values are two-tailed, and for all analyses, p % 0.05 is considered statistically significant, unless otherwise specified. Specific de-
tails concerning statistical tests for individual experiments are noted in the relevant Figure legends.
Replicates for individual experiment types are outlined here:
Custom scripts were used for data preprocessing, statistical analysis, and data visualization as described previously. Code used in
this study is available upon request by the Lead Contact Dr. Pier Paolo Pandolfi (ppandolf@bidmc.harvard.edu).
A B C * N.S.
0.2
Regress Ara-C AUC Values by Cell
Type Parameters
0.0
-2 0 2
Ara-C Sensitivity
(Z-Scaled Area Under the Curve)
Compute Pearson Correlaon
Coefficients Between Gene
Expression Level-Drug Sensivity
D E
100 1.00
Relative Density
Significance by Receiver Operang 75
25 0.00
-4 -2 0 2 4
Z-Transformed
Pearson Correlation
0
0 25 50 75 100
Estimated False Positive Rate (%) All Genes Annotated Genes
F
Oxidative phosphorylation TCA cycle RNA degradation
Enrichment Score
Enrichment Score
-0.1
-0.1 -0.2
-0.2
-0.3 -0.4
-0.2 NES = -1.305 NES = -1.665 NES = -2.613
p-value = 0.0594 -0.4 p-value = 0.0102 p-value = 1.6e-4
-0.3 -0.5 -0.6
0 5000 10000 15000 20000 0 5000 10000 15000 20000 0 5000 10000 15000 20000
Rank Rank Rank
G H
Oxidative phosphorylation RNA degradation
0.0 0.0
Enrichment Score
Enrichment Score
-0.2 -0.2
-0.6
0 5000 10000 15000 0 5000 10000 15000
Rank Rank
Figure S1. Identification of Protein-Coding and Non-coding Gene Biomarkers Correlated with Differential Ara-C Response, Related to
Figure 1
(A) Pipeline for identification of protein-coding and noncoding gene biomarkers of differential Ara-C response across 760 cancer cell lines. See also Figures 1A
and 1B.
(B) Distribution of Ara-C drug sensitivities across 760 pan-cancer cell lines profiled by both CCLE and CTD2 studies, quantified by their Z-scaled area under the
dose response curve (AUC) values without removing lineage-specific effects.
(C) Representative effect of regressing lineage-specific annotations from Ara-C AUC values between hematopoietic and non-hematopoietic cells. Wilcoxon rank-
sum test: *p < 2.2e-16. N.S, p = 0.7208.
(D) Receiver operating characteristic (ROC) analysis of drug sensitivity-gene expression correlations to determine the optimal Z-score threshold (Z = 1.16). Genes
curated from the literature as responsive to Ara-C (Table S2) passing this Z-score threshold were considered true positives, whereas other genes not within this
curated list passing this Z-score threshold were considered false positives.
(E) Distributions of drug sensitivity-gene expression correlations for annotated genes (Table S2) versus all genes in the analysis.
(F) Representative KEGG pathways from GSEA of protein-coding genes ranked by drug sensitivity-gene expression correlation values as shown in Figures 1B
and 1C. See Table S3.
(G) Summary of GSEA of protein-coding genes ranked by disease-free survival association strength using annotated KEGG (Kyoto Encyclopedia of Genes and
Genomes) pathways. Clinical and transcriptomic data from the TCGA-LAML patient cohort was used for this analysis. Disease-free survival association was
quantified by the magnitude of the coefficient from a Cox proportional hazards model for each gene, with patient sex, age over 60, cytogenetic risk, and white
blood cell count above 16 as covariates.
(H) Representative KEGG pathways from GSEA of protein-coding genes ranked by disease-free survival association strength as shown in Figures 1B and 1C.
Clinical and transcriptomic data from the TCGA-LAML patient cohort was used for this analysis.
A B
p < 2.2e-16
Gene Pair Type # Pairs # Gene Pairs, Both Genes # Genes Pairs, Sense Gene Overlap
Correlated with Ara-C Correlated with Cognate
SensiƟvity AnƟsense Gene
C
Gene Pairs Correlated with Ara-C Resistance Gene Pairs Correlated with Ara-C Sensitivity
ACOXL ACOXL-AS1 INHBA INHBA-AS1 P4HA3 P4HA3-AS1 ADARB2 ADARB2-AS1 MYB MYB-AS1
B4GALT1 B4GALT1-AS1 IQCJ-SCHIP1 IQCJ-SCHIP1-AS1 PIK3IP1 PIK3IP1-AS1 AP4B1 AP4B1-AS1 MYCBP2 MYCBP2-AS1
BDNF BDNF-AS ISPD ISPD-AS1 PINK1 PINK1-AS CCND2 CCND2-AS2 NOP53 NOP53-AS1
CCDC183 CCDC183-AS1 ITGB5 ITGB5-AS1 PRICKLE2 PRICKLE2-AS2 CCNT2 CCNT2-AS1 PABPC5 PABPC5-AS1
CDKN2A CDKN2A-AS1 KIRREL3 KIRREL3-AS2 PRICKLE2 PRICKLE2-AS3 DCUN1D2 DCUN1D2-AS PAXBP1 PAXBP1-AS1
COL5A1 COL5A1-AS1 KRT7 KRT7-AS PRKG1 PRKG1-AS1 DLGAP1 DLGAP1-AS4 PCCA PCCA-AS1
CPEB2 CPEB2-AS1 LACTB2 LACTB2-AS1 RBPMS RBPMS-AS1 DLGAP1 DLGAP1-AS5 PLCH1 PLCH1-AS1
CXXC5 CXXC5-AS1 LIFR LIFR-AS1 RERG RERG-AS1 ELFN1 ELFN1-AS1 PRC1 PRC1-AS1
DOCK9 DOCK9-AS2 LOXL1 LOXL1-AS1 RMDN2 RMDN2-AS1 ENTPD1 ENTPD1-AS1 RNF219 RNF219-AS1
EGFR EGFR-AS1 LRP1 LRP1-AS ROR1 ROR1-AS1 FAM53B FAM53B-AS1 SLC25A30 SLC25A30-AS1
EXOC3 EXOC3-AS1 MAMDC2 MAMDC2-AS1 SEMA5A SEMA5A-AS1 FBXW7 FBXW7-AS1 SLCO4A1 SLCO4A1-AS1
FAM83A FAM83A-AS1 MKX MKX-AS1 SLC2A1 SLC2A1-AS1 FOXD2 FOXD2-AS1 SMAD1 SMAD1-AS2
FLNC FLNC-AS1 MRVI1 MRVI1-AS1 SPON1 SPON1-AS1 GABPB1 GABPB1-AS1 SMC5 SMC5-AS1
FRMD6 FRMD6-AS1 NALCN NALCN-AS1 TGFB2 TGFB2-AS1 GTF3C2 GTF3C2-AS1 SSBP3 SSBP3-AS1
FRMD6 FRMD6-AS2 NDST1 NDST1-AS1 THSD4 THSD4-AS1 HHATL HHATL-AS1 THAP9 THAP9-AS1
FTCD FTCD-AS1 NEURL1 NEURL1-AS1 TM4SF1 TM4SF1-AS1 INTS6 INTS6-AS1 TMPO TMPO-AS1
FUT8 FUT8-AS1 NEXN NEXN-AS1 TM4SF19 TM4SF19-AS1 INTS9 INTS9-AS1 USP3 USP3-AS1
GAS6 GAS6-AS2 NTRK3 NTRK3-AS1 VLDLR VLDLR-AS1 KCNIP2 KCNIP2-AS1 ZBED3 ZBED3-AS1
Figure S2. Identification of Sense-Antisense Gene Pairs Highly Correlative with Differential Ara-C Response, Related to Figure 1
(A) Pearson correlation distributions of gene pair expression levels in the TCGA-LAML patient samples across 997 sense-antisense cognate gene pairs and 5,000
random protein coding-lncRNA gene pairs. Wilcoxon rank-sum test: p < 2.2e-16.
16384 50
1024 4096 256 1024
4096
1024 40
256
FC Expression
256 1024 64
256 30
64 256 64
64 16
64 16 20
16 16
16 4
4 4 4 10
4
1 1 1 1 1 0
MIAT TTN ASCL1 HBG1 RHOXF2 TUNA
B C D
34 0uM AraC
0
33
0.25uM AraC
32 0.25
31
107 Cells
30
29
28
27
26
25
1 2 3 4 5 7 8 10 11
Days
E F G DCK
Normalized Viability 1
FDR = 0.339
*
0.5
0
0 0.03 0.06 0.12 0.25 0.5
uM Ara-C
H I
PXDC1 TUFT1 ZNF532
*
** *
Normalized Viability
Normalized Viability
Normalized Viability
1 * 1 ** 1
*
*
0 0 0
0 0.06 0.12 0.25 0.5 0 0.06 0.12 0.25 0.5 0 0.06 0.12 0.25 0.5
uM Ara-C uM Ara-C uM Ara-C
J
Control ZBP1 MUL1 PI4K2A
FC Expression
FC Expression
30 600 900
20 400 600
10 200 300
0 0 0
1 2 3 4 1 2 3 4 5 1 2 3 4
TUNA MIAT-01 MIAT-06
C
Chr:22
MIAT-01
MIAT-06
10,856 bp
D
FC Expression
p
100
1.00E+00
FC Expression
5.00E-01
FC Expression
2.50E-01 HEK293T
1.25E-01
50 6.25E-02 HEK293T
3.13E-02
1.56E-02 HEK293T
7.81E-03 HEK293T
3.91E-03 MOLM14
MOLM14
0 1.95E-03
9.77E-04 MOLM14
4.88E-04 MOLM14
AC1443548.2
E F G
H J I
-Log10(FDR)
FDR = 3.51e-5
Figure S4. CRISPRa Functional Screening of Non-coding Genes Modulating Ara-C Response, Related to Figure 3
(A) Fold change (FC) of expression levels modulated by CRISPRa for sgRNAs predicted to target the TUNA lncRNA in MOLM14 cells. Data are represented as
mean ± SD, n = 3.
Normalized Viability
Control
Normalized Viability
Control
1 1 0.8 * 0.8
Normalized Viability
*
0.6 0.6
**
**
0.5 ** 0.5 * 0.4 0.4
**
** 0.2 0.2
**
0 0 0 0
0 0.25 0.5 0 0.25 0.5 0 0.03 0.06 0.12 0.25 0.5 0 0.03 0.06 0.12 0.25 0.5
Ara-C uM Ara-C uM Ara-C uM Ara-C uM
C D
AC087477.5 RMDN2-AS1 Control LINC02426
Cell Counts
3 3 *
70 *
2 2 60 *
G1
50
S
% of Cells
1 1
40
D1 D2 D3 D4 D1 D2 D3 D4
AL353148.1 AL157688.1 30
G2/M
** 20
E * 10
0
Contol AL353148.1 LINC02426 AL157688.1
% of Apoptosis
F G
30 AC106897.1 AC008073.2 AC091982.2 GAS6-AS2
Control
25 Control Control Control
FC Expression
** ** ** ***
**
Normalized Viability
20 1 ** 1 1 1 **
**
15 * **
**
10 0.5 * 0.5 0.5 0.5 *
5
0 0 0 0 0
0 0.12 0.25 0.5 1 0 0.12 0.25 0.5 0 0.6 0.25 0.5 0 0.12 0.25 0.5 1
uM Ara-C uM Ara-C uM Ara-C uM Ara-C
H
** * ** *
% of Apoptosis
Figure S5. Validation of Enriched sgRNAs Targeting lncRNAs in the CaLR Screening, Related to Figure 4
(A) Modulation of Ara-C response upon stable expression of sgRNAs targeting CaLR-enriched lncRNAs in MOLM14 cells. Data are represented as mean ± SD,
n > 3. Welch two sample t test: *p < 0.05. **p < 0.01, ***p < 0.001.
(B) Modulation of Ara-C response upon stable expression of sgRNAs targeting CaLR-depleted lncRNAs in MOLM14 cells. Data are represented as mean ± SD,
n > 3. Welch two sample t test: *p < 0.05. **p < 0.01, ***p < 0.001.
(C) Modulation of cellular proliferation in the absence of Ara-C treatment upon stable expression of sgRNAs targeting AC087477.5 or RMDN2-AS1 in MOLM14
cells. Proliferation is quantified over four days (D1-D4). Data are represented as mean ± SD, n = 3.
70
0.75 +
+ p = 0.029 #1 GCCGCCGCGATGTGACCTTC 60
+
FC Expression
+
+ ++ #2 CGCCGCCGCCGCCGCCGCCG 50
0.50 + ++++
++ + +++
++ +++ ++ 40
++++++++ ++++ + + #3 GGCGGCGGCGGCGGCGGCGG 30
0.25 + + #4 GCACCTCAAGCGCTCGGTCT 20
10
0.00 #5 ACGCCCAGACCGAGCGCTTG 0
0 25 50 75 100 1 2 3 4 5 6 7 8
Time (Months) #6 GCGCGGGGAGGCGGCGGGGG
GAS6-AS2 sgRNA #
GAS6 Expr + High + Low
#7 GCCCCGTCCGGGAGGGAGGT
#8 TGACCCCCCCACCTCCCTCC
D BFP - Control
F G
RED33 - GAS6-AS2
1
Normalized Viability
0.8 *
0.6
***
0.4
0.2
0
9 BFP
7
5
3
1
1 2 3 4 5 6
Days
H I
PCR
GAS6-AS2 GAS6
80 ** 12 **
FC Expression Level
4 10
60
8
GAS6-AS2
Nucleus/Cytoplasm
40 6
3
MALAT1 4
GAS6-AS2 20
2
2 0 0
GAS6-AS2 sgRNA + + - + + -
1 GAS6-AS2 ASO - + + - + +
MALAT1
0
Cytoplasm Nucleus
J
GAS6-AS2 GAS6 AXL
40
80 * 15 *
FC Expression
FC Expression
*
FC Expression
30
60
10
40 20
20 5 10
0 0 0
Control GAS6-AS2 Control GAS6-AS2 Control GAS6-AS2
ASO ASO ASO ASO ASO ASO
GAS6-AS2
GAS6-AS2
R=0.659
GAS6-AS2
R=0.475
GAS6-AS2
2 0
0 0
-2 -2 -2
-3
-4 -4 -4
-6 -8 -6 -6
6.5 8.5 10.5 12.5 14.5 6 11 16 6 11 16 7 9 11 13
GAS6 GAS6 GAS6 GAS6
R=0.809 R=0.742
GAS6-AS2
R=0.586
GAS6-AS2
GAS6-AS2
0 R=0.628 2
0 0
-2 0
-2
-2
-4 -4
-5 -4
-6 -6
-6 -8 -8
-10 6 11 16 7 9 11 13 7 12 17 7 9 11 13 15
GAS6 GAS6 GAS6 GAS6
GAS6-AS2
2
GAS6-AS2
GAS6-AS2
0 0
0 0
-2 -2
-2 -2
-4 -4
-4
-6 -6 -4 -6
-8 -8 -6 -8
4 9 14 6 9 12 5 8 11 5 10 15
AXL AXL AXL AXL
C D E
Sensitivity to Ara-C
GAS6-AS2
25 SP1 SP3
FC Expression of AXL
20 MT MT MT
AXL
15 DNA DNA
Demethylase? Methyltransferase?
10 MT MT
5
SP1 SP3
AXL
0 Resistance to Ara-C
MOLM14 HEK293