A comprehensive guide covering SQL, Python/R, Statistics, Data Visualization, Excel, Data Cleaning, Business Domain Knowledge, and Machine Learning basics for Data Analyst roles.
Total Questions: 225
Difficulty Levels: Beginner, Intermediate, Advanced
Problem
Level
2.Explain the difference between INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL OUTER JOIN.
Easy
3.What are window functions and when would you use them?
Medium
4.How do you find duplicate records in a table?
Easy
5.What is the difference between DELETE, TRUNCATE, and DROP?
Medium
6.Explain primary key vs foreign key.
Easy
7.What are indexes and why are they important?
Medium
8.How do you optimize a slow-running SQL query?
Hard
9.What is a subquery and when would you use one?
Medium
10.Explain GROUP BY and its use cases.
Easy
11.What is the difference between UNION and UNION ALL?
Easy
12.How do you handle NULL values in SQL?
Easy
13.What are CTEs (Common Table Expressions)?
Medium
14.Explain the RANK, DENSE_RANK, and ROW_NUMBER functions.
Medium
15.How do you find the second highest salary in a table?
Medium
16.What are normalization and denormalization?
Medium
17.Explain the different types of keys in SQL (candidate, composite, surrogate).
Hard
18.What is a self-join and when would you use it?
Medium
19.How do you calculate running totals in SQL?
Medium
20.What is the difference between correlated and non-correlated subqueries?
Hard
21.Explain CASE statements with examples.
Easy
22.What are aggregate functions? Name some common ones.
Easy
23.How do you use PARTITION BY in window functions?
Medium
24.What is the difference between clustered and non-clustered indexes?
Hard
25.How do you find the Nth highest value in a table?
Hard
26.What are stored procedures and when would you use them?
Medium
27.Explain the COALESCE function.
Easy
28.How do you perform date calculations in SQL?
Medium
29.What is the difference between CHAR and VARCHAR?
Easy
30.How do you pivot and unpivot data in SQL?
Hard
31.What Python libraries do you use for data analysis?
Easy
32.Explain the difference between a list and a tuple.
Easy
33.What is pandas and why is it useful?
Easy
34.How do you handle missing data in pandas?
Medium
35.What is the difference between loc and iloc?
Medium
36.Explain list comprehension with an example.
Easy
37.What are lambda functions?
Medium
38.How do you merge or join dataframes in pandas?
Medium
39.What is NumPy and when would you use it?
Easy
40.How do you read CSV and Excel files in Python?
Easy
41.What is the difference between apply, map, and applymap in pandas?
Hard
42.How do you group data in pandas (groupby)?
Medium
43.What are dictionaries in Python and when do you use them?
Easy
44.Explain broadcasting in NumPy.
Hard
45.How do you handle datetime data in pandas?
Medium
46.What is the difference between shallow copy and deep copy?
Hard
47.How do you sort data in pandas?
Easy
48.What are generators in Python?
Hard
49.How do you filter data in pandas?
Easy
50.What are matplotlib and seaborn?
Easy
51.How do you create custom functions in Python?
Easy
52.What is the difference between Series and DataFrame?
Easy
53.How do you handle categorical data in pandas?
Medium
54.What are pandas pivot tables?
Medium
55.How do you optimize pandas code for large datasets?
Hard
56.What is the difference between mean, median, and mode?
Easy
57.Explain standard deviation and variance.
Easy
58.What is a p-value and how do you interpret it?
Medium
59.What is the difference between correlation and causation?
Easy
60.Explain Type I and Type II errors.
Medium
61.What is a confidence interval?
Medium
62.What is the Central Limit Theorem?
Hard
63.Explain hypothesis testing.
Medium
64.What is the difference between a population and a sample?
Easy
65.What are outliers and how do you detect them?
Medium
66.Explain normal distribution.
Easy
67.What is statistical significance?
Medium
68.What is the difference between parametric and non-parametric tests?
Hard
69.Explain regression analysis.
Medium
70.What is the difference between descriptive and inferential statistics?
Easy
71.What is the difference between linear and logistic regression?
Medium
72.Explain R-squared and adjusted R-squared.
Hard
73.What is the difference between one-tailed and two-tailed tests?
Medium
74.What is a probability distribution?
Medium
75.Explain Bayes' Theorem.
Hard
76.What is sampling and what are different sampling methods?
Medium
77.What is the law of large numbers?
Medium
78.Explain skewness and kurtosis.
Hard
79.What is multicollinearity and how do you detect it?
Hard
80.What is the difference between covariance and correlation?
Medium
81.What is a z-score?
Easy
82.Explain the t-test and when to use it.
Medium
83.What is the chi-square test?
Hard
84.What is ANOVA?
Hard
85.What is time series analysis?
Hard
86.What tools do you use for data visualization?
Easy
87.When would you use a bar chart vs a line chart?
Easy
88.What is Tableau and have you used it?
Easy
89.Explain the importance of data visualization.
Easy
90.What makes a good dashboard?
Medium
91.How do you choose the right chart type?
Medium
92.What is Power BI?
Easy
93.What are some best practices for data visualization?
Medium
94.How do you visualize data in Python?
Medium
95.What is the difference between a histogram and a bar chart?
Easy
96.When would you use a scatter plot?
Easy
97.What is a heat map and when would you use it?
Medium
98.How do you handle too many categories in a visualization?
Hard
99.What is a box plot and what does it show?
Medium
100.How do you make your visualizations accessible and easy to understand?
Hard
101.What Excel functions do you use most frequently?
Easy
102.Explain VLOOKUP and its limitations.
Easy
103.What is the difference between VLOOKUP and INDEX-MATCH?
Medium
104.How do you create a pivot table?
Easy
105.What are some advanced Excel formulas you know?
Hard
106.How do you handle large datasets in Excel?
Medium
107.What is Power Query?
Medium
108.Explain conditional formatting.
Easy
109.How do you remove duplicates in Excel?
Easy
110.What is the difference between absolute and relative cell references?
Easy
111.What are array formulas in Excel?
Hard
112.How do you use SUMIF, SUMIFS, COUNTIF, and COUNTIFS?
Easy
113.What is Power Pivot?
Hard
114.How do you create dynamic charts in Excel?
Medium
115.What are Excel macros and VBA?
Hard
116.What is data cleaning and why is it important?
Easy
117.How do you handle missing values in a dataset?
Medium
118.What is data normalization?
Medium
119.How do you detect and handle outliers?
Medium
120.What is ETL and have you worked with it?
Medium
121.How do you deal with inconsistent data formats?
Easy
122.What is data validation?
Easy
123.How do you handle duplicate records?
Easy
124.What is data transformation?
Medium
125.Explain the 80/20 rule in data cleaning.
Easy
126.What is feature engineering?
Hard
127.How do you handle imbalanced datasets?
Hard
128.What is data profiling?
Medium
129.How do you validate your cleaned data?
Medium
130.What is the difference between data standardization and normalization?
Hard
131.How do you translate business requirements into analytical tasks?
Medium
132.What KPIs have you worked with?
Easy
133.How do you communicate findings to non-technical stakeholders?
Medium
134.What is A/B testing and when would you use it?
Medium
135.How do you measure customer churn?
Medium
136.What is customer segmentation?
Easy
137.How do you calculate ROI?
Easy
138.What business metrics are most important to track?
Easy
139.How do you prioritize analytical projects?
Medium
140.Give an example of how your analysis impacted business decisions.
Medium
141.What is funnel analysis?
Medium
142.How do you measure customer lifetime value (CLV)?
Medium
143.What is cohort analysis?
Medium
144.How do you build a business case for your recommendations?
Hard
145.What is RFM analysis?
Hard
146.How do you measure marketing campaign effectiveness?
Medium
147.What is the difference between leading and lagging indicators?
Medium
148.How do you handle stakeholder disagreements about data interpretation?
Hard
149.What is revenue forecasting?
Hard
150.How do you measure product performance?
Medium
151.What is the data analysis process you follow?
Easy
152.What is the difference between structured and unstructured data?
Easy
153.What is data warehousing?
Medium
154.Explain OLAP vs OLTP.
Medium
155.What is big data and have you worked with it?
Medium
156.What is data modeling?
Medium
157.What is the difference between quantitative and qualitative data?
Easy
158.What are data pipelines?
Medium
159.What is data governance?
Hard
160.Explain dimensional modeling.
Hard
161.What are the star schema and the snowflake schema?
Hard
162.What is data quality and how do you measure it?
Medium
163.What is the difference between batch processing and real-time processing?
Medium
164.What are fact and dimension tables?
Medium
165.What is data lineage?
Hard
166.What is the difference between a data lake and a data warehouse?
Medium
167.What is master data management?
Hard
168.What is metadata?
Easy
169.What are the challenges of working with big data?
Medium
170.What is data migration?
Medium
171.What is the difference between supervised and unsupervised learning?
Medium
172.What are overfitting and underfitting?
Medium
173.What is cross-validation?
Hard
174.Explain the bias-variance tradeoff.
Hard
175.What is feature selection and why is it important?
Medium
176.What is clustering? Name some clustering algorithms.
Medium
177.What is the difference between classification and regression?
Easy
178.What is a confusion matrix?
Medium
179.What is the difference between precision and recall?
Medium
180.What is k-means clustering?
Medium
181.What BI tools have you worked with?
Easy
182.Have you used Google Analytics? Explain key metrics.
Medium
183.What are Git and version control?
Easy
184.What cloud platforms have you used (AWS, Azure, GCP)?
Medium
185.What is Jupyter Notebook?
Easy
186.Have you worked with API integrations?
Medium
187.What is Apache Spark?
Hard
188.What is Hadoop?
Hard
189.What database management systems have you used?
Easy
190.What is the difference between SQL and NoSQL databases?
Medium
191.How would you analyze a sudden drop in user engagement?
Hard
192.A stakeholder questions your data findings. How do you respond?
Medium
193.You're given a dataset with 80% missing values. What do you do?
Hard
194.How would you design a dashboard for executive leadership?
Medium
195.Your SQL query is running too slowly. How do you troubleshoot?
Hard
196.Two data sources show conflicting numbers. How do you resolve this?
Hard
197.You need to present complex findings in 5 minutes. What's your approach?
Medium
198.How would you analyze the success of a new product launch?
Medium
199.You discover an error in a report you sent last week. What do you do?
Medium
200.How would you identify factors driving customer churn?
Hard
201.You're asked to analyze data you don't have access to. How do you proceed?
Easy
202.How would you measure the impact of a price change?
Hard
203.Your analysis shows a result opposite to what the team expected. How do you handle it?
Medium
204.How would you automate a repetitive reporting task?
Medium
205.You have one week to learn a new tool for a project. How do you approach it?
Easy
206.How would you validate data from a third-party vendor?
Medium
207.Your team disagrees on which metric to focus on. How do you decide?
Medium
208.How would you analyze seasonal trends in sales data?
Hard
209.You're asked to reduce report generation time by 50%. What's your strategy?
Hard
210.How would you handle a request to manipulate data to show desired results?
Hard
211.Describe a challenging data project you worked on.
Medium
212.How do you ensure data accuracy?
Easy
213.Tell me about a time you found an insight that changed business strategy.
Medium
214.How do you handle tight deadlines?
Easy
215.What do you do when you don't know the answer to a data question?
Easy
216.How do you stay updated with data analysis trends?
Easy
217.Describe your experience working with cross-functional teams.
Medium
218.How do you handle conflicting requirements from stakeholders?
Medium
219.What's your approach to learning new tools or technologies?
Easy
220.Why do you want to be a Data Analyst?
Easy
221.Tell me about a time you made a mistake in your analysis.
Medium
222.How do you manage multiple projects simultaneously?
Easy
223.Describe a time you had to explain technical concepts to a non-technical audience.
Medium
224.What's the most complex dataset you've worked with?
Hard
225.How do you handle feedback and criticism on your work?