50 Gyan Data Interview Questions and Answers | HR, Technical & Aptitude Prep

Data Analytics

Ques:- What is the difference between correlation and causation

Asked In :- Unyscape Infocom Pvt. Ltd., Vinove Software & Services Pvt Ltd, AnAr Solutions, STIC SOFT E-SOLUTIONS, Toxsl Technologies, Chegg India, Aranca (Mumbai), iROID Technologies, TNQ Technologies, MatchMove India,

Right Answer:
Correlation is a statistical measure that indicates the extent to which two variables fluctuate together, while causation implies that one variable directly affects or causes a change in another variable.

Data Analytics

Ques:- How do you handle missing data in a dataset

Asked In :- Rock Solid Solutions, Protege Solutions, Ziffity Solutions, Toxsl Technologies, Cybage Software, WFM, Oodles Technologies, Sun Dew Solutions, Startup - Navya Network, LenDenClub,

Right Answer:
To handle missing data in a dataset, you can use the following methods:

1. **Remove Rows/Columns**: Delete rows or columns with missing values if they are not significant.
2. **Imputation**: Fill in missing values using techniques like mean, median, mode, or more advanced methods like KNN or regression.
3. **Flagging**: Create a new column to indicate missing values for analysis.
4. **Predictive Modeling**: Use algorithms to predict and fill in missing values based on other data.
5. **Leave as Is**: In some cases, you may choose to leave missing values if they are meaningful for analysis.

Data Analytics

Ques:- What is clustering in data analysis and how is it different from classification

Asked In :- Medvarsity Online, AnAr Solutions, MattsenKumar Services, Protege Solutions, Shipco IT, Chegg India, Fortunesoft IT Innovations, GOQii Technologies, Spadeworx Software Services, Dhruvsoft Services,

Right Answer:
Clustering in data analysis is the process of grouping similar data points together based on their characteristics, without prior labels. It is an unsupervised learning technique. In contrast, classification involves assigning predefined labels to data points based on their features, using a supervised learning approach.

Data Analytics

Ques:- What are the different types of data distributions

Asked In :- Medvarsity Online, Queppelin Technology Solutions, Shipco IT, Chegg India, Itobuz Technologies, Fortunesoft IT Innovations, RANDSTAD INDIA PVT, iROID Technologies, TRICON INFOTECH PVT, Razorpay-Startup,

Right Answer:
The different types of data distributions include:

1. Normal Distribution
2. Binomial Distribution
3. Poisson Distribution
4. Uniform Distribution
5. Exponential Distribution
6. Log-Normal Distribution
7. Geometric Distribution
8. Beta Distribution
9. Chi-Squared Distribution
10. Student's t-Distribution

Data Analytics

Ques:- What is the purpose of feature engineering in data analysis

Asked In :- KRIOS Info Solutions, WSNE Consulting, AnAr Solutions, Queppelin Technology Solutions, Rock Solid Solutions, Ziffity Solutions, Aranca (Mumbai), Solace Infotech, Born Commerce, GOQii Technologies,

Right Answer:
The purpose of feature engineering in data analysis is to create, modify, or select variables (features) that improve the performance of machine learning models by making the data more relevant and informative for the analysis.

Data Interpretation

Ques:- What is the role of data trends and patterns in data interpretation

Asked In :- IRIS KPO RESOURCING INDIA PVT, Blackbuck Insights, DSVV, Health Inspector, LGB, LPO, TTN, Commonwealth Bank of Australia, OfBusiness, Rapid Global Business Solutions (RGBSI),

Right Answer:

Trends and patterns in data help you see the bigger picture. They show how values change over time, how different variables are connected, and what behaviors or outcomes are repeating. Spotting trends and patterns makes raw numbers meaningful — and helps you make smarter decisions.

—

🔍 Why Trends and Patterns Matter in Data Interpretation:

1. Reveal What’s Changing
Trends show the direction of data over time — whether it’s going up, down, or staying stable.
✅ Example: An increasing sales trend signals business growth.

2. Help Predict Future Outcomes
If a pattern keeps repeating, you can often use it to forecast what’s likely to happen next.
✅ Example: If customer visits always drop in August, you can plan ahead.

3. Identify Relationships
Patterns show how two variables may be connected.
✅ Example: If higher website traffic always leads to more sales, you’ve found a useful link.

4. Spot Problems or Opportunities
Unexpected changes or breaks in a trend can signal issues — or reveal new chances for improvement.
✅ Example: A sudden drop in customer satisfaction may alert you to a service issue.

5. Support Data-Driven Decisions
Trends and patterns turn raw data into actionable insights, helping teams make informed choices backed by evidence.

Data Interpretation

Ques:- What is a pie chart and how do you extract insights from it

Asked In :- SpotOn Software, IRIS KPO RESOURCING INDIA PVT, MDINDIA HEALTH INSURANCE TPA PVT, Sai Life Sciences, AB InBev, BOM, BPSC, CTPS, Estuate, Fintellix,

Right Answer:

A pie chart is a circular graph used to show how a whole is divided into different parts. Each “slice” of the pie represents a category, and its size reflects that category’s proportion or percentage of the total.

It’s one of the simplest and most visual ways to display data — especially when comparing parts of a whole.

—

🎯 Key Features of a Pie Chart:

The entire circle represents 100% of the data.
Each slice represents a specific category or group.
Larger slices mean higher values or proportions.
Often color-coded and labeled for clarity.

—

🔍 How to Extract Insights from a Pie Chart:

1. Read the Title & Labels
Start by understanding what the chart is showing — it could be market share, survey responses, budget breakdowns, etc.

2. Look at Slice Sizes
Compare slice sizes to see which categories are biggest or smallest.
The largest slice shows the most dominant group.

3. Check Percentages or Values
If percentages or numbers are given, use them to understand how much each slice contributes to the whole.

4. Group Related Slices (if needed)
Sometimes combining smaller slices can help identify trends (e.g., combining all “Other” categories).

5. Ask Questions Like:
- Which category has the largest share?
- Are any categories equal in size?
- How balanced is the distribution?

Data Interpretation

Ques:- How do you interpret data presented in tables, charts, and graphs

Asked In :- Radicle Software, Webvillee Technology, Tredence Analytics Solutions, MedGenome Labs, MDINDIA HEALTH INSURANCE TPA PVT, CTPS, JSL, LPO, Mother Dairy, WBPSC West Bengal Public Service Commission,

Right Answer:

Interpreting data from tables, charts, and graphs means turning visual information into insights. It involves understanding what’s being shown, comparing values, identifying patterns or trends, and drawing conclusions based on the visual representation.

Each format serves a unique purpose:

🔹 Tables
Tables present exact data in rows and columns. Focus on headers to know what each row and column means, and scan the data to find highs, lows, and patterns.

🔹 Charts & Graphs
Visual tools like bar charts, line graphs, pie charts, and scatter plots help you quickly compare values, track changes over time, or understand relationships between variables.

Key tips:

Read titles, labels, and legends carefully
Look for trends (increasing, decreasing, steady)
Compare heights, lengths, or angles visually
Watch for anomalies or outliers

Data Interpretation

Ques:- How do you deal with incomplete or missing data when interpreting results

Asked In :- IRIS KPO RESOURCING INDIA PVT, DecisionTree Analytics and Services, CONDUENT BUSINESS SERVICES INDIA LLP, Sai Life Sciences, Borderless Access, AB InBev, Bapco Bahrain Petroleum Company, CTPS, DET, Health Inspector,

Right Answer:

Incomplete or missing data is a common challenge in data analysis. Whether it’s skipped survey responses, blank spreadsheet cells, or unavailable values, missing data can affect the accuracy and reliability of your results.

The key is to handle missing data thoughtfully so you can still draw valid conclusions without misleading your interpretation.

—

🔍 Common Ways to Handle Missing Data:

1. Identify the Missing Data
Start by locating where and how much data is missing.
Check: Is it random or following a pattern? Are entire sections missing or just a few values?

2. Remove Incomplete Entries (if appropriate)
If only a small number of rows are missing data, and they don’t heavily impact the dataset, you can safely remove them.

3. Use Imputation (Estimate Missing Values)
If the dataset is large and important, you can fill in missing values using methods like:
– Mean or median substitution (for numerical data)
– Mode (for categorical data)
– Regression or predictive models (for more advanced cases)

4. Use Available Data Only
In some cases, you can perform analysis using just the complete parts of the dataset — as long as it doesn’t bias your results.

5. Flag and Acknowledge Missing Data
Be transparent in reports. Clearly mention how much data is missing and how it was handled.

6. Ask Why the Data Is Missing
Sometimes missing data reveals a deeper issue (e.g., system errors, survey confusion). Understanding the cause can help prevent future problems.

Explanation:

Imagine you’re analyzing survey responses from 1,000 people, but 100 skipped the income question.

Option 1: Exclude those 100 responses if income is critical to your analysis.
Option 2: If income correlates with other known answers (like job title), estimate it using average values for each group.

Data Interpretation

Ques:- How do you interpret data in line graphs and bar charts

Asked In :- Enovate IT Outsourcing, DecisionTree Analytics and Services, Proziod Analytics, Bapco Bahrain Petroleum Company, Buzzworks Business Services, CTPS, DSVV, JSL, LGB, TTN,

Right Answer:

Line graphs and bar charts are two of the most common tools used to visualize and interpret data. Both help you identify trends, make comparisons, and draw conclusions, but they are used in slightly different ways.

—

📈 Interpreting Line Graphs:

A line graph shows how data changes over time. It connects data points with lines, making it easy to spot trends or patterns.

How to interpret:

Read the title and axis labels (x-axis usually shows time; y-axis shows value).
Look for upward or downward trends (is the line rising, falling, or flat?).
Identify peaks (high points) and dips (low points).
Note sudden changes — sharp rises or drops can indicate important events.

✅ Example:

A line graph showing monthly sales over a year:

If the line steadily rises from January to December, it means sales are increasing.
A sharp drop in August might indicate a seasonal slowdown.

—

📊 Interpreting Bar Charts:

A bar chart compares values across categories using rectangular bars. The height or length of each bar represents the size of the value.

How to interpret:

Check the axis labels to understand what each bar represents.
Compare the heights of the bars — taller bars mean higher values.
Look for patterns (e.g., which category performs best or worst).
Grouped or stacked bar charts allow comparisons within sub-categories.

✅ Example:

A bar chart comparing product sales:

If Product A’s bar is twice as tall as Product B’s, it means Product A sold twice as much.
If all bars are similar, sales are evenly distributed across products.

SQL Server

Ques:- What is User Defined Functions?

Asked In :- Velocity Software Solutions, TVISHA TECHNOLOGIES PVT, Mundrisoft Solutions LLP, The Unified Cloud, Circle Internet Financial, SureSoft, Data Patterns, Ojas Innovative Technologies, Simeio, tringapps,

SQL Server

Ques:- What are the authentication modes in SQL Server?

Asked In :- Energytech Global, Happiest Minds Technologies Pvt., Grail Insights, ZoomRx, Avizva Solutions, Navyug Infosolutions, HealthSignz, Celebal, Data Marshall, Sarathi Softech,

Data Warehouse Developer SQL Server

Ques:- What does the Queue Reader Agent do in SQL Server 2005 replication?

Asked In :- LiteBreeze Infotech, Principal Global Services, NSOFT SOLUTIONS, Hexaview Technologies private, Fulcrum Logic (I), Aris Global Software, Goldstone Technologies, District Cooperative Central Bank DCCB, ATMECS, Shiash Info Solutions,

Right Answer:
The Queue Reader Agent in SQL Server 2005 replication is responsible for reading messages from the distribution queue and applying them to the subscriber databases in a transactional replication setup.

Data Warehouse Developer SQL Server

Ques:- What is the recommended way to send mail from SQLAgent in SQL Server 2005?

Asked In :- Sequretek IT Solutions, Infomats Technologies, Maruti Techlabs, ETEA, Testbook, Salesforce Marketing Cloud, IDP Education, TekLink International, Exusia, Isoft Software Technologies,

SQL Server

Ques:- When do we use the UPDATE_STATISTICS command?

Asked In :- CAP Digisoft Solutions, de facto infotech, Network Labs, QSpiders, Tecnotree, Azuga, OfBusiness, London Stock Exchange Group, MedSpecialized, CETAS Information Technology,

Django Python

Ques:- How to set and unset session in django ?

Asked In :- Recodem, Principal Global Services, Tredence Analytics Solutions, Pinnacleworks Infotech, Yalamanchili Software, American Megatrends Inc, Intelizign, BeeHyv Software Solutions, Savantis, Cron-J,

Right Answer:
To set a session in Django, you can use:

```python
request.session['key'] = 'value'
```

To unset (delete) a session, you can use:

```python
del request.session['key']
```

To clear all session data, use:

```python
request.session.flush()
```

Django Python

Ques:- List type of inheritance supported in django?

Asked In :- Target Integration, ProV Infotech Systems, Avizva Solutions, Knot Solutions, FlyNava Technologies, Quick Release, Cittabase Solutions, DAG, ncs group, quinnox,

Right Answer:
Django supports the following types of inheritance:

1. **Abstract Base Classes**: Allows you to create a base class that other models can inherit from, without creating a separate database table for the base class.
2. **Multi-table Inheritance**: Each model has its own database table, and Django creates a one-to-one relationship between the parent and child models.
3. **Proxy Models**: Allows you to create a new model class that behaves like the original model but can have different behaviors or methods, without creating a new database table.

Python

Ques:- What is a negative index in python?

Asked In :- Digital Nirvana Information Systems (I), TetraSoft, Azuga, ThinkBridge, Addverb Technologies, Enthuons, Netcon Technologies, Provakil Technologies, MSys Training, micron technology,

Comments

Admin May 17, 2020

Python arrays & list items can be accessed with positive or negative numbers (also known as index).
For instance our array/list is of size n, then for positive index 0 is the first index, 1 second, last index will be n-1. For negative index, -n is the first index, -(n-1) second, last negative index will be â€“ 1.
A negative index accesses elements from the end of the list counting backwards.
An example to show negative index in python
>>> import array
>>> a= [1, 2, 3]
>>> print a[-3]
1
>>> print a[-2]
2
>>> print a[-1]
3

Python Django

Ques:- have you customized admin style?

Asked In :- Khoj Information Technology, PRAN-RFL, Paktolus, Nimbus Systems, Spire Global, AcuityAds, Peocit, CoolMinds Technologies, Accucia, sse,

Right Answer:
Yes, I have customized the Django admin style by using custom CSS and JavaScript files, and by overriding admin templates to change the appearance and layout of the admin interface.

Python Software Developer/ Programmer

Ques:- Do you know the difference between range and xrange?

Asked In :- BillDesk, Talent Smart Soft Solutions (OPC), Medha Servo Drives, DSM SOFT, ThinkBridge, Yagna iQ, Eoxys Systems, 6d technologies, scalefocus, aleph-labs,

Find Interview Questions for Gyan Data

What makes Takluu valuable for interview preparation?

Get Our Mobile App

Programming

Reasoning

Network & Telecom

Management

What makes Takluu valuable for interview preparation?