55 Streamsets inc. Interview Questions and Answers

Data Analytics

Ques:- What is the difference between supervised and unsupervised learning

Asked In :- Medvarsity Online, KRIOS Info Solutions, AnAr Solutions, Netaxis IT Solutions (p), MattsenKumar Services, Shipco IT, Chegg India, Indocosmo Systems, Born Commerce, Cybage Software,

Right Answer:
Supervised learning uses labeled data to train models, meaning the output is known, while unsupervised learning uses unlabeled data, where the model tries to find patterns or groupings without predefined outcomes.
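
As an illustration only, the contrast can be sketched in plain Python on made-up one-dimensional data. The function names (`train_centroids`, `predict`, `two_means`) and the data are assumptions for this example, not part of any library: the supervised part averages points per known label, while the unsupervised part has to discover two groups by itself.

```python
def train_centroids(points, labels):
    """Supervised: labels are known, so average the points of each class."""
    groups = {}
    for p, label in zip(points, labels):
        groups.setdefault(label, []).append(p)
    return {label: sum(ps) / len(ps) for label, ps in groups.items()}


def predict(centroids, p):
    """Assign a new point to the class whose centroid is nearest."""
    return min(centroids, key=lambda label: abs(p - centroids[label]))


def two_means(points, iterations=10):
    """Unsupervised: no labels, so find two groups by alternating
    assignment and centroid update (assumes at least two distinct values)."""
    c1, c2 = min(points), max(points)
    for _ in range(iterations):
        g1 = [p for p in points if abs(p - c1) <= abs(p - c2)]
        g2 = [p for p in points if abs(p - c1) > abs(p - c2)]
        c1, c2 = sum(g1) / len(g1), sum(g2) / len(g2)
    return sorted(g1), sorted(g2)


# Hypothetical measurements
points = [1.0, 1.2, 0.8, 5.0, 5.3, 4.7]
labels = ["low", "low", "low", "high", "high", "high"]

centroids = train_centroids(points, labels)   # uses the labels
predicted_label = predict(centroids, 4.1)
clusters = two_means(points)                  # ignores the labels
```

In the supervised half the output names ("low", "high") come from the training labels; in the unsupervised half the groups are found but remain unnamed.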

Data Analytics

Ques:- What is regression analysis and when is it used

Asked In :- Xoriant Solutions Pvt Ltd, Hidden Brains InfoTech, STIC SOFT E-SOLUTIONS, Fortunesoft IT Innovations, iROID Technologies, Dhruvsoft Services, Clear Trail Technologies, MatchMove India, Radicle Software, Spectra Medix India,

Right Answer:
Regression analysis is a statistical method used to examine the relationship between one dependent variable and one or more independent variables. It is used to predict outcomes, identify trends, and understand the strength of relationships in data.
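
A minimal sketch of the simplest case, ordinary least squares with one independent variable, is shown below. The helper `fit_line` and the hours/scores data are hypothetical and chosen only to make the arithmetic easy to follow.

```python
def fit_line(xs, ys):
    """Ordinary least squares for a single predictor: y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel out)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: hours studied (independent) vs. exam score (dependent)
hours = [1, 2, 3, 4, 5]
scores = [52, 54, 56, 58, 60]

slope, intercept = fit_line(hours, scores)
predicted_score = slope * 6 + intercept  # prediction for 6 hours of study
```

The slope estimates how much the dependent variable changes per unit of the independent variable, which is the "strength of relationship" the answer refers to.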

Data Analytics

Ques:- What are outliers and how do you handle them in data analysis

Asked In :- Fluid AI, Trigent Software, Itobuz Technologies, RANDSTAD INDIA PVT, GOQii Technologies, Dhruvsoft Services, Radicle Software, Webvillee Technology, Fission Infotech, Noesys Consulting,

Right Answer:
Outliers are data points that significantly differ from the rest of the dataset. They can skew results and affect statistical analyses. To handle outliers, you can:

1. Identify them using methods like the IQR (Interquartile Range) or Z-scores.
2. Remove them if they are errors or irrelevant.
3. Transform them using techniques like log transformation.
4. Use robust statistical methods that are less affected by outliers.
5. Analyze them separately if they provide valuable insights.
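
Step 1 can be sketched with the IQR rule using only Python's standard library. The function name `iqr_outliers`, the multiplier `k=1.5` (a common convention, not a fixed rule), and the sample data are assumptions for illustration.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR below Q1 or above Q3."""
    q1, _, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


# Hypothetical readings with one suspicious value
data = [10, 12, 11, 13, 12, 11, 10, 95]
outliers = iqr_outliers(data)
cleaned = [v for v in data if v not in outliers]
```

Whether to drop, transform, or keep the flagged values still depends on the other steps in the list above.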

Data Analytics

Ques:- What are the different types of data distributions

Asked In :- Medvarsity Online, Queppelin Technology Solutions, Shipco IT, Chegg India, Itobuz Technologies, Fortunesoft IT Innovations, RANDSTAD INDIA PVT, iROID Technologies, TRICON INFOTECH PVT, Razorpay-Startup,

Right Answer:
The different types of data distributions include:

1. Normal Distribution
2. Binomial Distribution
3. Poisson Distribution
4. Uniform Distribution
5. Exponential Distribution
6. Log-Normal Distribution
7. Geometric Distribution
8. Beta Distribution
9. Chi-Squared Distribution
10. Student's t-Distribution

Data Analytics

Ques:- How do you handle missing data in a dataset

Asked In :- Rock Solid Solutions, Protege Solutions, Ziffity Solutions, Toxsl Technologies, Cybage Software, WFM, Oodles Technologies, Sun Dew Solutions, Startup - Navya Network, LenDenClub,

Right Answer:
To handle missing data in a dataset, you can use the following methods:

1. **Remove Rows/Columns**: Delete rows or columns with missing values if they are not significant.
2. **Imputation**: Fill in missing values using techniques like mean, median, mode, or more advanced methods like KNN or regression.
3. **Flagging**: Create a new column to indicate missing values for analysis.
4. **Predictive Modeling**: Use algorithms to predict and fill in missing values based on other data.
5. **Leave as Is**: In some cases, you may choose to leave missing values if they are meaningful for analysis.
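
Imputation and flagging (methods 2 and 3) are often combined. The sketch below assumes missing values are represented as `None`; the function name `impute_median` and the sample ages are hypothetical.

```python
import statistics


def impute_median(values):
    """Fill None entries with the median of the observed values and
    return a parallel list of flags marking which entries were missing."""
    observed = [v for v in values if v is not None]
    median = statistics.median(observed)
    filled = [median if v is None else v for v in values]
    was_missing = [v is None for v in values]
    return filled, was_missing


# Hypothetical column with two missing entries
ages = [25, None, 31, 40, None]
filled_ages, missing_flags = impute_median(ages)
```

Keeping the flag column lets later analysis check whether missingness itself carries information.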

Aem Developer

Ques:- How do you implement custom workflows in AEM

Asked In :- FIS Global Business Solutions India, E2E Networks, IMAGINATION TECHNOLOGIES LIMITED, Barracuda Networks, doodleblue, Aurus Tech, Everi India, MAK Controls & Systems, Cloud Assert, XBRL Ultima,

Right Answer:
To implement custom workflows in AEM, follow these steps:

1. **Create a Workflow Model**: Use the AEM Workflow Model Editor to design your workflow. Define the steps and the order in which they will execute.

2. **Add Workflow Steps**: Include custom workflow processes by creating Java classes that implement the `WorkflowProcess` interface. Implement the `execute` method to define the logic for each step.

3. **Register the Workflow Process**: Register your custom workflow process as an OSGi service so that it can be selected in a Process Step of the workflow model.

4. **Create a Workflow Launcher**: Set up workflow launchers to trigger the workflow based on specific events or conditions, such as content creation or modification.

5. **Test the Workflow**: Deploy your workflow and test it to ensure it functions as expected.

6. **Monitor and Debug**: Use the AEM Workflow console to monitor the execution and debug any issues that arise.

Aem Developer

Ques:- What debugging tools or techniques do you use when troubleshooting issues in AEM

Asked In :- Netaxis IT Solutions (p), Codelogicx Technologies, AroDek, InRhythm Solutions, NIHILENT LIMITED, AXESTRACK SOFTWARE SOLUTIONS, Benchmark IT Solutions, ARM InfoTech, e2e, Comviva,

Right Answer:
I use the following debugging tools and techniques when troubleshooting issues in AEM:

1. **Sling Logging**: Adjust log levels and check logs in the Felix console for errors.
2. **AEM Error Logs**: Review error logs located in the `crx-quickstart/logs` directory.
3. **Sling Resource Resolver**: Use the resource resolver to inspect resource paths and properties.
4. **AEM Web Console**: Utilize the Web Console for OSGi services and configurations.
5. **Browser Developer Tools**: Inspect network requests and console errors in the browser.
6. **Debugging with Eclipse**: Set breakpoints and debug AEM code using an IDE like Eclipse.
7. **AEM Package Manager**: Check for package installation issues or conflicts.
8. **Replication Queue**: Monitor the replication queue for issues with content publishing.

ABAP

Ques:- What is an ABAP data type and how is it declared

Asked In :- Unyscape Infocom Pvt. Ltd., Hidden Brains InfoTech, AnAr Solutions, TNQ Technologies, Cybage Software, Keyideas Infotech, Softcell Technologies, Amrut Software, CropIn Technology Solutions, Aruba Networks,

Right Answer:
An ABAP data type defines the kind of data a variable can hold, such as integer, string, or date. It is declared using the `DATA` statement, for example: `DATA: my_variable TYPE i.` (where `i` stands for integer).

Alfresco

Ques:- How do you migrate content into Alfresco from legacy systems

Asked In :- Toxsl Technologies, Fortunesoft IT Innovations, Sun Dew Solutions, M/s. orange business services, orangescape technologies ltd, Shore Infotech India, Black and White Business Solutions, Cross Country Infotech, Leeway Hertz, Infiniti Software Solutions,

Right Answer:
Use the Alfresco Bulk Import Tool, custom scripts leveraging the Alfresco API, or dedicated migration tools depending on the complexity and volume of content.

ARM Compiler

Ques:- What are startup files and how do they work in an ARM environment

Asked In :- Indocosmo Systems, RLABS ENTERPRISE SERVICES PVT LIMITED, Code Insight Technologies, TechRyde, Schneider Electric India, Riversand Global Technologies, Direction Software Solutions, Digicon Technologies, Enzigma, Worley Parsons,

Right Answer:
Startup files in an ARM environment are assembly or C source files that initialize the system before the main program runs. They typically set up the stack pointer, initialize global variables, and call the main function. These files ensure that the hardware and software environment is correctly configured for the application to run.

API

Ques:- What are the different types of APIs

Asked In :- Vinove Software & Services Pvt Ltd, Object Frontier Software, Hidden Brains InfoTech, AnAr Solutions, Netaxis IT Solutions (p), Walkover Web Solutions, Itobuz Technologies, Codiant Software Technologies, Infinity Labs LLP, Define Labs,

Right Answer:
The different types of APIs are:

1. **Open APIs (Public APIs)** - Available to developers and third parties.
2. **Internal APIs (Private APIs)** - Used within an organization.
3. **Partner APIs** - Shared with specific business partners.
4. **Composite APIs** - Combine multiple endpoints into a single call.
5. **Web APIs** - Accessible over the internet using HTTP/HTTPS.

API

Ques:- What is API documentation and why is it necessary

Asked In :- Vinove Software & Services Pvt Ltd, KRIOS Info Solutions, Queppelin Technology Solutions, Codiant Software Technologies, Born Commerce, Dhruvsoft Services, Oodles Technologies, CakeSoft Technologies, Webvillee Technology, Recodem,

Right Answer:
API documentation is a technical manual that explains how to use an API, including its endpoints, request and response formats, authentication methods, and examples. It is necessary because it helps developers understand how to integrate and interact with the API effectively, ensuring proper usage and reducing errors.

API

Ques:- What are HTTP methods and how are they used in APIs

Asked In :- Xoriant Solutions Pvt Ltd, KRIOS Info Solutions, Itobuz Technologies, Addweb solutions, Solace Infotech, TNQ Technologies, MatchMove India, Sun Technology Integrators, Spectra Medix India, Novalnet e-Solutions,

Right Answer:
HTTP methods are standardized request types used in APIs to perform actions on resources. The main methods are:

1. **GET**: Retrieve data from a server.
2. **POST**: Send data to a server to create a new resource.
3. **PUT**: Update an existing resource on the server by replacing it with the data sent in the request.
4. **DELETE**: Remove a resource from the server.
5. **PATCH**: Apply partial modifications to a resource.

These methods define the action to be performed on the specified resource in the API.
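
To make the mapping from method to action concrete, here is a toy in-memory sketch with no real HTTP server involved. The class `ItemStore`, its `handle` method, and the chosen status codes are assumptions for illustration; real APIs may return different codes or bodies.

```python
class ItemStore:
    """Toy resource collection that dispatches on the HTTP method name."""

    def __init__(self):
        self.items = {}
        self.next_id = 1

    def handle(self, method, item_id=None, body=None):
        if method == "POST":            # create a new resource
            item_id = self.next_id
            self.next_id += 1
            self.items[item_id] = dict(body)
            return 201, {"id": item_id}
        if item_id not in self.items:   # every other method needs an existing resource
            return 404, None
        if method == "GET":             # read
            return 200, self.items[item_id]
        if method == "PUT":             # replace the whole resource
            self.items[item_id] = dict(body)
            return 200, self.items[item_id]
        if method == "PATCH":           # change only the supplied fields
            self.items[item_id].update(body)
            return 200, self.items[item_id]
        if method == "DELETE":          # remove
            del self.items[item_id]
            return 204, None
        return 405, None                # method not supported


store = ItemStore()
created = store.handle("POST", body={"name": "pen", "price": 2})
patched = store.handle("PATCH", item_id=1, body={"price": 3})
replaced = store.handle("PUT", item_id=1, body={"name": "pencil"})
deleted = store.handle("DELETE", item_id=1)
after_delete = store.handle("GET", item_id=1)
```

Note how PATCH keeps the untouched `name` field, while PUT discards anything not present in the new body.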

API

Ques:- What is rate limiting in APIs and how is it implemented

Asked In :- Vinove Software & Services Pvt Ltd, Object Frontier Software, Hidden Brains InfoTech, Netaxis IT Solutions (p), Rock Solid Solutions, Shipco IT, Walkover Web Solutions, Solace Infotech, Infinity Labs LLP, TNQ Technologies,

Right Answer:
Rate limiting in APIs is a technique used to control the number of requests a user can make to an API within a specific time period. It is implemented by setting thresholds (e.g., requests per minute) and using mechanisms like tokens, counters, or IP address tracking to monitor and restrict access when the limit is exceeded.
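
One common token-based mechanism is the token bucket, sketched below under simple assumptions: a single client, a single process, and no persistence. The class name `TokenBucket` and its parameters are hypothetical; the `clock` argument exists only so the behavior can be demonstrated without waiting.

```python
import time


class TokenBucket:
    """Allow up to `capacity` requests in a burst, refilled at a steady rate."""

    def __init__(self, capacity, refill_per_second, clock=time.monotonic):
        self.capacity = capacity
        self.refill_per_second = refill_per_second
        self.clock = clock
        self.tokens = float(capacity)   # start with a full bucket
        self.last = clock()

    def allow(self):
        now = self.clock()
        elapsed = now - self.last
        self.last = now
        # Add tokens earned since the last call, capped at capacity
        self.tokens = min(self.capacity,
                          self.tokens + elapsed * self.refill_per_second)
        if self.tokens >= 1:
            self.tokens -= 1            # spend one token for this request
            return True
        return False                    # limit exceeded, reject the request


# Demonstration with a fake clock so time can be advanced manually
fake_now = [0.0]
bucket = TokenBucket(capacity=2, refill_per_second=1.0, clock=lambda: fake_now[0])

first_three = [bucket.allow(), bucket.allow(), bucket.allow()]
fake_now[0] = 1.0                       # one second later, one token is back
after_wait = bucket.allow()
```

In a real API the rejected request would typically receive an HTTP 429 (Too Many Requests) response, and one bucket would be kept per user, API key, or IP address.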

API

Ques:- What is an API endpoint and how do you define it

Asked In :- AnAr Solutions, Ziffity Solutions, Trigent Software, Codiant Software Technologies, iROID Technologies, Dhruvsoft Services, MatchMove India, Oodles Technologies, Novalnet e-Solutions, Fission Infotech,

Right Answer:
An API endpoint is a specific URL or URI where an API can be accessed by a client to perform operations like retrieving or sending data. It defines the location and method (such as GET, POST) for interacting with the API.

C Software Quality Assurance Engineer SQL Server Testing Manual

Ques:- How do you set reminder mail in outlook?

Asked In :- Priya Softweb Solutions, Orcapod Consulting Services, D-Mart, UCA, Amagi Media, SpringRole, RWE Supply & Trading, FACEIT, Anova Data, CodeQuotient,

Right Answer:
To set a reminder mail in Outlook, create a new email, then click on "Options" in the ribbon. In the "Tags" group, click on "Follow Up" and select "Add Reminder." Set the date and time for the reminder, then click "OK" and send the email.

SQL Server

Ques:- What is Cross Join?

Asked In :- Energytech Global, Ray Business Technologies, Global Step, JSL, Tessolve, ThinkBridge, Innover Systems, Pacific Global Solutions, Cateina Technologies, Sofmen,

sql System Engineer

Ques:- What is meant by SQL wildcard characters in SQL Server?

Asked In :- Evolvus Solutions, IDCube Identification Systems, Circle Internet Financial, Saraca Solutions, Maventic, J K Technosoft, Factspan, Moolya Software Testing, C-Square Info Solutions, Sankey Solutions,

Right Answer:
SQL Wildcard Characters in SQL Server are special symbols used in queries to represent one or more characters in string comparisons. The most common wildcard characters are:

1. `%` - Represents zero or more characters.
2. `_` - Represents a single character.

These are often used with the `LIKE` operator in SQL queries to filter results based on pattern matching.
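
The two wildcards can be tried out quickly with Python's built-in `sqlite3` module. This uses SQLite as a stand-in, on the assumption that `%` and `_` behave the same way with `LIKE` as in SQL Server for these simple patterns; the `employees` table and its rows are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT)")
conn.executemany(
    "INSERT INTO employees VALUES (?)",
    [("Sam",), ("Sara",), ("Tom",), ("Tim",)],
)

# '%' matches zero or more characters: every name starting with "Sa"
starts_with_sa = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM employees WHERE name LIKE 'Sa%' ORDER BY name"
    )
]

# '_' matches exactly one character: "T", any single character, then "m"
t_any_m = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM employees WHERE name LIKE 'T_m' ORDER BY name"
    )
]

conn.close()
```

Case sensitivity of `LIKE` differs between database engines and collations, so results on a real SQL Server instance may vary for mixed-case data.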

Database Architect/ Designer SQL Server

Ques:- Difference between a “where” clause and a “having” clause

Asked In :- INNOART TECHNOLOGIES, Cotelligent India, OCCL, Sarjen Systems, ESAF Small Finance Bank, IDeaS - A SAS Company, bluCursor Technologies, Tripod Technologies, Busy Infotech, Flannels,

sql

Ques:- Why use stored procedures in SQL Server?

Asked In :- Indocosmo Systems, Principal Global Services, McKinsey Knowledge Center, e2e, Kyro, Unified Infotech, Metrics4 Analytics, Travelodge Hotels (UK), Pickrr Technologies, COGNISOFT TECHNOLOGIES,
