1. A data analyst at a book publisher is working on an urgent report for executives. They are using only historical data. What is the most likely reason for choosing to analyze only historical data?
Answers
· The project has a very short time frame
· The data is unknown
· There is plenty of time to research historical data
· The data is constantly changing
Explanation: They're trying to gain insights into past trends and patterns to inform current decision-making. When historical data is analyzed, it may assist in determining what strategies have been successful in the past, what strategies have not been successful, and it can give a foundation for generating educated predictions about the outcomes of future events. It's just like going on a trip through the pages of a book to get the information you need to create the next chapter of your success story!
Answers
· Box office returns
· Movie running time
· Movie budget
· Number of actors in movie
Answers
· Is this your first time dining at this restaurant?
· How many people do you usually dine with?
· How many times have you dined at this restaurant?
· On a scale of 1-10, how would you rate your service today?
Explanation: In nominal qualitative data, categories or labels are used, but there is no intrinsic order to the data.In this scenario, the comments may be organized into categories like "Mystery," "Romance," "Science Fiction," and so on, but there wouldn't necessarily be an order or rating involved.
Answers
· Internal data circumvents privacy restrictions.
· Internal data comes from people you know.
· Internal data has much larger sample sizes.
· Internal data lives within a company’s own systems.
Answers
· True
· False
Explanation: It's not quite that. The majority of the time, the postings made on social networking platforms are instances of unstructured data. Because unstructured data does not have a preset data model and because it often contains a lot of text, it is more difficult to organize and analyze using standard approaches.
On the other hand, structured data is meticulously arranged and prepared, and it most often resides in databases that are distinguished by their distinct categories and connections. The result of the completeness of the completeness of the number of the completeness of the completeness of the completeness of the completeness of the completeness of the completeness of the completeness of thesauced.
Even while a social media post may have some organized components (such timestamps, user IDs, or hashtags), the primary content of the post, which may include text, photographs, or videos, is often unstructured. In order to do analysis on unstructured data and get useful insights from it, more sophisticated methods, such as natural language processing, are often required.
Answers
· three
· 10
· two
· infinite
Explanation: The Boolean data type may take on either the value true or the value false as its potential interpretation. It represents binary logic, where the outcome is either true or false, on or off, 1 or 0.
What kind of data format does it contain?
Answers
· Short
· Wide
· Narrow
· Long
Answers
· True
· False
Explanation: Without a doubt! Converting a file from one format to another, such as changing from .XLS to .CSV, is indeed an example of a data transformation. In this particular scenario, it entails converting a spreadsheet file that is in the Excel format to a CSV format, also known as a Comma-Separated Values format. This is a standard approach to express tabular data in a format that is just plain text. Data transformations similar to this one are often performed for the purpose of facilitating interoperability with a variety of software programs or of preparing data for certain analytical procedures.
9. A data analyst is working on an urgent traffic study. As a result of the short time frame, which type of data are they most likely to use?
Answer
· Theoretical
· Historical
· Personal
· Unclean
Explanation: When doing an urgent research on traffic, the data analyst would most likely utilize data that is either real-time or near-real-time. This type of data provides information that is current and up-to-date, allowing for quick analysis and decision-making. Data collected in real time may come from a variety of sources, such as traffic sensors, GPS devices, or other live sources that provide direct insights on the circumstances of the road at the present moment. This is very necessary in circumstances that call for a rapid reaction or analysis, such as when it is necessary to manage the flow of traffic during peak hours or react to unforeseen occurrences while driving.
Answer
· True
· False
Explanation: In point of fact, nominal qualitative data is denoted by categories or labels and does not include any kind of intrinsic order or scale. It's the lowest level of measurement and doesn't imply any quantitative relationship between the categories. Each category is treated as distinct, and there is no inherent order or ranking among them.
In contrast, ordinal qualitative data does have a meaningful order or scale. It is possible to rank or arrange the categories that are included in ordinal data, but the distinctions that exist between them are not always consistent or observable.
Therefore, nominal data refers to categories that are not arranged in any particular order, while ordinal data includes categories that are arranged in a meaningful manner.
11. Internal data is more reliable because it’s clean.
Answer
· True
· False
Explanation: Internal data is often considered more reliable because organizations have more control over its collection, storage, and management processes. The organization's well-established processes, protocols, and quality control procedures are directly responsible for the high level of cleanliness of its internal data.
Answer
· Audio file
· Digital photo
· Spreadsheet
· Table
Answer
· True
· False
Explanation: It is not necessary for a Boolean data type to have a numerical value at all times. When it comes to programming and the representation of data, a Boolean data type normally represents two different potential values: true and false. Binary logic is typically represented by using these values, where true is generally equivalent to 1, and false is often comparable to 0 in terms of numeric representation.
Answer
· A specific constraint
· A specific data type
· A unique data variable
· A unique format
Explanation: When dealing with large data, it is common practice for each column to stand for a variable or a feature, while each row stands for an individual observation or instance. In contrast to long data, which is organized such that different columns include values together with the context in which they belong, wide data is designed so that each variable has its own column.
Each column represents a distinct variable or feature. For example, if you're dealing with a dataset of students, you might have columns like "Name," "Age," "Grade," etc.
The values for all of the variables and features that are present in a particular observation or case are listed in each row. Therefore, the information that a student's name, age, and grade level may be included in a row of a student dataset.
Answer
· value
· structure
· accuracy
· meaning
Explanation: The structure of the data may be altered by data analysts via the process of data transformation. This procedure includes making adjustments to the structure, organization, or display of data in order to render it more appropriate for analysis, reporting, or particular activities.
Answer
· True
· False
Explanation: In point of fact, continuous data is the reverse of discrete data in that it is measured and may assume an endless number of values within a certain range. Continuous data are capable of being measured with a high degree of accuracy and, in theory, can take on any value that falls within a certain range. Height, weight, temperature, and time are all examples of continuous data. Another example is distance traveled.
On the other hand, discrete data is counted and has a limited number of distinct values. The number of automobiles in a parking lot, the number of pupils in a classroom, and the number of books on a shelf are all examples of discrete data.
Answer
· True or false
· Yes, no, or unsure
· Yes or no
· One, two, or three
Answer
· True
· False
Explanation: Actually, if you have a short time frame and need an immediate answer, historical data might not be the most suitable option. Analyzing historical data, which relates to data from the past, may not give the real-time insights that are necessary for dealing with an urgent crisis.
In a short period of time, it is more probable that you will depend on data sources that are either real-time or near-real-time. These sources are able to supply information that is up to date, which enables one to do analysis and make decisions more quickly. Live sensors, feeds from social media platforms, and other types of technologies that offer real-time information are some examples.
Real-time data are often more pertinent to urgent and time-sensitive circumstances than historical data are. However, historical data are useful for identifying trends and patterns that develop over time.
19. Which of the following is an example of continuous data?
Answer
· Leading actors in movie
· Box office returns
· Movie run time
· Movie budget
Explanation: The measurement of temperature is an excellent illustration of continuous data since, in theory, it may take on an endless number of different values within a certain range. It can be measured with high precision, and there are no distinct, separate values.
The difference between continuous data, which is measured and may have an endless number of values within a defined range, and discrete data, which consists of unique values that are kept separate from one another, is that continuous data are measured.
Answer
· How likely are you to recommend this restaurant to a friend?
· Is this your first time dining at this restaurant?
· Have you heard of our frequent diner program?
· Did anyone recommend our restaurant to you today?
Answer
· True
· False
Explanation: Without a doubt! You have it completely right. Data transformation involves modifying the format, structure, or representation of data to make it more suitable for analysis, reporting, or specific tasks. Converting data from one format to another, such as changing it from a spreadsheet to a CSV file or from a database to a different file type, is a classic example of data transformation. It's like giving your data a makeover to make it more compatible and accessible for different purposes.
22. Which of the following is a benefit of internal data?
Answer
· Internal data is less vulnerable to biased collection.
· Internal data is the only data relevant to the problem.
· Internal data is less likely to need cleaning.
· Internal data is more reliable and easier to collect.
Explanation: Internal data is generated and collected within the organization's own systems and processes. This level of control allows the organization to design and implement standardized data collection methods, ensuring data accuracy, consistency, and reliability. With control over the data generation process, organizations can better manage the quality of their internal data.