banner



Which Of The Following Statements Is True About Unsupervised Data Mining

Data Mining MCQ

This section of interview questions and answers focuses on "Data Mining". One tin practice these interview questions to improve their concepts needed for various interviews (campus interviews, walk-in interviews, and company interviews).

1) Which of the post-obit refers to the problem of finding bathetic patterns (or structures) in the unlabeled data?

  1. Supervised learning
  2. Unsupervised learning
  3. Hybrid learning
  4. Reinforcement learning

Respond: b

Explanation: Unsupervised learning is a type of automobile learning algorithm that is generally used to observe the hidden structured and patterns in the given unlabeled data.


2) Which i of the following refers to querying the unstructured textual data?

  1. Information admission
  2. Information update
  3. Data retrieval
  4. Information manipulation

Reply: c

Explanation: Data retrieval refers to querying the unstructured textual data. We can also understand data retrieval as an activity (or process) in which the tasks of obtaining data from system recourses that are relevant to the information required from the huge source of information.


iii) Which of the post-obit can be considered as the correct process of Data Mining?

  1. Infrastructure, Exploration, Assay, Interpretation, Exploitation
  2. Exploration, Infrastructure, Analysis, Interpretation, Exploitation
  3. Exploration, Infrastructure, Interpretation, Analysis, Exploitation
  4. Exploration, Infrastructure, Analysis, Exploitation, Interpretation

Answer: a

Caption: The process of data mining contains many sub-processes in a specific guild. The correct lodge in which all sub-processes of data mining executes is Infrastructure, Exploration, Analysis, Interpretation, and Exploitation.


4) Which of the following is an essential process in which the intelligent methods are practical to extract information patterns?

  1. Warehousing
  2. Data Mining
  3. Text Mining
  4. Information Selection

Answer: b

Explanation: Data mining is a type of procedure in which several intelligent methods are used to excerpt meaningful data from the huge drove ( or set) of information.


v) What is KDD in data mining?

  1. Noesis Discovery Database
  2. Knowledge Discovery Data
  3. Noesis Data definition
  4. Knowledge data house

Answer: a

Explanation: The term KDD or Noesis Discovery Database is refers to a broad procedure of discovering the noesis in the data and emphasizes the high-level applications of specific Data Mining techniques likewise.


half-dozen) The adaptive organization management refers to:

  1. Science of making machine performs the task that would require intelligence when performed by humans.
  2. A computational procedure that takes some values as input and produces some values as the output.
  3. It uses machine learning techniques, in which programs acquire from their past experience and accommodate themself to new weather condition or situations.
  4. All of the above.

Answer: c

Explanation: More often than not, adaptive arrangement management refers to using machine learning techniques. In which the programs acquire from their by experience and adapt themselves for new weather condition and events.


7) For what purpose, the analysis tools pre-compute the summaries of the huge amount of information?

  1. In gild to maintain consistency
  2. For authentication
  3. For information admission
  4. To obtain the queries response

Reply: d

Explanation:

Whenever a query is fired, the response of the query would exist put very earlier. So, for the query response, the assay tools pre-compute the summaries of the huge amount of data. To sympathize it in more than details, consider the following example:

Suppose that to go some information about something, you write a keyword in Google search. Google's analytical tools will and then pre-compute big amounts of information to provide a quick output related to the keywords you accept written.


viii) What are the functions of Information Mining?

  1. Association and correctional assay classification
  2. Prediction and characterization
  3. Cluster analysis and Evolution analysis
  4. All of the higher up

Respond: d

Explanation: In information mining, there are several functionalities used for performing the different types of tasks. The mutual functionalities used in information mining are cluster analysis, prediction, characterization, and development. Withal, the clan and correctional analysis classification are as well one of the of import functionalities of data mining.


9) In the following given diagram, which type of clustering is used?

Data Mining MCQ
  1. Hierarchal
  2. Naive Bayes
  3. Partitional
  4. None of the to a higher place

Answer: a

Explanation: In the above-given diagram, the hierarchal type of clustering is used. The hierarchal blazon of clustering categorizes data through a variety of scales by making a cluster tree. So the right answer is A.


10) Which of the following statements is wrong about the hierarchal clustering?

  1. The hierarchal type of clustering is also known as the HCA
  2. The option of an appropriate metric can influence the shape of the cluster
  3. In general, the splits and merges both are adamant in a greedy mode
  4. All of the above

Answer: a

Explanation: All post-obit statements given in the above question are incorrect, so the correct answer is D.


11) Which one of the following can be considered equally the final output of the hierarchal type of clustering?

  1. A tree which displays how the shut thing are to each other
  2. Assignment of each bespeak to clusters
  3. Finalize estimation of cluster centroids
  4. None of the in a higher place

Answer: a

Caption: The hierarchal type of clustering can exist referred to equally the agglomerative arroyo.


12) Which i of the following statements almost the K-means clustering is incorrect?

  1. The goal of the g-means clustering is to partition (north) observation into (k) clusters
  2. K-means clustering can be defined as the method of quantization
  3. The nearest neighbor is the same as the Grand-means
  4. All of the above

Reply: c

Caption: There is nothing to deal in between the k-means and the G- means the nearest neighbor.


13) Which of the following statements about hierarchal clustering is incorrect?

  1. The hierarchal clustering tin primarily exist used for the aim of exploration
  2. The hierarchal clustering should not be primarily used for the aim of exploration
  3. Both A and B
  4. None of the above

Answer: a

Caption: The hierarchical clustering technique can be used for exploration considering it is the deterministic technique of clustering.


xiv) Which one of the clustering technique needs the merging approach?

  1. Partitioned
  2. Naïve Bayes
  3. Hierarchical
  4. Both A and C

Reply: c

Explanation: The hierarchal type of clustering is one of the most commonly used methods to analyze social network data. In this type of clustering method, multiple nodes are compared with each other on the basis of their similarities and several larger groups' are formed by merging the nodes or groups of nodes that have similar characteristics.


fifteen) The cocky-organizing maps can as well be considered as the example of _________ type of learning.

  1. Supervised learning
  2. Unsupervised learning
  3. Missing information imputation
  4. Both A & C

Reply: b

Explanation: The Self Organizing Map (SOM), or the Cocky Organizing Feature Map is a kind of Artificial Neural Network which is trained through unsupervised learning.


sixteen) The following given argument tin be considered as the examples of_________

Suppose i wants to predict the number of newborns according to the size of storks' population by performing supervised learning

  1. Structural equation modeling
  2. Clustering
  3. Regression
  4. Classification

Answer: c

Caption: The above-given statement can be considered as an example of regression. Therefore the correct answer is C.


17) In the example predicting the number of newborns, the last number of total newborns tin can exist considered as the _________

  1. Features
  2. Observation
  3. Attribute
  4. Upshot

Answer: d

Explanation: In the example of predicting the total number of newborns, the result will be represented as the outcome. Therefore, the full number of newborns will be constitute in the outcome or addressed by the consequence.


18) Which of the following statement is true virtually the classification?

  1. Information technology is a measure of accuracy
  2. It is a subdivision of a set
  3. Information technology is the task of assigning a nomenclature
  4. None of the above

Answer: b

Explanation: The term "classification" refers to the nomenclature of the given information into certain sub-classes or groups according to their similarities or on the ground of the specific given fix of rules.


xix) Which of the following statements is right virtually information mining?

  1. It can exist referred to as the procedure of mining knowledge from data
  2. Data mining can be defined every bit the procedure of extracting information from a gear up of the information
  3. The procedure of data mining also involves several other processes like information cleaning, data transformation, and data integration
  4. All of the above

Answer: d

Caption: The term information mining can exist defined as the procedure of extracting information from the massive collection of data. In other words, we can also say that data mining is the process of mining useful cognition from a huge set up of information.


20) In information mining, how many categories of functions are included?

  1. 5
  2. 4
  3. two
  4. 3

Answer: c

Caption: In that location are simply ii categories of functions included in data mining: Descriptive, Classification and Prediction. Therefore the right answer is C.


21) Which of the following can be considered as the classification or mapping of a ready or class with some predefined group or classes?

  1. Information set
  2. Data Characterization
  3. Data Sub Construction
  4. Data Discrimination

Reply: d

Caption: The discrimination refers to the mapping (or classification) of a course with some predefined groups or classes. And so the correct answer is D.


22) The analysis performed to uncover the interesting statistical correlation between associated -attributes value pairs are known equally the _______.

  1. Mining of association
  2. Mining of correlation
  3. Mining of clusters
  4. All of the above

Answer: b

Caption: Mining of correlation refers to the additional assay performed for uncovering the interesting statistical correlation in between associated-attribute-value pairs.


23) Which ane of the following tin can exist defined as the information object which does non comply with the full general behavior (or the model of available information)?

  1. Evaluation Analysis
  2. Outliner Analysis
  3. Nomenclature
  4. Prediction

Respond: b

Explanation: Information technology may be defined as the object that doesn't comply with the general behavior or with the model of available data.


24) Which ane of the post-obit statements is not correct virtually the information cleaning?

  1. Information technology refers to the process of data cleaning
  2. Information technology refers to the transformation of incorrect data into correct data
  3. It refers to correcting inconsistent information
  4. All of the above

Answer: d

Explanation: Information cleaning is a kind of process that is practical to data set to remove the racket from the information (or noisy data), inconsistent data from the given data. It too involves the process of transformation where wrong data is transformed into the right data too. In other words, we tin can likewise say that data cleaning is a kind of pre-process in which the given set of data is prepared for the information warehouse.


25) The classification of the data mining system involves:

  1. Database technology
  2. Information Scientific discipline
  3. Car learning
  4. All of the above

Answer: d

Explanation: Generally, the classification of a information mining organization depends on the following criteria: Database engineering, machine learning, visualization, informatics, and several other disciplines.


26) In order to integrate heterogeneous databases, how many types of approaches are there in the data warehousing?

  1. 3
  2. 4
  3. v
  4. ii

Answer: d

Explanation: In full general, data warehousing consist of data integration, data cleaning, and data consolidations. Therefore to integrate heterogeneous databases, in that location are two approaches that are update-driven approach and the query-driven approach. So the correct answer is D.


27) The issues similar efficiency, scalability of data mining algorithms comes under_______

  1. Operation problems
  2. Various data type bug
  3. Mining methodology and user interaction
  4. All of the above

Answer: a

Caption: In society to excerpt information effectively from a huge collection of data in databases, the data mining algorithm must be efficient and scalable. Therefore the correct respond is A.


28) Which of the following is the correct advantage of the Update-Driven Approach?

  1. This approach provides high performance.
  2. The data tin be copied, processed, integrated, annotated, summarized and restructured in the semantic information store in accelerate.
  3. Both A and B
  4. None of the in a higher place

Reply: c

Explanation: The statements given in both A and B are the reward of the Update-Driven Approach in Data Warehousing. So the right respond is C.


29) Which of the following statements about the query tools is correct?

  1. Tools developed to query the database
  2. Attributes of a database tabular array that tin can take just numerical values
  3. Both and B
  4. None of the above

Answer: a

Explanation: The query tools are used to query the database. Or we tin can also say that these tools are generally used to get only the necessary information from the entire database.


30) Which one of the post-obit correctly defines the term cluster?

  1. Group of similar objects that differ significantly from other objects
  2. Symbolic representation of facts or ideas from which data tin potentially exist extracted
  3. Operations on a database to transform or simplify data in gild to ready it for a auto-learning algorithm
  4. All of the above

Answer: a

Explanation: The term "cluster" refers to the set of similar objects or items that differ significantly from the other bachelor objects. In other words, we can understand clusters as making groups of objects that incorporate similar characteristics form all available objects. Therefore the correct answer is A.


31) Which 1 of the post-obit refers to the binary aspect?

  1. This takes only two values. In general, these values will be 0 and 1, and they can exist coded equally 1 fleck
  2. The natural environs of a certain species
  3. Systems that can exist used without knowledge of internal operations
  4. All of the in a higher place

Respond: a

Explanation: In general, the binary attribute takes only 2 types of values, that are 0 and 1and these values tin be coded as one bit. And so the correct answer volition be A.


32) Which of the following correctly refers the data selection?

  1. A subject-oriented integrated time-variant non-volatile collection of information in support of management
  2. The actual discovery phase of a knowledge discovery process
  3. The stage of selecting the right information for a KDD process
  4. All of the above

Answer: c

Explanation: Information selection can be defined every bit the stage in which the correct information is selected for the stage of a knowledge discovery process (or KKD process). Therefore the correct reply C.


33) Which one of the following correctly refers to the task of the classification?

  1. A measure of the accuracy, of the classification of a concept that is given by a sure theory
  2. The task of assigning a classification to a set of examples
  3. A subdivision of a set of examples into a number of classes
  4. None of the to a higher place

Reply: b

Explanation: The task of nomenclature refers to dividing the set into subsets or in the numbers of the classes. Therefore the correct answer is C.


34) Which of the post-obit correctly defines the term "Hybrid"?

  1. Approach to the design of learning algorithms that is structured forth the lines of the theory of development.
  2. Decision support systems that contain an information base filled with the knowledge of an skilful formulated in terms of if-then rules.
  3. Combining different types of method or data
  4. None of these

Answer: c

Caption: The term "hybrid" refers to merging two objects and forms individual object that contains features of the combined objects.


35) Which of the following correctly defines the term "Discovery"?

  1. It is hidden within a database and can only be recovered if one is given certain clues (an instance IS encrypted information).
  2. An extremely complex molecule that occurs in human chromosomes and that carries genetic information in the form of genes.
  3. Information technology is a kind of process of executing implicit, previously unknown and potentially useful information from data
  4. None of the above

Answer: c

Explanation: The term "discovery" means to find something new that has not yet been discovered. It can also be interpreted equally a process of executing underlying, previously unknown and potentially useful information from data.


36) Euclidean distance measure is can also defined as ___________

  1. The process of finding a solution for a problem simply past enumerating all possible solutions co-ordinate to some predefined order and then testing them
  2. The distance between two points as calculated using the Pythagoras theorem
  3. A stage of the KDD procedure in which new information is added to the existing pick.
  4. All of the above

Answer: c

Caption: Euclidean distance measure tin can exist defined as the calculating distance betwixt two points in either in-plane or three-dimensional infinite measures the length of the segments connecting two points. Information technology can also define as the distance between 2 points as calculated using the Pythagoras theorem.


37) Which i of the following can be considered as the correct application of the data mining?

  1. Fraud detection
  2. Corporate Analysis & Risk management
  3. Management and market assay
  4. All of the above

Answer: d

Caption: Information mining is highly useful in a variety of areas such as fraud detection, corporate analysis, and gamble management, and market analysis, etc., so the correct choice is D.


38) Which one of the post-obit correctly refers to the Form study in the data cauterization?

  1. Final grade
  2. Written report grade
  3. Target class
  4. Both A and C

Answer: c

Caption: In the data cauterization, more often than not, the written report grade refers to the target grade, and the study grade is the class that is under the procedure of summarizing data.


39) Which of the post-obit refers to the sequence of pattern that occurs frequently?

  1. Frequent sub-sequence
  2. Frequent sub-structure
  3. Frequent sub-items
  4. All of the in a higher place

Answer: a

Explanation: In data mining, the frequent sub-sequence refers to a certain sequence of patterns that occurs frequently, for example, buying a camera followed by the retentiveness menu. Then the correct reply will be A.


40) Which one of the following refers to the model regularities or to the objects that trends or not consequent with the change in fourth dimension?

  1. Prediction
  2. Evolution analysis
  3. Nomenclature
  4. Both A and B

Answer: b

Explanation: In general, the evolution assay refers to the model regularities or the object trends that vary with change in time.


41) The problems like "treatment the rational and complex types of information" comes nether which of the following category?

  1. Various Data Type
  2. Mining methodology and user interaction Issues
  3. Performance issues
  4. All of the above

Answer: a

Caption: It is quite often that a database can contain multiple types of data, complex objects, and temporary data, etc., so it is not possible that only ane type of organisation can filter all information. Therefore this type of issue comes under the category Diverse Data type. So the correct respond is A.


42) Which of the post-obit also used as the first step in the knowledge discovery process?

  1. Data selection
  2. Data cleaning
  3. Data transformation
  4. Data integration

Answer: b

Caption: Data cleaning is included as one of the first steps of the cognition discovery process. So the right reply is B.


43) Which of the post-obit refers to the steps of the knowledge discovery process, in which the several data sources are combined?

  1. Information selection
  2. Data cleaning
  3. Data transformation
  4. Data integration

Answer: d

Caption: The step "information integration" of the knowledge discovery process refers to combining several data sources. Therefore the correct answer is D.


44) Which of the following tin can exist considered as the drawback of the query-Driven arroyo in information warehousing?

  1. This approach is expensive for queries that require aggregations
  2. This arroyo is expensive bereft, and very frequent queries
  3. This approach requires a very complex integration and filtering process
  4. All of the above

Answer: d

Explanation: All statements given in the above question are drawbacks of the query-driven approach. Therefore the correct answer is D.


45) Which of the following correctly refers to the term "Data Independence"?

  1. It means that the programs are not dependent on the logical attributes
  2. Information technology refers to that data that is divers separately, non included in the program
  3. It means that the programs are totally dependent on the physical attributes of data
  4. Both A and C

Reply: d

Explanation: The term "Information Independence" refers that the programs are not dependent on the concrete attributes of data and neither on the logical attributes of information.


46) Which of the following is generally used by the Due east-R model to stand for the weak entities?

  1. Diamond
  2. Doubly outlined rectangle
  3. Dotted rectangle
  4. Both B & C

Answer: b

Caption: Generally, the double outline rectangle is used in the East-R model to represent the weak entities.


47) Which one of the following refers to the Blackness Box?

  1. It can be referred as the system that can be used without the knowledge of the internal operations
  2. It referrers the natural environment of the specific species
  3. It takes only two values at near that are 0 and i
  4. All of the above

Reply: a

Caption: Black Box is referred to as the arrangement which takes only two values at near are zero and one.


48) Which one of the following bug must be considered before investing in data mining?

  1. Compatibility
  2. Functionality
  3. Vendor consideration
  4. All of the above

Reply: d

Caption: The mutual merely of import problems like functionality and compatibility must always be discussed before investing in information mining. Therefore the correct answer is D.


49) The term "DMQL" stands for _____

  1. Information Marts Query Language
  2. DBMiner Query Linguistic communication
  3. Data Mining Query Language
  4. None of the above

Answer: c

Explanation: The term "DMQL" refers to the Information Mining Query Language. Therefore the correct reply is C.


50) In certain cases, it is non clear what kind of blueprint demand to find, data mining should_________:

  1. Effort to perform all possible tasks
  2. Perform both predictive and descriptive task
  3. It may allow interaction with the user so that he can guide the mining process
  4. All of the above

Reply: c

Explanation: In some data mining operations where it is non clear what kind of blueprint needed to find, here the user can guide the information mining procedure. Because a user has a good sense of which blazon of design he wants to discover. Then, he can eliminate the discovery of all other non-required patterns and focus the procedure to detect only the required design by setting upwards some rules. Therefore the correct answer is C.


Next Topic #

Which Of The Following Statements Is True About Unsupervised Data Mining,

Source: https://www.javatpoint.com/data-mining-mcq

Posted by: gloverfign1969.blogspot.com

0 Response to "Which Of The Following Statements Is True About Unsupervised Data Mining"

Post a Comment

Iklan Atas Artikel

Iklan Tengah Artikel 1

Iklan Tengah Artikel 2

Iklan Bawah Artikel