Top 50 Apache Spark Interview Questions & Answers by Knowledge Powerhouse
Author: Knowledge Powerhouse
Language: eng
Format: epub
Published: 2017-03-17T07:00:00+00:00
23. What are the two main types of Vector in Spark?
There are two main types of Vector in Spark:
Dense Vector: A dense vector is backed by a single array of doubles that stores every entry, zeros included.
E.g. [1.0, 0.0, 3.0]
Sparse Vector: A sparse vector is backed by two parallel arrays, one holding the indices of the non-zero entries and the other holding their values, together with the size of the vector.
E.g. (3, [0, 2], [1.0, 3.0])
In this representation, the first element is the size of the vector, that is, its total number of entries including zeros. The second element is the array of indices of the non-zero values. The third element is the array of the non-zero values themselves. Both examples describe the same vector (1.0, 0.0, 3.0).
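To make the relationship between the two layouts concrete, here is a minimal sketch in plain Python that converts between them. It does not use Spark; the helper names `to_sparse` and `to_dense` are hypothetical and exist only for illustration. In Spark itself, the corresponding vectors are built with the `Vectors.dense` and `Vectors.sparse` factory methods.

```python
from typing import List, Tuple

# Illustrative only: (size, indices, values), mirroring the sparse layout above.
SparseTriple = Tuple[int, List[int], List[float]]


def to_sparse(dense: List[float]) -> SparseTriple:
    """Hypothetical helper: keep only the non-zero entries of a dense list."""
    indices = [i for i, v in enumerate(dense) if v != 0.0]
    values = [dense[i] for i in indices]
    return len(dense), indices, values


def to_dense(size: int, indices: List[int], values: List[float]) -> List[float]:
    """Hypothetical helper: rebuild the full list, filling the gaps with 0.0."""
    dense = [0.0] * size
    for i, v in zip(indices, values):
        dense[i] = v
    return dense


if __name__ == "__main__":
    dense = [1.0, 0.0, 3.0]
    sparse = to_sparse(dense)
    print(sparse)             # (3, [0, 2], [1.0, 3.0])
    print(to_dense(*sparse))  # [1.0, 0.0, 3.0]
```

The sparse form only pays off when most entries are zero: here it stores a size, two indices, and two values for a three-element vector, so the dense form is actually smaller.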