PySpark distinct example

df.select('_c1').distinct().show(5,truncate=False)
+------------------+
|_c1               |
+------------------+
|Chad              |
|Anguilla          |
|Paraguay          |
|Yemen             |
|State of Palestine|
+------------------+
only showing top 5 rows