Research

 

My research goal is to develop a trusted middleware which would facilitate privacy preserving collaborative applications in online business and social communities. To achieve this goal, I have mainly focused on efficient and practical secure multiparty computation protocols, high performance data analysis techniques, privacy sensitive user modeling and context-aware recommendation systems.

 

·         Research Statement

·         Research Work

·         publications

·         Moral Exemplars.

 

     

Publications:

 

  • Waseem Ahmad, Ashfaq Khokhar, “cHawk: A Highly Efficient Biclustering Algorithm Using Weighted Bigraph Crossing Minimization”, Accepted for 2007 VLDBWorkshop on Data Mining in Bioinformatics, Vienna, Austria, 2007 (In Conjunction with VLDB2007 ) [PDF]
  • Waseem Ahmad,  Ashfaq Khokhar, "Privacy Preserving Collaborative Filtering in Ubiquitous Computing Environments", International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P 2007), Vienna, Austria (In Conjunction with VLDB2007 ) [PDF]
  • Waseem Ahmad, Ashfaq Khokhar, “Phoenix: Privacy Preserving Biclustering on Horizontally Partitioned Data amid Malicious Adversaries”, Accepted for ACM SIGKDD International Workshop of Privacy, Security and Trust in KDD, San Jose, 2007( To be held in Conjuction with ACM SIGKDD Conference KDD2007).[PDF]
  • Waseem Ahmad, Ashfaq Khokhar,“Privacy Preserving Collaborative Filtering on Web Portals”, Accepted for The Third International Symposium on Information Assurance and Security (IAS), Manchester, UK, August 2007.[PDF]
  • Waseem Ahmad, Ashfaq Khokhar, “SAMcast: A Scalable, Secure and Authenticated Multicast Protocol for Large Scale P2P Networks”,  In proceedings of IEEE International Communications Conference ICC07, Glasgow, UK, June 2007  [PDF]
  • Waseem Ahmad, Ashfaq Khokhar,“Secure Aggregation on Large Scale Overlay Networks”, Proceedings of  IEEE Global Communications Conference (Globecom 2007), San Francisco, Nove-Dec 2006. [PDF]
  • Waseem Ahmad, Ashfaq Khokhar. “Towards Secure and Privacy Preserving Data Mining over Computational Grids”.  Proceedings of NSF International Workshop on Frontiers of Information Technology Islamabad, Pakistan December 2003 (Invited). [PDF]
  • Waseem Ahmad, Khawar Sajjad, “Unified Navigation Architecture for Hypertext Applications (UNAHA): A Replacement for Traditional Web Browsers. Navigation Facilities”.  Proceedings of IADIS International Conference WWW/Internet (ICWI), Algarve, Portugal. November 2003. [PDF]
  •  Waseem Ahmad,“Efficient Memory Integrity Verification Schemes for Secure Processor”,  Accepted for 3rd IEEE International Conference on Information Technology Applications, December 2004, Sydney Australia. [PDF]
  •  Waseem Ahmad, Ashfaq Khokhar,“TRIUMF: A Trusted Middleware for Fault-tolerant Secure Collaborative Computing”(Invited),  In Proceedings of Third NSF International Workshop on Frontiers of Information Technology, Islamabad, Pakistan, December 2005. [PDF]

 

 

Under Review

 

  • Waseem Ahmad, Ashfaq Khokhar,” SPHier: Scalable Parallel Biclustering Using Weighted Bigraph Crossing Minimizatio, (Under Review). [PDF]

 

TALKS/PRESENTATIONS

 

  • “ContextOracle: A Context-Aware Communication Engine for Smart Phones ”, Motorola Research Labs, Schaumburg. IL, August 2006.
  •  “A framework for Privacy Preserving Distributed Collaborations”, Sun Microsystems Research Labs, Burlington, MA, August 2005.
  •  “Secure Multiparty Computation Applications over Large Scale Distributed Systems (Invited)”, Second NSF International Workshop on Frontiers of Information Technology Islamabad,   Pakistan, December 2004.
  •  “Towards Grid Based High Performance Distributed Bioinformatics Platform”, Presentation given to System Dynamics, Characterization and Control (SDCC) Group at Sun Microsystems, San Diego, CA, August 2004.
  • “TRIUMF: A Trusted Middleware For Context-Aware Collaborations”, 3rd NSF International Workshop on Frontiers of Information Technology, Islamabad, Pakistan December 2005.
  • “Setting up a Linux Based Cluster for High Performance Computing”, Tutorial given at High Performance Computing for CFD Workshop, GIKI, June 2006.

 

 

 

Recent Research Work

 

Below are links to research projects that I’ve been involved in. (Advisor: Prof. Ashfaq Khokhar)

 

  • cHawk: Highly Efficient Biclustering Using Weighted Bigraph Crossing Minimization: Biclustering allows simultaneous clustering of rows and columns. It has been widely used in biological data analysis, text mining and collaborative filtering. Bipartite Spectral partitioning is a powerful technique to achieve biclustering but its computation complexity is prohibitive for applications dealing with large input data. We provide a connection between crossing minimization and spectral partitioning. Theoretical construction of Biclustering model based on crossing minimization is provided. Based on this model, an efficient biclustering algorithm, which is termed as cHawk, is developed. We have evaluated cHawk on both synthetic and real data sets. We show that cHawk is able to identify constant, coherent and overlapped biclusters amid noise with good accuracy. Moreover, it achieves this accuracy with a high degree of computational efficiency.[PDF]

 

    Phoenix: Privacy Preserving Biclustering on Horizontally Partitioned Data amid Malicious Adversaries.  Emerging business models require organizations to collaborate with each other. This collaboration is usually in the form of distributed clustering to find optimal customer targets for effective marketing. This process is hampered by two problems (1) Inability of traditional clustering algorithm in finding local (subspace) patterns in distributed data and (2) Privacy policies of individual organizations limiting the process of information sharing. In this paper, we propose an efficient privacy preserving Biclustering algorithm on horizontally partitioned data, referred to as Phoenix, which solves both of these problems. It assumes a malicious adversary model which is more practical than commonly employed semi-honest adversary model. It is shown to outperform traditional K-means clustering algorithm in identifying local patterns. The distributed secure implementation of the algorithm is evaluated to be very efficient both in computation and communication requirements.[PDF]

 

    SPHier: SPHier: Scalable Parallel Biclustering Using Weighted Bigraph Crossing Minimization: Biclustering is used for discovering correlations among subsets of attributes with subsets of transactions in a transaction database. It has an extensive set of applications ranging from Gene co-regulation analysis, document-keyword clustering and collaborative filtering for online recommendation systems. In this paper, we propose optimal biclustering problem as maximal crossing number reduction in a weighted bipartite graph. Based on the problem formulation, we then present SPHier, a novel parallel biclustering algorithm based on weighted bigraph crossing minimization problem. Crossing minimization has been extensively used in Graph Drawing and VLSI Circuit Layouts for reducing wire congestion while its application to scalable parallel biclustering problem, to the best of our knowledge, is being investigated for the first time in this paper. We show that crossing minimization approach provides a simple and intuitive method to identify bi-clusters. Moreover, it is much easier to parallelize with excellent speedup characteristics. We have validated SPHier on synthetic and biological data sets. We show performance results on an AMD Athlon based 32-node Linux Cluster.[PDF]

 

    TRIUMF:  TRIUMF, the Trusted Middleware for Fault-tolerant secure collaborative computing, is aimed at enabling privacy preserving collaborative processes across administrative domains. The middleware is built around a Services Oriented Architecture comprising of an ensemble of services. The membership services are responsible for secure and authenticated entry into the collaboration process. Data Request Processing Services are responsible for deciding the level of privacy required along with the selection of appropriate aggregation functions and/or proper Secure Multiparty Protocols. Secure Data Access Services are responsible for providing efficient access to private and non-private data sources. Privacy Preserving Data Processing services provide mechanisms whereby secure sharing of data, information or knowledge is made possible. The High Availability services provide mechanisms to ensure fault tolerant execution of applications running on TRIUMF. TRIUMF is being developed on top of JXTA, which provides an open source P2P networking infrastructure.

1.      Secure Aggregation In Large Scale Overlay Networks: Overlay networks have been very useful in solving large scale data dissemination problems. In  this paper, we consider the case of data gathering which is the inverse of dissemination problem. In particular, we focus on a scenario where an organization or a constellation of organizations is interested in gathering data from large number of nodes spread across the administrative boundaries. Providing individual nodes with full assurance that the privacy of their data won’t be compromised is a critical problem in achieving the true benefits of this collaborative process. We provide a novel solution to the problem by employing a homomorphic cryptosystem which allows processing of encrypted data without revealing anything about the underlying private(plain text) data. We also make the cryptosystem ”threshold” so that no single node is able to decrypt the aggregate results. We make use of a hierarchical communication protocol as opposed to a gossip protocol based on nature of the application scenarios that we are addressing. The proposed solution provides excellent scale-up properties while preserving privacy and secrecy of the data even among malicious adversarial constraints. [Globecom06 Paper-PDF]

2.      SAMcast - A Scalable, Secure and Authenticated Multicast Protocol for Large Scale P2P Networks:    Overlay networks have shown tremendous potential in solving large scale data dissemination problem by employing  peer-to-peer communication protocols. These networks, however, have mostly been used for illegal dissemination of copyrighted material. This paper is aimed at investigating an incentive driven approach to encourage users to actively participate in overlay activities. The users are also discouraged from indulging in illegal distribution of copyrighted material by employing an efficient public key based broadcast encryption scheme along with a deterministic traitor tracing mechanism. We note that public key based broadcast encryption schemes require some mechanism by which a peer can verify the integrity of contents downloaded from    other peers. SAMcast is the first protocol, to the best of our knowledge, which provides an efficient integrity verification mechanism along with public key based broadcast encryption. Our experiment results show that the proposed broadcast encryption scheme is highly scalable and the integrity verification is extremely efficient both in terms of computation and communication. [PDF]    

3.      Privacy Preserving Collaborative Filtering:

1.      Pervasive Computing Environments: Collaborative Filtering (CF) is a method to perform Automated Recommendations based upon the assumption that users who had similar interests in past, will have similar interests in future too. Current server based collaborative filtering algorithms pose a serious threat to user privacy. In this paper, we present a novel architecture for privacy preserving collaborative Filtering on a large scale overlay network. The proposed privacy preserving collaborative filtering employs a crossing minimization based efficient biclustering algorithm and a threshold homomorphic cryptosystem for privacy preserving secure multiparty computations. The proposed algorithm is fully implemented and evaluated on a simulated distributed network.[PDF]

2.       For Web Portals and Web 2.0 Applications: Collaborative Filtering (CF) is a method to perform Automated Recommendations based upon the assumption that users who had similar interests in past, will have similar interests in future too. Popularity of e-commerce portals such as Amazon and Ebay and Web 2.0 applications such as YouTube and Flickr is resulting in private user data being stored in central servers. This has given rise to a number of privacy concerns[\ref{cranor}] which are effecting business of these services[See Cyber Dialogue]. In this paper, we present a novel architecture for privacy preserving collaborative Filtering for these services. The proposed architecture attempts to restore user trust in these services by essentially introducing a notion of 'Distributed Trust' where instead of trusting a single server, a coalition of servers is trusted. The proposed privacy preserving collaborative filtering employs a crossing minimization based efficient biclustering algorithm and a threshold homomorphic cryptosystem for privacy preserving secure multiparty computations eliminating the requirement of a single trusted server. The proposed algorithm is fully implemented and evaluated with encouraging results. [PDF]      

  

 

Previous Projects

 

1-      UNAHA: Unified Navigation Architecture for Hypertext Applications (BS Final Year (Senior) Project)

       Advisor: Prof. Shahid H Bokhari.